Discovery of New Words in Tax-related Fields Based on Word Vector Representation,ERICDATA高等教育知識庫
高等教育出版
熱門: 朱丽彬  黃光男  王美玲  王善边  曾瓊瑤  崔雪娟  
高等教育出版
首頁 臺灣期刊   學校系所   學協會   民間出版   大陸/海外期刊   政府機關   學校系所   學協會   民間出版   DOI註冊服務
篇名
Discovery of New Words in Tax-related Fields Based on Word Vector Representation
並列篇名
Discovery of New Words in Tax-related Fields Based on Word Vector Representation
作者 Wei WeiWei LiuBeibei ZhangRafał SchererRobertas Damasevicius
英文摘要

New words detection, as basic research in natural language processing, has gained extensive concern from academic and business communities. When the existing Chinese word segmentation technology is applied in the specific field of tax-related finance, because it cannot correctly identify new words in the field, it will have an impact on subsequent information extraction and entity recognition. Aiming at the current problems in new word discovery, it proposed a new word detection method using statistical features that are based on the inner measurement and branch entropy and then combined with word vector representation. First, perform word segmentation preprocessing on the corpus, calculate the internal cohesion degree of words through statistics of scattered string mutual information, filter out candidate two-tuples, and then filter and expand the two-tuples; next, it locks the boundaries of new words through calculate the branch entropy. Finally, expand the new vocabulary dictionary according to the cosine similarity principle of word vector representation. The unsupervised neologism discovery proposed in this paper allows for automatic growth of the neologism lexicon, experimental results on large-scale corpus verify the effectiveness of this method.

 

起訖頁 923-930
關鍵詞 New word discoveryWord internal combination degreeBoundary degree of freedomWord vector representation
刊名 網際網路技術學刊  
期數 202307 (24:4期)
出版單位 台灣學術網路管理委員會
DOI 10.53106/160792642023072404010   複製DOI
QR Code
該期刊
上一篇
A Neural Network Method for Systematic Evaluation of Informatization Development Level in Smart Court Construction
該期刊
下一篇
Reliability Analysis of Cold-standby Systems with Subsystems Using Conditional Binary Decision Diagrams

高等教育知識庫  新書優惠  教育研究月刊  全球重要資料庫收錄  

教師服務
合作出版
期刊徵稿
聯絡高教
高教FB
讀者服務
圖書目錄
教育期刊
訂購服務
活動訊息
數位服務
高等教育知識庫
國際資料庫收錄
投審稿系統
DOI註冊
線上購買
高點網路書店 
元照網路書店
博客來網路書店
教育資源
教育網站
國際教育網站
關於高教
高教簡介
出版授權
合作單位
知識達 知識達 知識達 知識達 知識達 知識達
版權所有‧轉載必究 Copyright2011 高等教育文化事業股份有限公司  All Rights Reserved
服務信箱:edubook@edubook.com.tw 台北市館前路 26 號 6 樓 Tel:+886-2-23885899 Fax:+886-2-23892500