An E-mail Classification Algorithm based on Stacking Integrated Learning,ERICDATA高等教育知識庫
高等教育出版
熱門: 朱丽彬  黃光男  王美玲  王善边  曾瓊瑤  崔雪娟  
高等教育出版
首頁 臺灣期刊   學校系所   學協會   民間出版   大陸/海外期刊   政府機關   學校系所   學協會   民間出版   DOI註冊服務
閱讀全文
篇名
An E-mail Classification Algorithm based on Stacking Integrated Learning
並列篇名
An E-mail Classification Algorithm based on Stacking Integrated Learning
作者 Li-Xia WanWei-Xing HuangQing-Hua Tang
英文摘要

The text filtering of traditional anti spam system mainly focuses on keyword matching and text fingerprint analysis, which is difficult to accurately identify and classify spam. Therefore, an integrated learning algorithm based on stackin g is proposed in this paper. Firstly, the algorithm takes the manually marked text data of various categories as samples, uses TF-IDF algorithm to train the word vector space model, then selects linear SVC, xgboost and logistic regression algorithm to structure the base classifier, uses random forest algorithm to structure the meta classifier, and combines the stacking ensemble learning algorithm to structure the classification model. It achieves the function of dividing e-mail into five categories: illegal, advertisement, news, bill and recruitment. From the simulation results, the AUC values of the stacking integrated learning classification algorithm for each category are 0.92, 0.95, 1.00, 0.93 and 0.97 respectively, and the AP values are 0.86, 0.88, 1.00, 0.88 and 0.94 respectively, which realizes the high performance and high precision of text classification.

 

起訖頁 105-114
關鍵詞 anti spam systemintegrated learning algorithmTF-IDF algorithmword vector space modele-mail classification
刊名 電腦學刊  
期數 202204 (33:2期)
DOI 10.53106/199115992022043302009   複製DOI
QR Code
該期刊
上一篇
Research on Online and Offline Mixed Education Mode in Post Epidemic Era Based on Fuzzy Neural Network-Taking Introduction of Petrochemical Equipment Management as an Example
該期刊
下一篇
Named Entity Recognition Model Based on TextCNN-BiLSTM-CRF with Chinese Text Classification

高等教育知識庫  新書優惠  教育研究月刊  全球重要資料庫收錄  

教師服務
合作出版
期刊徵稿
聯絡高教
高教FB
讀者服務
圖書目錄
教育期刊
訂購服務
活動訊息
數位服務
高等教育知識庫
國際資料庫收錄
投審稿系統
DOI註冊
線上購買
高點網路書店 
元照網路書店
博客來網路書店
教育資源
教育網站
國際教育網站
關於高教
高教簡介
出版授權
合作單位
知識達 知識達 知識達 知識達 知識達 知識達
版權所有‧轉載必究 Copyright2011 高等教育文化事業股份有限公司  All Rights Reserved
服務信箱:edubook@edubook.com.tw 台北市館前路 26 號 6 樓 Tel:+886-2-23885899 Fax:+886-2-23892500