閱讀全文 | |
篇名 |
Multi-label Scientific Document Classification
|
---|---|
並列篇名 | Multi-label Scientific Document Classification |
作者 | Tariq Ali、Sohail Asghar |
英文摘要 | Scientific document label identification is a significant research area having numerous applications like digital libraries. The author assigns a category or categories to their document manually. Likewise, categories are structured in taxonomy in the form of tree such as ACM CCS. The dilemma becomes more complex when a document belongs to multiple categories. The problem of manual assignment becomes more complicated when the number of expected labels increases. Moreover, the accession schemes are insufficient for solutions with higher accuracy on real scientific document datasets. One way to handle the multi-label classification is to change the problem into a single-label classification. Another way is the variation of the algorithm to handle multi-label classification. The focus of our research is on conversion. Moreover, we propose a solution stimulated from the particle swarm optimization algorithm that can consign a label from the taxonomy. A set of similarity measures is evaluated as well for documentation relatedness that are used in the proposed approach. The designed solution is evaluated on two documents dataset that are retrieved from J. UCS and ACM with an average accuracy of 77 percent as compared to the state of the art algorithms. |
起訖頁 | 1707-1716 |
關鍵詞 | Digital libraries、Multi-label classification、PSO、Text similarity |
刊名 | 網際網路技術學刊 |
期數 | 201811 (19:6期) |
出版單位 | 台灣學術網路管理委員會 |
DOI |
|
QR Code | |
該期刊 上一篇
| Agile Business Intelligence: Combining Big Data and Business Intelligence to Responsive Decision Model |
該期刊 下一篇
| α-Fraction First Strategy for Hierarchical Model in Wireless Sensor Networks |