Proceedings of 2016 International Conference on Modeling, Simulation and Optimization Technologies and Applications (MSOTA2016)

Text Classification Method Based on Machine Learning and Domain Knowledge Ontology

Authors
Zhiyong Gao, Shuhan Qiao, Yongquan Liang
Corresponding Author
Zhiyong Gao
Available Online December 2016.
DOI
10.2991/msota-16.2016.74
Keywords
machine learning; ontology; text classification
Abstract

This paper discusses the use of machine learning to build a corpus from a domain knowledge ontology and to classify text according to the ontology of a professional knowledge domain. A large body of literature has accumulated in every professional field and continues to grow rapidly. This poses a great challenge for researchers: the workload of literature retrieval and reading keeps increasing, and research efficiency suffers. In this paper, the ontology is used as the text feature extractor for storage, processing, classification and retrieval, with the ontology development tools Protégé and Jena and the natural language processing toolkit NLTK, so as to make literature retrieval and reading easier for researchers. The advantage of this text classification method is that the category structure is no longer a single tree: different categories may intersect, and new categories may be formed by grouping.
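The idea of using an ontology as a feature extractor with intersecting categories can be illustrated with a minimal sketch. It is not the authors' implementation: the toy `ONTOLOGY` dictionary, its category names and terms, and the `min_hits` threshold are all hypothetical, and a plain regular-expression tokenizer stands in for NLTK so that the example has no external dependencies.

```python
import re
from collections import defaultdict

# Hypothetical toy ontology: each category (concept) maps to terms that evidence it.
# In the setting of the paper these would come from an ontology built with Protégé;
# the entries below are invented purely for illustration.
ONTOLOGY = {
    "machine_learning": {"classifier", "training", "feature"},
    "ontology_engineering": {"ontology", "concept", "owl"},
    "information_retrieval": {"retrieval", "query", "feature"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens.

    This is a simple stand-in (an assumption) for an NLTK tokenizer.
    """
    return re.findall(r"[a-z]+", text.lower())


def concept_features(text, ontology=ONTOLOGY):
    """Count, per category, how many tokens of the text match that category's terms."""
    counts = defaultdict(int)
    for token in tokenize(text):
        for category, terms in ontology.items():
            if token in terms:
                counts[category] += 1
    return dict(counts)


def classify(text, ontology=ONTOLOGY, min_hits=1):
    """Return every category with at least min_hits matching tokens.

    A text may fall into several categories at once, so the result is a
    set of possibly intersecting labels rather than a single tree node.
    """
    features = concept_features(text, ontology)
    return sorted(c for c, n in features.items() if n >= min_hits)


if __name__ == "__main__":
    doc = "We extract each feature from the ontology before training the classifier."
    print(concept_features(doc))
    print(classify(doc))
```

Because the hypothetical term "feature" belongs to two categories, a single document can be assigned to both, which is the kind of category intersection the abstract describes. Raising `min_hits` makes the assignment stricter.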

Copyright
© 2017, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Volume Title
Proceedings of 2016 International Conference on Modeling, Simulation and Optimization Technologies and Applications (MSOTA2016)
Series
Advances in Computer Science Research
Publication Date
December 2016
ISSN
2352-538X

Cite this article

TY  - CONF
AU  - Zhiyong Gao
AU  - Shuhan Qiao
AU  - Yongquan Liang
PY  - 2016/12
DA  - 2016/12
TI  - Text Classification Method Based on Machine Learning and Domain Knowledge Ontology
BT  - Proceedings of 2016 International Conference on Modeling, Simulation and Optimization Technologies and Applications (MSOTA2016)
PB  - Atlantis Press
SP  - 344
EP  - 347
SN  - 2352-538X
UR  - https://doi.org/10.2991/msota-16.2016.74
DO  - 10.2991/msota-16.2016.74
ID  - Gao2016/12
ER  -