Proceedings of the 5th International Conference on Advanced Design and Manufacturing Engineering

Research on Bayes-based Text Automatic Classification

Authors
Xuan Zhang
Corresponding Author
Xuan Zhang
Available Online October 2015.
DOI
https://doi.org/10.2991/icadme-15.2015.104
Keywords
text automatic classification; Bayes; classification algorithms; feature extraction
Abstract
The Internet holds an enormous amount of information that is both varied and complex. As a result, information retrieval is often undirected, and search results contain too much redundant information. To help users obtain the information they need more effectively, this paper studies a method for the automatic classification of web page text based on the Naive Bayes classification algorithm. With respect to page structure, the paper analyzes in detail which structural components within the page tags are useful for classification. The Naive Bayes algorithm is then applied to classify pages using these effective features of HTML identifiers. By reducing the difficulty of Internet information retrieval, the method makes it easier for users to locate information on the Internet more precisely.
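The approach outlined in the abstract can be illustrated with a minimal Python sketch; it is not the paper's implementation. It assumes a multinomial Naive Bayes model with add-one (Laplace) smoothing, and it treats words inside certain HTML tags as more informative by counting them several times. The tag weights, the names `TagTextExtractor`, `page_features`, and `NaiveBayes`, and the toy training pages are all hypothetical; the abstract does not say which tags are used or how they are weighted.

```python
import math
import re
from collections import Counter, defaultdict
from html.parser import HTMLParser

# Hypothetical weights: words inside these tags count more than body text.
TAG_WEIGHTS = {"title": 3, "h1": 2, "h2": 2, "b": 2, "strong": 2}
SKIPPED_TAGS = {"script", "style"}


class TagTextExtractor(HTMLParser):
    """Collect word counts from a page, weighted by the enclosing tags."""

    def __init__(self):
        super().__init__()
        self.stack = []          # tags that are currently open
        self.counts = Counter()  # weighted term frequencies

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed tags.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        if any(t in SKIPPED_TAGS for t in self.stack):
            return
        weight = max((TAG_WEIGHTS.get(t, 1) for t in self.stack), default=1)
        for word in re.findall(r"[a-z]+", data.lower()):
            self.counts[word] += weight


def page_features(html):
    """Return the weighted word counts of one HTML page."""
    parser = TagTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.counts


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing over page features."""

    def fit(self, pages, labels):
        self.class_docs = Counter(labels)
        self.term_counts = defaultdict(Counter)
        for html, label in zip(pages, labels):
            self.term_counts[label].update(page_features(html))
        self.vocab = {w for c in self.term_counts.values() for w in c}
        return self

    def predict(self, html):
        feats = page_features(html)
        total_docs = sum(self.class_docs.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_docs.items():
            counts = self.term_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for word, freq in feats.items():
                if word in self.vocab:  # ignore words never seen in training
                    score += freq * math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data, invented purely for illustration.
pages = [
    "<html><head><title>Football match</title></head>"
    "<body><p>The team scored a goal</p></body></html>",
    "<title>Basketball game</title><p>The team won the game</p>",
    "<title>Python programming</title><p>Write code with a compiler</p>",
    "<title>Software release</title><p>The code compiles faster</p>",
]
labels = ["sports", "sports", "tech", "tech"]
model = NaiveBayes().fit(pages, labels)
predicted = model.predict("<title>Goal</title><p>the team played a match</p>")
```

Scores are summed in log space to avoid numerical underflow on long pages, and taking the maximum weight over all open tags means a word nested in several weighted tags is counted at the highest applicable weight rather than multiplied.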
Open Access
This is an open access article distributed under the CC BY-NC license.

Proceedings
5th International Conference on Advanced Design and Manufacturing Engineering
Part of series
Advances in Engineering Research
Publication Date
October 2015
ISBN
978-94-6252-113-1
ISSN
2352-5401

Cite this article

TY  - CONF
AU  - Xuan Zhang
PY  - 2015/10
DA  - 2015/10
TI  - Research on Bayes-based Text Automatic Classification
BT  - 5th International Conference on Advanced Design and Manufacturing Engineering
PB  - Atlantis Press
SN  - 2352-5401
UR  - https://doi.org/10.2991/icadme-15.2015.104
DO  - https://doi.org/10.2991/icadme-15.2015.104
ID  - Zhang2015/10
ER  -