Analysis and Research of Several Problems of Bad Short Message Filtering System
Authors
W.F Du, G.X Chen
Corresponding Author
W.F Du
Available Online June 2015.
- DOI
- 10.2991/cisia-15.2015.104How to use a DOI?
- Keywords
- message filtration; unreliable corpus; vector space model; IDF
- Abstract
The spread of bad message seriously affects the social ethos and disrupt the normal life order of people. It has considerable practical value to research and develop the filtering technology of bad short message. Two problems in text classification are studied in this paper, which can be used in the bad short message filtering. The first is the application of clustering method to purify unreliable corpus. Experiment shows that the method is quite obvious on purification effect of unreliable data; The second is about a little improvement of word weight index IDF.
- Copyright
- © 2015, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - W.F Du AU - G.X Chen PY - 2015/06 DA - 2015/06 TI - Analysis and Research of Several Problems of Bad Short Message Filtering System BT - Proceedings of the International Conference on Computer Information Systems and Industrial Applications PB - Atlantis Press SP - 384 EP - 387 SN - 2352-538X UR - https://doi.org/10.2991/cisia-15.2015.104 DO - 10.2991/cisia-15.2015.104 ID - Du2015/06 ER -