Classification of Japanese Documents and Ranking of Representative Documents by Using the Characteristic of the Frequencies of Words
- https://doi.org/10.2991/jrnal.2015.2.3.10How to use a DOI?
- Clustering, Document classification, Extraction of representative document, Frequency of nouns.
We developed a method for classification of Japanese documents and ranking of representative documents by using the characteristic of the frequencies of nouns. A representative document is defined as a document whose feature vector is the closest to the center of gravity of the class in the feature vector space among all documents belonging to the class belonging to the class. The ranking of representative documents is decided in descending order of the number of documents belonging to the class.
- © 2013, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - JOUR AU - Jun Kimura AU - Yasunari Yoshitomi AU - Masayoshi Tabuse PY - 2015 DA - 2015/12/01 TI - Classification of Japanese Documents and Ranking of Representative Documents by Using the Characteristic of the Frequencies of Words JO - Journal of Robotics, Networking and Artificial Life SP - 182 EP - 185 VL - 2 IS - 3 SN - 2352-6386 UR - https://doi.org/10.2991/jrnal.2015.2.3.10 DO - https://doi.org/10.2991/jrnal.2015.2.3.10 ID - Kimura2015 ER -