Classification of Japanese Documents and Ranking of Representative Documents by Using the Characteristic of the Frequencies of Words

Jun Kimura; Yasunari Yoshitomi; Masayoshi Tabuse

doi:10.2991/jrnal.2015.2.3.10

<Previous Article In Issue

Next Article In Issue>

Volume 2, Issue 3, December 2015, Pages 182 - 185

Classification of Japanese Documents and Ranking of Representative Documents by Using the Characteristic of the Frequencies of Words

Authors

Jun Kimura, Yasunari Yoshitomi, Masayoshi Tabuse

Corresponding Author

Jun Kimura

Available Online 1 December 2015.

DOI: 10.2991/jrnal.2015.2.3.10 How to use a DOI?
Keywords: Clustering, Document classification, Extraction of representative document, Frequency of nouns.
Abstract: We developed a method for classification of Japanese documents and ranking of representative documents by using the characteristic of the frequencies of nouns. A representative document is defined as a document whose feature vector is the closest to the center of gravity of the class in the feature vector space among all documents belonging to the class belonging to the class. The ranking of representative documents is decided in descending order of the number of documents belonging to the class.
Copyright: © 2013, the Authors. Published by Atlantis Press.
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

<Previous Article In Issue

Next Article In Issue>

Journal: Journal of Robotics, Networking and Artificial Life
Volume-Issue: 2 - 3
Pages: 182 - 185
Publication Date: 2015/12/01
ISSN (Online): 2352-6386
ISSN (Print): 2405-9021
DOI: 10.2991/jrnal.2015.2.3.10 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - JOUR
AU  - Jun Kimura
AU  - Yasunari Yoshitomi
AU  - Masayoshi Tabuse
PY  - 2015
DA  - 2015/12/01
TI  - Classification of Japanese Documents and Ranking of Representative Documents by Using the Characteristic of the Frequencies of Words
JO  - Journal of Robotics, Networking and Artificial Life
SP  - 182
EP  - 185
VL  - 2
IS  - 3
SN  - 2352-6386
UR  - https://doi.org/10.2991/jrnal.2015.2.3.10
DO  - 10.2991/jrnal.2015.2.3.10
ID  - Kimura2015
ER  -

download .riscopy to clipboard