Proceedings of the 1st International Conference on E-Business Intelligence (ICEBI 2010)

Finding Topics in News Web Pages by Parameter-free Clustering

Authors
Ji Xiang, Neng Gao, Jiwu Jing
Corresponding Author
Ji Xiang
Available Online December 2010.
DOI
10.2991/icebi.2010.2How to use a DOI?
Keywords
Topic Detection, Clustering Algorithm, Parameter Free, Similarity Distribution
Abstract

Topic detection is a novel technology which structures news stories into several topics. Present topic detection approaches are mainly based on clustering algorithms such as single pass or agglomerative clustering, and all these algorithms need at least one input parameter. We proposed a novel clustering algorithm which automatically determines the parameters for each corpus. Experimental results show that the parameters derived are close to optimal, and our algorithm has similar accuracy as the UPGMA algorithm which is manually set with optimal parameters. Another advantage of our algorithm is that it runs much faster than the UPGMA algorithm.

Copyright
© 2010, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 1st International Conference on E-Business Intelligence (ICEBI 2010)
Series
Advances in Intelligent Systems Research
Publication Date
December 2010
ISBN
10.2991/icebi.2010.2
ISSN
1951-6851
DOI
10.2991/icebi.2010.2How to use a DOI?
Copyright
© 2010, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Ji Xiang
AU  - Neng Gao
AU  - Jiwu Jing
PY  - 2010/12
DA  - 2010/12
TI  - Finding Topics in News Web Pages by Parameter-free Clustering
BT  - Proceedings of the 1st International Conference on E-Business Intelligence (ICEBI 2010)
PB  - Atlantis Press
SP  - 8
EP  - 16
SN  - 1951-6851
UR  - https://doi.org/10.2991/icebi.2010.2
DO  - 10.2991/icebi.2010.2
ID  - Xiang2010/12
ER  -