Proceedings of the 2016 4th International Conference on Machinery, Materials and Information Technology Applications

Algorithm of Keywords Extraction about Power Documents Based on Hadoop

Authors
Tong Wang, Yongzhi Wang, Liang Jin, Yongheng Li
Corresponding Author
Tong Wang
Available Online January 2017.
DOI
10.2991/icmmita-16.2016.201How to use a DOI?
Keywords
Big data; Hadoop; HBase; Keywords; Power documents
Abstract

As the power big data is developing with a wide range of varieties, it is used to manage and analyze data, especially the unstructured data such as power documents. In this paper, HBase database and HDFS are applied to make the document searching easier and faster. Keywords extraction based on Hadoop is use to read the document in an easy way. Above all, it deserves management and design algorithm on keywords extraction based on Hadoop.

Copyright
© 2017, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2016 4th International Conference on Machinery, Materials and Information Technology Applications
Series
Advances in Computer Science Research
Publication Date
January 2017
ISBN
10.2991/icmmita-16.2016.201
ISSN
2352-538X
DOI
10.2991/icmmita-16.2016.201How to use a DOI?
Copyright
© 2017, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Tong Wang
AU  - Yongzhi Wang
AU  - Liang Jin
AU  - Yongheng Li
PY  - 2017/01
DA  - 2017/01
TI  - Algorithm of Keywords Extraction about Power Documents Based on Hadoop
BT  - Proceedings of the 2016 4th International Conference on Machinery, Materials and Information Technology Applications
PB  - Atlantis Press
SN  - 2352-538X
UR  - https://doi.org/10.2991/icmmita-16.2016.201
DO  - 10.2991/icmmita-16.2016.201
ID  - Wang2017/01
ER  -