Proceedings of the 2018 International Conference on Computer Modeling, Simulation and Algorithm (CMSA 2018)

Tibetan Character Recognition Based on Machine Learning of K-means Algorithm

Authors
Huiwen Gong, Wei Xiang
Corresponding Author
Huiwen Gong
Available Online April 2018.
DOI
10.2991/cmsa-18.2018.78How to use a DOI?
Keywords
artificial intelligence; machine learning; Tibetan character recognition; Tesseract -OCR; K-means algorithm
Abstract

In this paper, we analyze and extract the Tibetan text features structure based on k-means image character recognition algorithm. Through character library file generated from Tessract-ocr training, we improve the accuracy and recognition of image text recognition and extraction and realize the identification of Tibetan.

Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2018 International Conference on Computer Modeling, Simulation and Algorithm (CMSA 2018)
Series
Advances in Intelligent Systems Research
Publication Date
April 2018
ISBN
10.2991/cmsa-18.2018.78
ISSN
1951-6851
DOI
10.2991/cmsa-18.2018.78How to use a DOI?
Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Huiwen Gong
AU  - Wei Xiang
PY  - 2018/04
DA  - 2018/04
TI  - Tibetan Character Recognition Based on Machine Learning of K-means Algorithm
BT  - Proceedings of the 2018 International Conference on Computer Modeling, Simulation and Algorithm (CMSA 2018)
PB  - Atlantis Press
SP  - 340
EP  - 342
SN  - 1951-6851
UR  - https://doi.org/10.2991/cmsa-18.2018.78
DO  - 10.2991/cmsa-18.2018.78
ID  - Gong2018/04
ER  -