Proceedings of the 2nd International Conference on Science and Social Research (ICSSR 2013)

A Data-driven Approach for Cross Transformation Between Mongolian texts

Authors
Yidemucao Dawa, Niyazbek Muheyat, Amantay Ayjarken
Corresponding Author
Yidemucao Dawa
Available Online July 2013.
DOI
https://doi.org/10.2991/icssr-13.2013.84
Keywords
Mongolian texts; cross language transformation; DP; data driven approach
Abstract
This paper discusses a data-driven approach to transforming text between the different written forms of Mongolian. With the proposed approach, texts can be transcribed or translated between closely related languages, such as the Mongolian scripts used in different regions and countries, as well as Altaic family languages like Uygur Turkic and Kazakh. The approach is implemented with DP (dynamic programming) matching, supported by knowledge-based sequence matching that refers to a multilingual dictionary and by a data-driven use of a target-language corpus. Experimental results show that the proposed method achieves 86.4% transformation accuracy (in F-measure) from NM (Cyrillic) to TM (Traditional Mongolian), which is mainly used in Inner Mongolia, and 91.1% from NM to Todo, which is mainly used in the Xinjiang area of China.
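As a rough illustration of what DP matching and the F-measure involve, the sketch below aligns two token sequences with a standard edit-distance recurrence and computes the harmonic mean of precision and recall. It is not the authors' implementation: the function names `dp_align` and `f_measure`, the unit insertion and deletion costs, and the default 0/1 substitution cost are all assumptions made for illustration. A dictionary-backed system would presumably supply its own `sub_cost`.

```python
def dp_align(source, target, sub_cost=None):
    """Minimum-cost alignment of two sequences by dynamic programming.

    Hypothetical helper: unit insert/delete costs and a 0/1 substitution
    cost are illustrative assumptions, not values from the paper.
    Returns (total_cost, pairs); None in a pair marks a gap.
    """
    if sub_cost is None:
        sub_cost = lambda a, b: 0 if a == b else 1
    n, m = len(source), len(target)
    # cost[i][j] = cheapest alignment of source[:i] with target[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost[i][j] = min(
                cost[i - 1][j - 1] + sub_cost(source[i - 1], target[j - 1]),
                cost[i - 1][j] + 1,  # delete a source symbol
                cost[i][j - 1] + 1,  # insert a target symbol
            )
    # Trace back from the bottom-right cell to recover the aligned pairs.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and cost[i][j]
                == cost[i - 1][j - 1] + sub_cost(source[i - 1], target[j - 1])):
            pairs.append((source[i - 1], target[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            pairs.append((source[i - 1], None))
            i -= 1
        else:
            pairs.append((None, target[j - 1]))
            j -= 1
    pairs.reverse()
    return cost[n][m], pairs


def f_measure(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

Passing a custom `sub_cost` that returns a low cost for symbol pairs listed in a mapping table is one way such a matcher could favor known correspondences between scripts.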
Open Access
This is an open access article distributed under the CC BY-NC license.

Proceedings
2nd International Conference on Science and Social Research (ICSSR 2013)
Part of series
Advances in Intelligent Systems Research
Publication Date
July 2013
ISBN
978-90-78677-75-8
ISSN
1951-6851

Cite this article

TY  - CONF
AU  - Yidemucao Dawa
AU  - Niyazbek Muheyat
AU  - Amantay Ayjarken
PY  - 2013/07
DA  - 2013/07
TI  - A Data-driven Approach for Cross Transformation Between Mongolian texts
BT  - 2nd International Conference on Science and Social Research (ICSSR 2013)
PB  - Atlantis Press
SN  - 1951-6851
UR  - https://doi.org/10.2991/icssr-13.2013.84
DO  - https://doi.org/10.2991/icssr-13.2013.84
ID  - Dawa2013/07
ER  -