A Data-driven Approach for Cross Transformation Between Mongolian texts
Yidemucao Dawa, Niyazbek Muheyat, Amantay Ayjarken
Available Online July 2013.
- https://doi.org/10.2991/icssr-13.2013.84How to use a DOI?
- Mongolian texts; cross language transformation; DP; data driven approach
- This paper discusses a data-driven approach to transforming different graphic texts of Mongolian. Using the proposed approach, it is possible to transcribe or translate texts between similar languages such as Mongolian graphic texts used in different regions and countries, as well as the Altaic family languages like Uygur Turkic and Kazakh. The approach has been implemented based on DP (dynamic programming) matching supported by the knowledge-based sequence matching, referred to a multilingual dictionary and a data-driven approach of the target language corpus. Experimental results demonstrate that the proposed method achieves 86.4% transformation accuracy (in F-measure) for the NM (Cyrillic) to the TM (Traditional Mongolian) mainly used in the inner Mongolia, and 91.1% NM to Todo, which is mainly used in Xinjiang areas in China.
- Open Access
- This is an open access article distributed under the CC BY-NC license.
Cite this article
TY - CONF AU - Yidemucao Dawa AU - Niyazbek Muheyat AU - Amantay Ayjarken PY - 2013/07 DA - 2013/07 TI - A Data-driven Approach for Cross Transformation Between Mongolian texts BT - 2nd International Conference on Science and Social Research (ICSSR 2013) PB - Atlantis Press SN - 1951-6851 UR - https://doi.org/10.2991/icssr-13.2013.84 DO - https://doi.org/10.2991/icssr-13.2013.84 ID - Dawa2013/07 ER -