Projected Characteristics and Content of Arabic Corpus in Indonesia
- Nur Hizbullah, Muchlis Madian Muhammad
- Corresponding Author
- Nur Hizbullah
Available Online December 2017.
- https://doi.org/10.2991/icclas-17.2018.42How to use a DOI?
- Arabic Corpus; corpus characteristic; corpus content; comparative corpus
- Utilization and integration between linguistics and information and communication technology produce a result in the form of a language corpus. Corpus is a collection of data prepared systemically and is developed in such a way to be used as research data. In general, the content of a corpus relates to the purpose preparation of the corpus itself in the context of linguistic researches. In addition, the corpus' content relates to the availability of data materials to be included in the corpus. With its long history and wide coverage of Arabic teaching in Indonesia, there are quite a plenty of materials and data on and in Arabic language that can be documented and compiled to be used as corpus. This ascertains that Arabic Corpus in Indonesia will be filled by various data materials. Under the descriptive- comparative method, this paper will describe various types of Arabic corpus, particularly the aspect of corpus content and compare the content in the corpus and the predicted availability of content materials in the context of the plan to prepare Arabic Corpus in Indonesia. By referring to the existing corpus, it can be projected that the Arabic Corpus to be made in Indonesia is a regional and diachronically corpus. This corpus contains seven distinct classifications in accordance with the availability of data in the field. Hence, the effort of drafting this corpus is important and strategic in order to make a documentation of the Arabic linguistic data that is real produced by the Indonesian speakers and this corpus will be able to showcase the richness of Arabic language in Indonesia for use to develop research in various Arabic studies in future.
- Open Access
- This is an open access article distributed under the CC BY-NC license.
Cite this article
TY - CONF AU - Nur Hizbullah AU - Muchlis Madian Muhammad PY - 2017/12 DA - 2017/12 TI - Projected Characteristics and Content of Arabic Corpus in Indonesia BT - International Conference on Culture and Language in Southeast Asia (ICCLAS 2017) PB - Atlantis Press SN - 2352-5398 UR - https://doi.org/10.2991/icclas-17.2018.42 DO - https://doi.org/10.2991/icclas-17.2018.42 ID - Hizbullah2017/12 ER -