Proceedings of the International Conference on Culture and Language in Southeast Asia (ICCLAS 2017)

Projected Characteristics and Content of Arabic Corpus in Indonesia

Authors
Nur Hizbullah, Muchlis Madian Muhammad
Corresponding Author
Nur Hizbullah
Available Online December 2017.
DOI
https://doi.org/10.2991/icclas-17.2018.42How to use a DOI?
Keywords
Arabic Corpus; corpus characteristic; corpus content; comparative corpus
Abstract
Utilization and integration between linguistics and information and communication technology produce a result in the form of a language corpus. Corpus is a collection of data prepared systemically and is developed in such a way to be used as research data. In general, the content of a corpus relates to the purpose preparation of the corpus itself in the context of linguistic researches. In addition, the corpus' content relates to the availability of data materials to be included in the corpus. With its long history and wide coverage of Arabic teaching in Indonesia, there are quite a plenty of materials and data on and in Arabic language that can be documented and compiled to be used as corpus. This ascertains that Arabic Corpus in Indonesia will be filled by various data materials. Under the descriptive- comparative method, this paper will describe various types of Arabic corpus, particularly the aspect of corpus content and compare the content in the corpus and the predicted availability of content materials in the context of the plan to prepare Arabic Corpus in Indonesia. By referring to the existing corpus, it can be projected that the Arabic Corpus to be made in Indonesia is a regional and diachronically corpus. This corpus contains seven distinct classifications in accordance with the availability of data in the field. Hence, the effort of drafting this corpus is important and strategic in order to make a documentation of the Arabic linguistic data that is real produced by the Indonesian speakers and this corpus will be able to showcase the richness of Arabic language in Indonesia for use to develop research in various Arabic studies in future.
Open Access
This is an open access article distributed under the CC BY-NC license.

Download article (PDF)

Cite this article

TY  - CONF
AU  - Nur Hizbullah
AU  - Muchlis Madian Muhammad
PY  - 2017/12
DA  - 2017/12
TI  - Projected Characteristics and Content of Arabic Corpus in Indonesia
BT  - International Conference on Culture and Language in Southeast Asia (ICCLAS 2017)
PB  - Atlantis Press
UR  - https://doi.org/10.2991/icclas-17.2018.42
DO  - https://doi.org/10.2991/icclas-17.2018.42
ID  - Hizbullah2017/12
ER  -