1 |
Automatic identification methods on a corpus of twenty five fine-grained Arabic dialects
|
|
|
|
In: Arabic Language Processing: From Theory to Practice7th International Conference, ICALP 2019, Nancy, France, October 16–17, 2019, Proceedings ; https://hal.archives-ouvertes.fr/hal-02314245 ; Arabic Language Processing: From Theory to Practice 7th International Conference, ICALP 2019, Nancy, France, October 16–17, 2019, Proceedings, Communications in Computer and Information Science book series (CCIS, volume 1108), 2019, ⟨10.1007/978-3-030-32959-4_6⟩ (2019)
|
|
Abstract:
International audience ; This research deals with Arabic dialect identification, a challenging issue related to Arabic NLP. Indeed, the increasing use of Arabic dialects in a written form especially in social media generates new needs in the area of Arabic dialect processing. For discriminating between dialects in a multi-dialect context, we use different approaches based on machine learning techniques. To this end, we explored several methods. We used a classification method based on symmetric Kullback-Leibler, and we experimented classical classification methods such as Naive Bayes Classifiers and more sophisticated methods like Word2Vec and Long Short-Term Memory neural network. We tested our approaches on a large database of 25 Arabic dialects in addition to MSA.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Arabic dialects; Automatic dialect identification; Dialect resources; Parallel dialectal corpora
|
|
URL: https://doi.org/10.1007/978-3-030-32959-4_6 https://hal.archives-ouvertes.fr/hal-02314245/document https://hal.archives-ouvertes.fr/hal-02314245/file/paper-15-icalp2019.pdf https://hal.archives-ouvertes.fr/hal-02314245
|
|
BASE
|
|
Hide details
|
|
2 |
The SMarT Classifier for Arabic Fine-Grained Dialect Identification
|
|
|
|
In: MADAR Shared Task: Arabic Fine-Grained Dialect Identification Dialect identification campaign ; The Fourth Arabic Natural Language Processing Workshop co-located with ACL ; https://hal.archives-ouvertes.fr/hal-02166384 ; The Fourth Arabic Natural Language Processing Workshop co-located with ACL, Aug 2019, Florence, Italy (2019)
|
|
BASE
|
|
Show details
|
|
3 |
Script Independent Morphological Segmentation for Arabic Maghrebi Dialects: An Application to Machine Translation
|
|
|
|
In: ISSN: 1405-5546 ; EISSN: 2007-9737 ; Computación y sistemas ; https://hal.archives-ouvertes.fr/hal-02274533 ; Computación y sistemas, Instituto Politécnico Nacional IPN Centro de Investigación en Computación, In press, 23 (3), pp.979-989. ⟨10.13053/cys-23-3-3267⟩ (2019)
|
|
BASE
|
|
Show details
|
|
4 |
Integrating Dialects and Dialectology in the Curriculum of Teaching Arabic As a Foreign Language (TAFL)
|
|
|
|
BASE
|
|
Show details
|
|
5 |
The phonology and micro-typology of Arabic R
|
|
|
|
In: Glossa: a journal of general linguistics; Vol 4, No 1 (2019); 131 ; 2397-1835 (2019)
|
|
BASE
|
|
Show details
|
|
|
|