DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
An Online Readability Leveled Arabic Thesaurus ...
BASE
Show details
2
Exploiting Arabic Diacritization for High Quality Automatic Annotation
In: Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-01349206 ; Language Resources and Evaluation Conference, 2016, Portoroz, Slovenia (2016)
Abstract: International audience ; We present a novel technique for Arabic morphological annotation. The technique utilizes diacritization to produce morphological annotations of quality comparable to human annotators. Although Arabic text is generally written without diacritics, diacritization is already available for large corpora of Arabic text in several genres. Furthermore, diacritization can be generated at a low cost for new text as it does not require specialized training beyond what educated Arabic typists know. The basic approach is to enrich the input to a state-of-the-art Arabic morphological analyzer with word diacritics (full or partial) to enhance its performance. When applied to fully diacritized text, our approach produces annotations with an accuracy of over 97% on lemma, part-of-speech, and tokenization combined.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Annotation; Arabic; Diacritization; Morphology
URL: https://hal.archives-ouvertes.fr/hal-01349206/document
https://hal.archives-ouvertes.fr/hal-01349206
https://hal.archives-ouvertes.fr/hal-01349206/file/LREC-2016-Exploiting-Diacritization.pdf
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern