1 |
تشْكيل الكَلِمات عَبْرَ مَوارد حاسوبيّة ; Restoring Arabic vowels through omission-tolerant dictionary lookup
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02113751 ; Language Resources and Evaluation, Springer Verlag, 2020, 54, pp.487-551. ⟨10.1007/s10579-019-09464-6⟩ ; https://link.springer.com/article/10.1007/s10579-019-09464-6 (2020)
|
|
BASE
|
|
Show details
|
|
3 |
Why Microsoft Arabic Spell checker is ineffective
|
|
|
|
In: ISSN: 0851-6774 ; Linguistica Communicatio ; https://hal.archives-ouvertes.fr/hal-01081965 ; Linguistica Communicatio, http://www.al-erfan.com/, 2014, Arabic Language in Information Technology, 16, pp.55 ; http://www.al-erfan.com/ (2014)
|
|
BASE
|
|
Show details
|
|
5 |
الوزن والجذر في الصرف غير الإشتقاقي: جموع التكسير في اللغة العربية ; Pattern-and-root inflectional morphology: the Arabic broken plural
|
|
|
|
In: ISSN: 0388-0001 ; Language Sciences ; https://hal.archives-ouvertes.fr/hal-00831338 ; Language Sciences, Elsevier, 2013, 40, pp.221-250. ⟨10.1016/j.langsci.2013.06.002⟩ (2013)
|
|
BASE
|
|
Show details
|
|
6 |
A lexicon of Arabic verbs constructed on the basis of Semitic taxonomy and using finite-state transducers
|
|
|
|
In: WoLeR 2011 at ESSLLI International Workshop on Lexical Resources ; https://halshs.archives-ouvertes.fr/halshs-01186723 ; WoLeR 2011 at ESSLLI International Workshop on Lexical Resources, Aug 2011, Ljubliana, Slovenia ; http://alpage.inria.fr/~sagot/woler2011 (2011)
|
|
Abstract:
International audience ; We describe a lexicon of Arabic verbs constructed on the basis of Semitic patterns and used in a resource-based method of morphological annotation of written Arabic text. The annotated output is a graph of morphemes with accurate linguistic information. An enhanced FST implementation for Semitic languages was created. This system is adapted also for generating inflected forms. The language resources can be easily updated. The lexicon is constituted of 15 400 verbal entries. We propose an inflectional taxonomy that increases the lexicon readability and maintainability for Arabic speakers and lingui sts. Traditional grammar defines inflectional verbal classes by using verbal pattern-classes and root-classes, related to the nature of each of the triliteral root-consonants. Verbal pattern-classes are clearly defined but root-classes are complex. In our taxonomy, traditional pattern-classes are reused and root-classes are simply redefined. Our taxonomy provides a straightforward encoding scheme for inflectional variations and orthographic adjustments due to assimilation and agglutination. We have tested and evaluated our resource against 10 000 diacriticized verb occurrences in the Nemlar corpus and compared it to Buckwalter resources. The lexical coverage is 99.9 % and a laptop needs two minutes in order to generate and compress the inflected lexicon of 2.5 million forms into 4 Megabytes.
|
|
Keyword:
[SHS.INFO]Humanities and Social Sciences/Library and information sciences; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Arabic; Conjugation; Morphology; NLP Processing; Semitic
|
|
URL: https://halshs.archives-ouvertes.fr/halshs-01186723/document https://halshs.archives-ouvertes.fr/halshs-01186723 https://halshs.archives-ouvertes.fr/halshs-01186723/file/A_lexicon_of_Arabic_verbs_constructed_on.pdf
|
|
BASE
|
|
Hide details
|
|
|
|