
Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
THE PLACE DEIXIS OF MODERN STANDARD ARABIC: A CLOSER LOOK AT THE DIMENSIONAL SYSTEM AND THE FACTORS THAT CONTROL THE CHOICE OF PLACE DEICTIC EXPRESSIONS
BASE
2
A Framework for the Study of the Spread of English in Algeria: A Peaceful Transition to a Better Linguistic Environment
In: http://rave.ohiolink.edu/etdc/view?acc_num=toledo1333655702 (2012)
BASE
3
Sana’ani Dialect to Modern Standard Arabic: Rule-based Direct Machine Translation
In: http://world-comp.org/p2011/ICA8011.pdf
Abstract: This paper presents an algorithm to normalize the Sana’ani dialect to Modern Standard Arabic (MSA). The algorithm is based on morphological rules for both the Sana’ani dialect and MSA; these rules drive the conversion of dialect text to its MSA equivalent. The algorithm tokenizes the input dialect text and analyses each token into a stem and its affixes. The affixes may be dialect affixes, MSA affixes, or both, and the stem may likewise be a dialect stem or an MSA stem, so the algorithm must handle each of these cases. It does so by first applying a simple MSA stemmer, then applying a dialect stemmer to strip the resulting token and extract the dialect affixes. At this point, the rules decide when an affix should be extracted. The experiment shows that the Sana’ani dialect has three classes of distortion: prefix, suffix, and stem distortions. The algorithm normalizes these distortions using the morphological rules. For each rule, the algorithm checks whether the rule can be applied: if the rule's conditions are met, the dialect affix is replaced by its MSA counterpart. If there is no restriction on applying the rule related to the distorted stem, then the rule can be considered as a parallel corpus of dialect and MSA. Finally, the experiment computes the distortion ratio of MSA in the Sana’ani dialect. In a Sana’ani dialect sample of 529 words, 6.98% have distorted suffixes, 5.65% have distorted prefixes, and 11.72% contain distorted stems. These percentages relate to processed words only.
Keyword: Modern Standard Arabic; MSA; NLP; Rule-based Machine translation; Sana’a Dialect; Yemeni
URL: http://world-comp.org/p2011/ICA8011.pdf
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.217.6513
BASE
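The rule-application step described in the abstract of hit 3 can be pictured with a minimal Python sketch. This is not the authors' implementation: the rule tables, the `MIN_STEM_LEN` condition, and all function names are hypothetical, and the affix strings (`dp`, `ds`, `dstem`) are Latin placeholders, not real Sana’ani or MSA morphology. The preliminary MSA stemmer mentioned in the abstract is left out for brevity.

```python
from dataclasses import dataclass

# Hypothetical toy rules: placeholder strings, NOT real Sana'ani/MSA affixes.
PREFIX_RULES = {"dp": "mp"}       # dialect prefix -> MSA prefix
SUFFIX_RULES = {"ds": "ms"}       # dialect suffix -> MSA suffix
STEM_RULES = {"dstem": "mstem"}   # dialect stem   -> MSA stem
MIN_STEM_LEN = 2                  # assumed rule condition: keep a usable stem


@dataclass
class Normalized:
    word: str
    prefix_changed: bool
    suffix_changed: bool
    stem_changed: bool


def normalize_token(token: str) -> Normalized:
    """Strip dialect affixes when the rule condition holds, then rebuild."""
    prefix, suffix, stem = "", "", token
    prefix_changed = suffix_changed = stem_changed = False

    for dialect, msa in PREFIX_RULES.items():
        if stem.startswith(dialect) and len(stem) - len(dialect) >= MIN_STEM_LEN:
            prefix, stem, prefix_changed = msa, stem[len(dialect):], True
            break

    for dialect, msa in SUFFIX_RULES.items():
        if stem.endswith(dialect) and len(stem) - len(dialect) >= MIN_STEM_LEN:
            suffix, stem, suffix_changed = msa, stem[:-len(dialect)], True
            break

    if stem in STEM_RULES:
        stem, stem_changed = STEM_RULES[stem], True

    return Normalized(prefix + stem + suffix,
                      prefix_changed, suffix_changed, stem_changed)


def normalize(text: str) -> list[Normalized]:
    """Tokenize on whitespace and normalize each token."""
    return [normalize_token(tok) for tok in text.split()]


def distortion_ratios(results: list[Normalized]) -> dict[str, float]:
    """Share of processed tokens with a changed prefix, suffix, or stem."""
    n = len(results)
    if n == 0:
        return {"prefix": 0.0, "suffix": 0.0, "stem": 0.0}
    return {
        "prefix": sum(r.prefix_changed for r in results) / n,
        "suffix": sum(r.suffix_changed for r in results) / n,
        "stem": sum(r.stem_changed for r in results) / n,
    }


results = normalize("dpbaseds plain dstem")
print([r.word for r in results])
print(distortion_ratios(results))
```

The three ratios are counted separately, mirroring the way the abstract reports prefix, suffix, and stem distortions as distinct percentages; a real system would replace the placeholder tables with the paper's actual morphological rules.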

© 2013 - 2024 Lin|gu|is|tik