DE eng

Search in the Catalogues and Directories

Hits 1 – 4 of 4

1
A Transformer-Based Neural Machine Translation Model for Arabic Dialects That Utilizes Subword Units
In: Sensors ; Volume 21 ; Issue 19 (2021)
Abstract: Languages that allow free word order, such as Arabic dialects, are of significant difficulty for neural machine translation (NMT) because of many scarce words and the inefficiency of NMT systems to translate these words. Unknown Word (UNK) tokens represent the out-of-vocabulary words for the reason that NMT systems run with vocabulary that has fixed size. Scarce words are encoded completely as sequences of subword pieces employing the Word-Piece Model. This research paper introduces the first Transformer-based neural machine translation model for Arabic vernaculars that employs subword units. The proposed solution is based on the Transformer model that has been presented lately. The use of subword units and shared vocabulary within the Arabic dialect (the source language) and modern standard Arabic (the target language) enhances the behavior of the multi-head attention sublayers for the encoder by obtaining the overall dependencies between words of input sentence for Arabic vernacular. Experiments are carried out from Levantine Arabic vernacular (LEV) to modern standard Arabic (MSA) and Maghrebi Arabic vernacular (MAG) to MSA, Gulf–MSA, Nile–MSA, Iraqi Arabic (IRQ) to MSA translation tasks. Extensive experiments confirm that the suggested model adequately addresses the unknown word issue and boosts the quality of translation from Arabic vernaculars to Modern standard Arabic (MSA).
Keyword: Arabic dialects; modern standard Arabic; multi-head attention; neural machine translation (NMT); self-attention; shared vocabulary; subword units; transformer
URL: https://doi.org/10.3390/s21196509
BASE
Hide details
2
Integrating Dialects and Dialectology in the Curriculum of Teaching Arabic As a Foreign Language (TAFL)
Özkan, H. (Hakan). - 2019
BASE
Show details
3
Creating Parallel Arabic Dialect Corpus: Pitfalls to Avoid
In: 18th International Conference on Computational Linguistics and Intelligent Text Processing (CICLING) ; https://hal.archives-ouvertes.fr/hal-01557405 ; 18th International Conference on Computational Linguistics and Intelligent Text Processing (CICLING), Apr 2017, Budapest, Hungary (2017)
BASE
Show details
4
Quel arabe pour demain ? Les derniers avatars d'une controverse millénaire
In: L'arabe moderne : Péripéties et enjeux ; https://halshs.archives-ouvertes.fr/halshs-01970199 ; Nejmeddine Khalfallah. L'arabe moderne : Péripéties et enjeux, Harmattan, 2015, 978-2-343-0490-52 (2015)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
4
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern