DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7...2.943
Hits 41 – 60 of 58.846

41
Peter Tangi and Jenny Kusum Gim - Matukar borders
BASE
Show details
42
Julie Paina Biographical Information
Danielle Barth; Julie Paina. - 2023-09-11
BASE
Show details
43
Wiliang Yasung and Cathy Samun Wiliang Biographical Information
BASE
Show details
44
Gilai Kubod and Veronica Kubod Biographical Information
BASE
Show details
45
Edddie Gim and Nancy Nabog 50 German Pictures
BASE
Show details
46
Sapak Magop and Makom Kubod Biographical Information
BASE
Show details
47
John Bogg and Berry Kuyau Biographical Information
BASE
Show details
48
Eddie Gim and Nancy Nabog Biographical Information
BASE
Show details
49
Jenny Kusum Gim and Peter Tangi Biographical Information
BASE
Show details
50
Anna Dagui and Augusta Silim Manam Life Stories
BASE
Show details
51
Gideon Kuyau and John Agid 50 German Pictures
BASE
Show details
52
Offline Corpus Augmentation for English-Amharic Machine Translation
In: 2022 The 5th International Conference on Information and Computer Technologies ; https://hal.archives-ouvertes.fr/hal-03547539 ; 2022 The 5th International Conference on Information and Computer Technologies, Mar 2022, New York, United States (2022)
Abstract: International audience ; The purpose of this study was to investigate the effect of corpus augmentation on the quality of English-Amharic Machine Translation (MT). In fact, trigram and four-gram Statistical Machine Translation (SMT) language models, as well as Neural Machine Translation (NMT) models based on Gated Recurrent Units (GRU) were used. They were trained independently using both the original and augmented corpus to see how the augmentation of the corpus affects the translation quality of these models. These two corpora (original and augmented) contain 225,304 and 463,796 English-Amharic parallel sentences respectively. To complete the corpus augmentation challenge, an offline token level tokenization technique was used. This technique (corpus augmentation) was used before any other MT processes were started. Among several token-level tokenization mechanisms, random insertion, replacement, deletion, and swapping approaches were chosen and implemented. After both models had been trained, the Bilingual Evaluation Understudy (BLEU) ratings were collected and analyzed. Our results demonstrate that the models trained with the augmented corpus outperform their corresponding models (models trained with the original corpus) in terms of BLEU scores. As a result, we can conclude that corpus augmentation did indeed help in the improvement of the performance of both SMT and NMT translation systems.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Amharic language; Corpus Augmentation; GRU; Machine Translation; NMT; SMT; Token level augmentation
URL: https://hal.archives-ouvertes.fr/hal-03547539
https://hal.archives-ouvertes.fr/hal-03547539/file/ICICT2022Augmented_corpusFinal%20Draft.pdf
https://hal.archives-ouvertes.fr/hal-03547539/document
BASE
Hide details
53
Within and Beyond Stereotypes of Arab Women: A Corpus-based Approach to Jordanian Women’s Portrayal in English Digital News
In: Journal of International Women's Studies (2022)
BASE
Show details
54
Positive Experiences, Dreams, and Expectations of International Master’s Students at a Southern Ontario University: An Appreciative Inquiry
Ankomah, William Sarfo. - : Brock University, 2022
BASE
Show details
55
"Into"-causatives in world Englishes
In: English world-wide. - Amsterdam [u.a.] : Benjamins 43 (2022) 1, 1-32
BLLDB
Show details
56
LEXICAL RESTRICTIONS ON GRAMMATICAL RELATIONS IN VOICE CONSTRUCTIONS (NORTHERN AMIS) ; Linguistique et typologie
In: ISSN: 2196-7148 ; STUF - Language Typology and Universals ; https://halshs.archives-ouvertes.fr/halshs-03483275 ; STUF - Language Typology and Universals , De Gruyter, In press (2022)
BASE
Show details
57
Frequency norms in Tashlhiyt, Part I
In: https://halshs.archives-ouvertes.fr/halshs-03511109 ; 2022 (2022)
BASE
Show details
58
From FreEM to D'AlemBERT ; From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French
In: Proceedings of the 13th Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-03596653 ; Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, Jun 2022, Marseille, France (2022)
BASE
Show details
59
Des corpus de textes pour développer le lexique des affects en FLE
In: Séminaire Modern Language Center ; https://hal.archives-ouvertes.fr/hal-03630507 ; Séminaire Modern Language Center, King's College London, Mar 2022, London, Royaume-Uni ; https://www.kcl.ac.uk/modern-language-centre (2022)
BASE
Show details
60
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
In: https://hal.inria.fr/hal-03536361 ; 2022 (2022)
BASE
Show details

Page: 1 2 3 4 5 6 7...2.943

Catalogues
916
11
1.197
0
0
17
55
Bibliographies
6.481
20
0
0
5
0
479
60
21
Linked Open Data catalogues
0
Online resources
501
49
35
0
Open access documents
51.122
28
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern