1 |
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages ...
|
|
|
|
Abstract:
Reproducible benchmarks are crucial in driving progress of machine translation research. However, existing machine translation benchmarks have been mostly limited to high-resource or well-represented languages. Despite an increasing interest in low-resource machine translation, there are no standardized reproducible benchmarks for many African languages, many of which are used by millions of speakers but have less digitized textual data. To tackle these challenges, we propose AfroMT, a standardized, clean, and reproducible machine translation benchmark for eight widely spoken African languages. We also develop a suite of analysis tools for system diagnosis taking into account the unique properties of these languages. Furthermore, we explore the newly considered case of low-resource focused pretraining and develop two novel data augmentation-based strategies, leveraging word-level alignment information and pseudo-monolingual data for pretraining multilingual sequence-to-sequence models. We demonstrate ... : EMNLP 2021 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2109.04715 https://arxiv.org/abs/2109.04715
|
|
BASE
|
|
Hide details
|
|
4 |
GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
DEEP: DEnoising Entity Pre-training for Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Explicit Alignment Objectives for Multilingual Bidirectional Encoders ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Phrase-level Active Learning for Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Explicit Alignment Objectives for Multilingual Bidirectional Encoders ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
On Learning Language-Invariant Representations for Universal Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Domain Adaptation of Neural Machine Translation by Lexicon Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Rapid Adaptation of Neural Machine Translation to New Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Structural Embedding of Syntactic Trees for Machine Comprehension ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Learning Lexical Entries for Robotic Commands using Crowdsourcing ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|