1 |
Multilingual Unsupervised Sentence Simplification
|
|
|
|
In: https://hal.inria.fr/hal-03109299 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Text Generation with and without Retrieval ; Génération de textes basés sur la connaissance avec et sans recherche
|
|
|
|
In: https://hal.univ-lorraine.fr/tel-03542634 ; Computer Science [cs]. Université de Lorraine, 2021. English. ⟨NNT : 2021LORR0164⟩ (2021)
|
|
BASE
|
|
Show details
|
|
3 |
The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation ...
|
|
Goyal, Naman; Gao, Cynthia; Chaudhary, Vishrav; Chen, Peng-Jen; Wenzek, Guillaume; Ju, Da; Krishnan, Sanjana; Ranzato, Marc'Aurelio; Guzman, Francisco; Fan, Angela. - : arXiv, 2021
|
|
Abstract:
One of the biggest challenges hindering progress in low-resource and multilingual machine translation is the lack of good evaluation benchmarks. Current evaluation benchmarks either lack good coverage of low-resource languages, consider only restricted domains, or are low quality because they are constructed using semi-automatic procedures. In this work, we introduce the FLORES-101 evaluation benchmark, consisting of 3001 sentences extracted from English Wikipedia and covering a variety of different topics and domains. These sentences have been translated in 101 languages by professional translators through a carefully controlled process. The resulting dataset enables better assessment of model quality on the long tail of low-resource languages, including the evaluation of many-to-many multilingual translation systems, as all translations are multilingually aligned. By publicly releasing such a high-quality and high-coverage dataset, we hope to foster progress in the machine translation community and beyond. ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2106.03193 https://arxiv.org/abs/2106.03193
|
|
BASE
|
|
Hide details
|
|
6 |
Findings of the AmericasNLP 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Alternative Input Signals Ease Transfer in Multilingual Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Multilingual AMR-to-Text Generation
|
|
|
|
In: 2020 Conference on Empirical Methods in Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-02999676 ; 2020 Conference on Empirical Methods in Natural Language Processing, Nov 2020, Punta Cana, Dominican Republic (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Augmenting Transformers with KNN-Based Composite Memory for Dialog
|
|
|
|
In: EISSN: 2307-387X ; Transactions of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02999678 ; Transactions of the Association for Computational Linguistics, The MIT Press, In press, ⟨10.1162/tacl_a_00356⟩ ; https://transacl.org/index.php/tacl (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Multilingual Translation with Extensible Multilingual Pretraining and Finetuning ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Beyond English-Centric Multilingual Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|