1.
Using Interlinear Glosses as Pivot in Low-Resource Multilingual Machine Translation ...
Abstract:
We demonstrate a new approach to Neural Machine Translation (NMT) for low-resource languages using a ubiquitous linguistic resource, Interlinear Glossed Text (IGT). IGT represents a non-English sentence as a sequence of English lemmas and morpheme labels. As such, it can serve as a pivot or interlingua for NMT. Our contribution is four-fold. Firstly, we pool IGT for 1,497 languages in ODIN (54,545 glosses) and 70,918 glosses in Arapaho and train a gloss-to-target NMT system from IGT to English, with a BLEU score of 25.94. We introduce a multilingual NMT model that tags all glossed text with gloss-source language tags and train a universal system with shared attention across 1,497 languages. Secondly, we use the IGT gloss-to-target translation as a key step in an English-Turkish MT system trained on only 865 lines from ODIN. Thirdly, we present five metrics for evaluating extremely low-resource translation when BLEU is no longer sufficient and evaluate the Turkish low-resource system using BLEU and also ...
Keywords:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.1911.02709 https://arxiv.org/abs/1911.02709
BASE
2.
Expanding the domain of a multi-lingual speech-to-speech translation system
3.
Multi-lingual translation of spontaneously spoken language in a limited domain
4.
A Multi-Perspective evaluation of the NESPOLE! Speech-to-Speech Translation System
In: ACL'02 Workshop on Speech-to-Speech Translation: Algorithms and Systems, ACL, Jun 2002, Philadelphia, Pennsylvania, United States. 9 p. (2002) ; https://hal.inria.fr/inria-00326403
5.
Enhancing the Usability and Performance of Nespole! - a Real-World Speech-to-Speech Translation System
In: Human Language Technologies 2002, Mar 2002, San Diego, California, United States. 6 p. (2002) ; https://hal.inria.fr/inria-00326412
10.
Multi-lingual translation of spontaneously spoken language in a limited domain ...
11.
Expanding the domain of a multi-lingual speech-to-speech translation system ...