1 |
Revisiting Multi-Domain Machine Translation
|
|
|
|
In: EISSN: 2307-387X ; Transactions of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03159743 ; Transactions of the Association for Computational Linguistics, The MIT Press, 2021, 9, pp.17-35 (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Controlling Utterance Length in NMT-based Word Segmentation with Attention
|
|
|
|
In: International Workshop on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-02343206 ; International Workshop on Spoken Language Translation, Nov 2019, Hong-Kong, China (2019)
|
|
Abstract:
International audience ; One of the basic tasks of computational language documentation (CLD) is to identifyword boundaries in an unsegmented phonemic stream. While several unsupervisedmonolingual word segmentation algorithms exist in the literature,they are challenged in real-world CLD settings by the small amount of availabledata. A possible remedy is to take advantage of glosses or translation in a foreign,well-resourced, language, which often exist for such data. In this paper, we explore and compareways to exploit neural machine translation models to perform unsupervised boundary detection with bilingual information, notably introducing a new loss function for jointly learning alignment and segmentation. We experiment with an actual under-resourced language, Mboshi, and show that these techniques can effectively control the output segmentation length.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Computational Language Documentation; Machine Translation; Word Segmentation
|
|
URL: https://hal.archives-ouvertes.fr/hal-02343206 https://hal.archives-ouvertes.fr/hal-02343206/file/IWSLT2019_paper_5.pdf https://hal.archives-ouvertes.fr/hal-02343206/document
|
|
BASE
|
|
Hide details
|
|
3 |
Generic and Specialized Word Embeddings for Multi-Domain Machine Translation
|
|
|
|
In: International Workshop on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-02343215 ; International Workshop on Spoken Language Translation, Nov 2019, Hong-Kong, China. ⟨10.5281/zenodo.3524978⟩ ; https://zenodo.org/communities/iwslt2019/ (2019)
|
|
BASE
|
|
Show details
|
|
4 |
Neural Baselines for Word Alignments
|
|
|
|
In: International Workshop on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-02343217 ; International Workshop on Spoken Language Translation, Nov 2019, Hong-Kong, China ; https://zenodo.org/communities/iwslt2019/ (2019)
|
|
BASE
|
|
Show details
|
|
5 |
Book Review
|
|
|
|
In: ISSN: 0891-2017 ; EISSN: 1530-9312 ; Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02267552 ; 2019, 4p (2019)
|
|
BASE
|
|
Show details
|
|
6 |
Reassessing the proper place of man and machine in translation: a pre-translation scenario
|
|
|
|
In: ISSN: 0922-6567 ; EISSN: 1573-0573 ; Machine Translation ; https://hal.archives-ouvertes.fr/hal-01908305 ; Machine Translation, Springer Verlag, 2018, 32 (4), 31p. ⟨10.1007/s10590-018-9223-9⟩ (2018)
|
|
BASE
|
|
Show details
|
|
7 |
Measuring the adequacy of cross-lingual paraphrases in a Machine Translation setting
|
|
|
|
In: International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01838574 ; International Conference on Computational Linguistics, Jan 2012, Mumbai, India (2012)
|
|
BASE
|
|
Show details
|
|
|
|