1 |
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
|
|
|
|
In: https://hal.inria.fr/hal-03540069 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
SIGTYP 2020 Shared Task: Prediction of Typological Features ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information ...
|
|
|
|
Abstract:
The performance of neural machine translation systems is commonly evaluated in terms of BLEU. However, due to its reliance on target language properties and generation, the BLEU metric does not allow an assessment of which translation directions are more difficult to model. In this paper, we propose cross-mutual information (XMI): an asymmetric information-theoretic metric of machine translation difficulty that exploits the probabilistic nature of most neural machine translation models. XMI allows us to better evaluate the difficulty of translating text into the target language while controlling for the difficulty of the target-side generation component independent of the translation task. We then present the first systematic and controlled study of cross-lingual translation difficulties using modern neural translation systems. Code for replicating our experiments is available online at https://github.com/e-bug/nmt-difficulty. ... : Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics ...
|
|
URL: https://dx.doi.org/10.3929/ethz-b-000462309 http://hdl.handle.net/20.500.11850/462891
|
|
BASE
|
|
Hide details
|
|
5 |
Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
|
|
|
|
In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
|
|
BASE
|
|
Show details
|
|
8 |
UniMorph 3.0: Universal Morphology
|
|
|
|
In: Proceedings of the 12th Language Resources and Evaluation Conference (2020)
|
|
BASE
|
|
Show details
|
|
10 |
The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Spell Once, Summon Anywhere: A Two-Level Open-Vocabulary Language Model ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Unsupervised Disambiguation of Syncretism in Inflected Lexicons ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|