22 |
Towards Minimal Supervision BERT-based Grammar Error Correction ...
23 |
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection ...
24 |
It's not a Non-Issue: Negation as a Source of Error in Machine Translation ...
25 |
Automatic Extraction of Rules Governing Morphological Agreement ...
26 |
A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization ...
27 |
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information ...
28 |
Universal Phone Recognition with a Multilingual Allophone System ...
29 |
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
30 |
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models ...
Abstract:
Language models (LMs) have proven surprisingly successful at capturing factual knowledge by completing cloze-style fill-in-the-blank questions such as "Punta Cana is located in _." However, while knowledge is both written and queried in many languages, studies on LMs' factual representation ability have almost invariably been performed on English. To assess factual knowledge retrieval in LMs in different languages, we create a multilingual benchmark of cloze-style probes for 23 typologically diverse languages. To properly handle language variations, we expand probing methods from single- to multi-word entities, and develop several decoding algorithms to generate multi-token predictions. Extensive experimental results provide insights about how well (or poorly) current state-of-the-art LMs perform at this task in languages with more or fewer available resources. We further propose a code-switching-based method to improve the ability of multilingual LMs to access knowledge, and verify its effectiveness on ... : EMNLP 2020 ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2010.06189 https://arxiv.org/abs/2010.06189
31 |
AlloVera: a multilingual allophone database
In: LREC 2020: 12th Language Resources and Evaluation Conference ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://lrec2020.lrec-conf.org/ (2020)
32 |
Generalized Data Augmentation for Low-Resource Translation ...
33 |
Pushing the Limits of Low-Resource Morphological Inflection ...
37 |
A small Griko-Italian speech translation corpus
In: 6th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU'18) ; https://hal.archives-ouvertes.fr/hal-01962528 ; 6th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU'18), Aug 2018, New Delhi, India (2018)
38 |
A case study on using speech-to-translation alignments for language documentation ...