DE eng

Search in the Catalogues and Directories

Hits 1 – 13 of 13

1
Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
In: EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03463108 ; EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2021, Kiev, Ukraine. pp.874-880, ⟨10.18653/v1/2021.eacl-main.74⟩ (2021)
BASE
Show details
2
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web ...
BASE
Show details
3
Beyond English-Centric Multilingual Machine Translation ...
BASE
Show details
4
Unsupervised Cross-lingual Representation Learning at Scale ...
Abstract: This paper shows that pretraining multilingual language models at scale leads to significant performance gains for a wide range of cross-lingual transfer tasks. We train a Transformer-based masked language model on one hundred languages, using more than two terabytes of filtered CommonCrawl data. Our model, dubbed XLM-R, significantly outperforms multilingual BERT (mBERT) on a variety of cross-lingual benchmarks, including +14.6% average accuracy on XNLI, +13% average F1 score on MLQA, and +2.4% F1 score on NER. XLM-R performs particularly well on low-resource languages, improving 15.7% in XNLI accuracy for Swahili and 11.4% for Urdu over previous XLM models. We also present a detailed empirical analysis of the key factors that are required to achieve these gains, including the trade-offs between (1) positive transfer and capacity dilution and (2) the performance of high and low resource languages at scale. Finally, we show, for the first time, the possibility of multilingual modeling without sacrificing ... : ACL 2020 (+ updated results) ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/1911.02116
https://dx.doi.org/10.48550/arxiv.1911.02116
BASE
Hide details
5
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the WEB ...
BASE
Show details
6
Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction ...
BASE
Show details
7
Colorless green recurrent networks dream hierarchically
In: Proceedings of the Society for Computation in Linguistics (2019)
BASE
Show details
8
Unsupervised Hyperalignment for Multilingual Word Embeddings ...
BASE
Show details
9
Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion ...
BASE
Show details
10
Unsupervised Alignment of Embeddings with Wasserstein Procrustes ...
BASE
Show details
11
Colorless green recurrent networks dream hierarchically ...
BASE
Show details
12
Colorless green recurrent networks dream hierarchically
In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018 P. 1195–1205 (2018)
BASE
Show details
13
A Markovian approach to distributional semantics with application to semantic compositionality
In: International Conference on Computational Linguistics (Coling) ; https://hal.inria.fr/hal-01080309 ; International Conference on Computational Linguistics (Coling), International Committee on Computational Linguistics (ICCL), Aug 2014, Dublin, Ireland. pp.1447 - 1456 ; http://www.coling-2014.org/ (2014)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
13
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern