1 |
XTREME-S: Evaluating Cross-lingual Speech Representations ...
|
|
Conneau, Alexis; Bapna, Ankur; Zhang, Yu; Ma, Min; von Platen, Patrick; Lozhkov, Anton; Cherry, Colin; Jia, Ye; Rivera, Clara; Kale, Mihir; Van Esch, Daan; Axelrod, Vera; Khanuja, Simran; Clark, Jonathan H.; Firat, Orhan; Auli, Michael; Ruder, Sebastian; Riesa, Jason; Johnson, Melvin. - : arXiv, 2022
|
|
Abstract:
We introduce XTREME-S, a new benchmark to evaluate universal cross-lingual speech representations in many languages. XTREME-S covers four task families: speech recognition, classification, speech-to-text translation and retrieval. Covering 102 languages from 10+ language families, 3 different domains and 4 task families, XTREME-S aims to simplify multilingual speech representation evaluation, as well as catalyze research in "universal" speech representation learning. This paper describes the new benchmark and establishes the first speech-only and speech-text baselines using XLS-R and mSLAM on all downstream tasks. We motivate the design choices and detail how to use the benchmark. Datasets and fine-tuning scripts are made easily accessible at https://hf.co/datasets/google/xtreme_s. ... : Minor fix: language code for Filipino (Tagalog), "tg" -> "tl" ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2203.10752 https://arxiv.org/abs/2203.10752
|
|
BASE
|
|
Hide details
|
|
2 |
mSLAM: Massively multilingual joint pre-training for speech and text ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Larger-Scale Transformers for Multilingual Masked Language Modeling ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Unsupervised Cross-lingual Representation Learning for Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Unsupervised Cross-lingual Representation Learning at Scale ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Emerging Cross-lingual Structure in Pretrained Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Specializing distributional vectors of all words for lexical entailment
|
|
|
|
BASE
|
|
Show details
|
|
10 |
What you can cram into a single \$&!#* vector: Probing sentence embeddings for linguistic properties
|
|
|
|
In: ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01898412 ; ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Jul 2018, Melbourne, Australia. pp.2126-2136 (2018)
|
|
BASE
|
|
Show details
|
|
11 |
XNLI: Evaluating Cross-lingual Sentence Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
What you can cram into a single vector: Probing sentence embeddings for linguistic properties ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Very Deep Convolutional Networks for Text Classification
|
|
|
|
In: European Chapter of the Association for Computational Linguistics EACL'17 ; https://hal.archives-ouvertes.fr/hal-01454940 ; European Chapter of the Association for Computational Linguistics EACL'17, 2017, Valencia, Spain (2017)
|
|
BASE
|
|
Show details
|
|
15 |
What you can cram into a single $&!#* vector: probing sentence embeddings for linguistic properties
|
|
|
|
BASE
|
|
Show details
|
|
|
|