1 |
Investigating alignment interpretability for low-resource NMT
In: ISSN: 0922-6567 ; EISSN: 1573-0573 ; Machine Translation ; https://hal.archives-ouvertes.fr/hal-03139744 ; Machine Translation, Springer Verlag, 2021, ⟨10.1007/s10590-020-09254-w⟩ (2021)
2 |
Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation
In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03372487 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
Abstract:
Boosted by the simultaneous translation shared task at IWSLT 2020, promising end-to-end online speech translation approaches were recently proposed. They consist of incrementally encoding a speech input (in a source language) and decoding the corresponding text (in a target language) with the best possible trade-off between latency and translation quality. This paper investigates two key aspects of end-to-end simultaneous speech translation: (a) how to efficiently encode the continuous speech flow, and (b) how to segment the speech flow in order to alternate optimally between reading (R: encoding input) and writing (W: decoding output) operations. We extend our previously proposed end-to-end online decoding strategy and show that while replacing BLSTM by ULSTM encoding degrades performance in offline mode, it actually improves both efficiency and performance in online mode. We also measure the impact of different methods to segment the speech signal (using fixed interval boundaries, oracle word boundaries or randomly set boundaries) and show that our best end-to-end online decoding strategy is, surprisingly, the one that alternates R/W operations on fixed-size blocks on our English-German speech translation setup.
Keywords:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; efficient speech technologies; online sequence-to-sequence models; simultaneous speech translation; speech segmentation
URL: https://hal.archives-ouvertes.fr/hal-03372487/file/2104.14470.pdf https://hal.archives-ouvertes.fr/hal-03372487/document https://hal.archives-ouvertes.fr/hal-03372487
3 |
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input
In: Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03372802 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3865-3869, ⟨10.21437/Interspeech.2021-275⟩ (2021)
4 |
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
5 |
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
6 |
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
7 |
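Contribution d'informations syntaxiques aux capacités de généralisation compositionelle des modèles seq2seq convolutifs [Contribution of syntactic information to the compositional generalization abilities of convolutional seq2seq models]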
In: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale ; Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-03265890 ; Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.134-141 (2021)
8 |
Lightweight Adapter Tuning for Multilingual Speech Translation
In: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021) ; https://hal.archives-ouvertes.fr/hal-03294912 ; The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Aug 2021, Bangkok (Virtual), Thailand (2021)
9 |
Visualizing Cross-Lingual Discourse Relations in Multilingual TED Corpora
In: Proceedings of the 2nd Workshop on Computational Approaches to Discourse ; CODI 2021: 2nd Workshop on Computational Approaches to Discourse ; https://hal.archives-ouvertes.fr/hal-03642341 ; CODI 2021: 2nd Workshop on Computational Approaches to Discourse, Nov 2021, Punta Cana, Dominican Republic. ⟨10.18653/v1/2021.codi-main.16⟩ (2021)
10 |
Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?
In: Findings of ACL 2021 ; https://hal.archives-ouvertes.fr/hal-03299010 ; Findings of ACL 2021, Aug 2021, Bangkok (virtual), Thailand (2021)
11 |
Investigating the Impact of Gender Representation in ASR Training Data: a Case Study on Librispeech
In: Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing ; 3rd Workshop on Gender Bias in Natural Language Processing ; https://hal.univ-grenoble-alpes.fr/hal-03472117 ; 3rd Workshop on Gender Bias in Natural Language Processing, Aug 2021, Online, France. pp.86-92, ⟨10.18653/v1/2021.gebnlp-1.10⟩ (2021)
12 |
FlauBERT: Unsupervised Language Model Pre-training for French
In: Proceedings of the 12th Language Resources and Evaluation Conference ; LREC ; https://hal.archives-ouvertes.fr/hal-02890258 ; LREC, 2020, Marseille, France (2020)
13 |
FlauBERT : Unsupervised Language Model Pre-training for French ; FlauBERT : des modèles de langue contextualisés pré-entraînés pour le français
In: Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-02784776 ; Jun 2020, Nancy, France. pp.268-278 (2020)
14 |
Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech
In: Conference on Natural Language Learning (CoNLL) ; https://hal.archives-ouvertes.fr/hal-02962275 ; Conference on Natural Language Learning (CoNLL), Nov 2020, Virtual, France (2020)
15 |
Investigating Language Impact in Bilingual Approaches for Computational Language Documentation
In: Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020) ; SLTU-CCURL workshop, LREC 2020 ; https://hal.archives-ouvertes.fr/hal-02895907 ; SLTU-CCURL workshop, LREC 2020, May 2020, Marseille, France (2020)
16 |
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
In: Interspeech 2020 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02962224 ; Interspeech 2020 - Conference of the International Speech Communication Association, Oct 2020, Shanghai / Virtual, China (2020)
17 |
Speech technology for unwritten languages
In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-02480675 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩ (2020)
18 |
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
In: Proceedings of The 12th Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-02611059 ; Proceedings of The 12th Language Resources and Evaluation Conference, May 2020, Marseille, France. pp.6486-6493 (2020)
19 |
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
In: COLING 2020 (long paper) ; https://hal.archives-ouvertes.fr/hal-02991564 ; COLING 2020 (long paper), Dec 2020, Virtual, Spain (2020)
20 |
ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
In: Proceedings of the 17th International Conference on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-02895893 ; Proceedings of the 17th International Conference on Spoken Language Translation, Jul 2020, Seattle, WA, United States. pp.35-43, ⟨10.18653/v1/2020.iwslt-1.2⟩ (2020)