2 |
Investigating alignment interpretability for low-resource NMT
|
|
|
|
In: ISSN: 0922-6567 ; EISSN: 1573-0573 ; Machine Translation ; https://hal.archives-ouvertes.fr/hal-03139744 ; Machine Translation, Springer Verlag, 2021, ⟨10.1007/s10590-020-09254-w⟩ (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation
|
|
|
|
In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03372487 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input
|
|
|
|
In: Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03372802 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3865-3869, ⟨10.21437/Interspeech.2021-275⟩ (2021)
|
|
Abstract:
International audience ; Inferring the prosody of a word in text-to-speech synthesis requires information about its surrounding context. In incremental text-to-speech synthesis, where the synthesizer produces an output before it has access to the complete input, the full context is often unknown which can result in a loss of naturalness. In this paper, we investigate whether the use of predicted future text from a transformer language model can attenuate this loss in a neural TTS system. We compare several test conditions of next future word: (a) unknown (zero-word), (b) language model predicted, (c) randomly predicted and (d) ground-truth. We measure the prosodic features (pitch, energy and duration) and find that predicted text provides significant improvements over a zero-word lookahead, but only slight gains over randomword lookahead. We confirm these results with a perceptive test.
|
|
Keyword:
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [INFO]Computer Science [cs]; Incremental text-to-speech; neural language models; prosody
|
|
URL: https://hal.archives-ouvertes.fr/hal-03372802 https://hal.archives-ouvertes.fr/hal-03372802/document https://hal.archives-ouvertes.fr/hal-03372802/file/Alternate_Endings_Interspeech-2.pdf https://doi.org/10.21437/Interspeech.2021-275
|
|
BASE
|
|
Hide details
|
|
5 |
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
|
|
|
|
In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
|
|
BASE
|
|
Show details
|
|
6 |
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
|
|
|
|
In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
|
|
BASE
|
|
Show details
|
|
7 |
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
|
|
|
|
In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Contribution d'informations syntaxiques aux capacités de généralisation compositionelle des modèles seq2seq convolutifs
|
|
|
|
In: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale ; Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-03265890 ; Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.134-141 (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Lightweight Adapter Tuning for Multilingual Speech Translation
|
|
|
|
In: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021) ; https://hal.archives-ouvertes.fr/hal-03294912 ; The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Aug 2021, Bangkok (Virtual), Thailand (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Visualizing Cross-Lingual Discourse Relations in Multilingual TED Corpora
|
|
|
|
In: Proceedings of the 2nd Workshop on Computational Approaches to Discourse ; CODI 2021: 2nd Workshop on Computational Approaches to Discourse ; https://hal.archives-ouvertes.fr/hal-03642341 ; CODI 2021: 2nd Workshop on Computational Approaches to Discourse, Nov 2021, Punta Cana, Dominican Republic. ⟨10.18653/v1/2021.codi-main.16⟩ (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?
|
|
|
|
In: Findings of ACL 2021 ; https://hal.archives-ouvertes.fr/hal-03299010 ; Findings of ACL 2021, Aug 2021, Bangkok (virtual), Thailand (2021)
|
|
BASE
|
|
Show details
|
|
12 |
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
|
|
|
|
In: ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages ; https://halshs.archives-ouvertes.fr/halshs-03030529 ; ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages, Mar 2021, Hawai‘i, United States (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Segmentation en mots faiblement supervisée pour la documentation automatique des langues
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03477475 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
14 |
Investigating the Impact of Gender Representation in ASR Training Data: a Case Study on Librispeech
|
|
|
|
In: Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing ; 3rd Workshop on Gender Bias in Natural Language Processing ; https://hal.univ-grenoble-alpes.fr/hal-03472117 ; 3rd Workshop on Gender Bias in Natural Language Processing, Aug 2021, Online, France. pp.86-92, ⟨10.18653/v1/2021.gebnlp-1.10⟩ (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads? ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Lightweight Adapter Tuning for Multilingual Speech Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Multilingual Unsupervised Neural Machine Translation with Denoising Adapters ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Lightweight Adapter Tuning for Multilingual Speech Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|