1 |
Controlling Utterance Length in NMT-based Word Segmentation with Attention
In: International Workshop on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-02343206 ; International Workshop on Spoken Language Translation, Nov 2019, Hong Kong, China (2019)
2 |
Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings
In: Interspeech 2019 ; https://hal.archives-ouvertes.fr/hal-02193867 ; Interspeech 2019, Sep 2019, Graz, Austria (2019)
3 |
Word Recognition, Competition, and Activation in a Model of Visually Grounded Speech
In: Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL) ; https://hal.archives-ouvertes.fr/hal-02359540 ; Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), Nov 2019, Hong Kong, China. pp.339-348, ⟨10.18653/v1/K19-1032⟩ (2019)
4 |
The Zero Resource Speech Challenge 2019: TTS without T
Dunbar, Ewan; Algayres, Robin; Karadayi, Julien; Bernard, Mathieu; Benjumea, Juan; Cao, Xuan-Nga; Miskic, Lucie; Dugrain, Charlotte; Ondel, Lucas; Black, Alan; Besacier, Laurent; Sakti, Sakriani; Dupoux, Emmanuel
In: Interspeech 2019 - 20th Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02274112 ; Interspeech 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria (2019)
Abstract:
We present the Zero Resource Speech Challenge 2019, which proposes to build a speech synthesizer without any text or phonetic labels: hence, TTS without T (text-to-speech without text). We provide raw audio for a target voice in an unknown language (the Voice dataset), but no alignment, text or labels. Participants must discover subword units in an unsupervised way (using the Unit Discovery dataset) and align them to the voice recordings in a way that works best for the purpose of synthesizing novel utterances from novel speakers, similar to the target speaker's voice. We describe the metrics used for evaluation, a baseline system consisting of unsupervised subword unit discovery plus a standard TTS system, and a topline TTS using gold phoneme transcriptions. We present an overview of the 19 submitted systems from 10 teams and discuss the main results.
Keyword:
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [SPI.ACOU]Engineering Sciences [physics]/Acoustics [physics.class-ph]; Acoustic unit discovery; Speech synthesis; Unsupervised learning; Zero resource speech technology
URL: https://hal.archives-ouvertes.fr/hal-02274112 https://hal.archives-ouvertes.fr/hal-02274112/document https://hal.archives-ouvertes.fr/hal-02274112/file/1904.11469.pdf
5 |
Models of Visually Grounded Speech Signal Pay Attention to Nouns: A Bilingual Experiment on English and Japanese
In: International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-02013984 ; International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2019, Brighton, United Kingdom. pp.8618-8622, ⟨10.1109/ICASSP.2019.8683069⟩ (2019)
6 |
A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages
In: ISSN: 1351-3249 ; EISSN: 1469-8110 ; Natural Language Engineering ; https://hal.archives-ouvertes.fr/hal-01976297 ; Natural Language Engineering, Cambridge University Press (CUP), 2019, 25 (01), pp.43-67. ⟨10.1017/S1351324918000293⟩ (2019)
7 |
How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages
In: Journées Scientifiques du Groupement de Recherche: Linguistique Informatique, Formelle et de Terrain (LIFT) ; https://hal.archives-ouvertes.fr/hal-02895895 ; Journées Scientifiques du Groupement de Recherche: Linguistique Informatique, Formelle et de Terrain (LIFT), Nov 2019, Orléans, France (2019)
11 |
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ...