1 |
The Zero Resource Speech Challenge 2021: Spoken language modelling
|
|
|
|
In: ISSN: 0162-8828 ; IEEE Transactions on Pattern Analysis and Machine Intelligence ; https://hal.inria.fr/hal-03329301 ; IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2021, pp.1-1. ⟨10.1109/TPAMI.2021.3083839⟩ (2021)
|
|
BASE
2 |
The Zero Resource Speech Challenge 2021: Spoken language modelling
|
|
|
|
In: Interspeech 2021 - Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-03329301 ; Interspeech 2021 - Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. ⟨10.1109/TPAMI.2021.3083839⟩ (2021)
|
|
BASE
3 |
The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
|
|
|
|
In: NeurIPS Workshop on Self-Supervised Learning for Speech and Audio Processing ; https://hal.archives-ouvertes.fr/hal-03070362 ; NeurIPS Workshop on Self-Supervised Learning for Speech and Audio Processing, Dec 2020, Virtual, France (2020)
|
|
BASE
4 |
The Perceptimatic English Benchmark for Speech Perception Models
|
|
|
|
In: CogSci 2020 - 42nd Annual Virtual Meeting of the Cognitive Science Society ; https://hal.archives-ouvertes.fr/hal-03087248 ; CogSci 2020 - 42nd Annual Virtual Meeting of the Cognitive Science Society, Jul 2020, Toronto / Virtual, Canada (2020)
|
|
BASE
5 |
Perceptimatic: A human speech perception benchmark for unsupervised subword modelling
|
|
|
|
In: Interspeech 2020 - 21st Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03087252 ; Interspeech 2020 - 21st Annual Conference of the International Speech Communication Association, Oct 2020, Shanghai / Virtual, China ; http://www.interspeech2020.org/ (2020)
|
|
BASE
6 |
Modelling Perceptual Effects of Phonology with ASR Systems
|
|
|
|
In: CogSci 2020 - 42nd Annual Virtual Meeting of the Cognitive Science Society ; https://hal.archives-ouvertes.fr/hal-03070281 ; CogSci 2020 - 42nd Annual Virtual Meeting of the Cognitive Science Society, Jul 2020, Virtual, France (2020)
|
|
BASE
7 |
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
|
|
|
|
In: Interspeech 2020 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02962224 ; Interspeech 2020 - Conference of the International Speech Communication Association, Oct 2020, Shanghai / Virtual, China (2020)
|
|
BASE
8 |
Analogies minus analogy test: measuring regularities in word embeddings
|
|
|
|
In: CoNLL 2020 - 24th Conference on Computational Natural Language Learning ; https://hal.archives-ouvertes.fr/hal-03070260 ; CoNLL 2020 - 24th Conference on Computational Natural Language Learning, Nov 2020, Virtual, France (2020)
|
|
BASE
9 |
Independent and Automatic Evaluation of Speaker-Independent Acoustic-to-Articulatory Reconstruction
|
|
|
|
In: Interspeech 2020 - 21st Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03087264 ; Interspeech 2020 - 21st Annual Conference of the International Speech Communication Association, Oct 2020, Shanghai / Virtual, China ; http://www.interspeech2020.org/ (2020)
|
|
BASE
10 |
Analogies minus analogy test: measuring regularities in word embeddings ...
|
|
|
|
BASE
11 |
Perceptimatic: A human speech perception benchmark for unsupervised subword modelling ...
|
|
|
|
BASE
12 |
Tensor Product Decomposition Networks: Uncovering Representations of Structure Learned by Neural Networks
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2020)
|
|
BASE
13 |
Comparing unsupervised speech learning directly to human performance in speech perception
|
|
|
|
In: Proceedings of the Annual Conference of the Cognitive Science Society (Cog Sci) ; CogSci 2019 - 41st Annual Meeting of Cognitive Science Society ; https://hal.archives-ouvertes.fr/hal-02274499 ; CogSci 2019 - 41st Annual Meeting of Cognitive Science Society, Jul 2019, Montréal, Canada (2019)
|
|
BASE
14 |
Generative grammar, neural networks, and the implementational mapping problem: Response to Pater
|
|
|
|
In: ISSN: 0097-8507 ; EISSN: 1535-0665 ; Language ; https://hal.archives-ouvertes.fr/hal-02274522 ; Language, Linguistic Society of America, 2019, 95 (1), pp.e87-e98. ⟨10.1353/lan.2019.0013⟩ (2019)
|
|
BASE
15 |
RNNs Implicitly Implement Tensor Product Representations
|
|
|
|
In: International Conference on Learning Representations ; ICLR 2019 - International Conference on Learning Representations ; https://hal.archives-ouvertes.fr/hal-02274498 ; ICLR 2019 - International Conference on Learning Representations, May 2019, New Orleans, United States (2019)
|
|
BASE
16 |
The Zero Resource Speech Challenge 2019: TTS without T
|
|
Dunbar, Ewan; Algayres, Robin; Karadayi, Julien; Bernard, Mathieu; Benjumea, Juan; Cao, Xuan-Nga; Miskic, Lucie; Dugrain, Charlotte; Ondel, Lucas; Black, Alan; Besacier, Laurent; Sakti, Sakriani; Dupoux, Emmanuel
|
|
In: Interspeech 2019 - 20th Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02274112 ; Interspeech 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria (2019)
|
|
Abstract:
We present the Zero Resource Speech Challenge 2019, which proposes to build a speech synthesizer without any text or phonetic labels: hence, TTS without T (text-to-speech without text). We provide raw audio for a target voice in an unknown language (the Voice dataset), but no alignment, text or labels. Participants must discover subword units in an unsupervised way (using the Unit Discovery dataset) and align them to the voice recordings in a way that works best for the purpose of synthesizing novel utterances from novel speakers, similar to the target speaker's voice. We describe the metrics used for evaluation, a baseline system consisting of unsupervised subword unit discovery plus a standard TTS system, and a topline TTS using gold phoneme transcriptions. We present an overview of the 19 submitted systems from 10 teams and discuss the main results.
|
|
Keyword:
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [SPI.ACOU]Engineering Sciences [physics]/Acoustics [physics.class-ph]; Acoustic unit discovery; Speech synthesis; Unsupervised learning; Zero resource speech technology
|
|
URL: https://hal.archives-ouvertes.fr/hal-02274112 https://hal.archives-ouvertes.fr/hal-02274112/document https://hal.archives-ouvertes.fr/hal-02274112/file/1904.11469.pdf
|
|
BASE
17 |
Mouse tracking as a window into decision making
|
|
|
|
In: ISSN: 1554-351X ; EISSN: 1554-3528 ; Behavior Research Methods ; https://hal.archives-ouvertes.fr/hal-02274523 ; Behavior Research Methods, Psychonomic Society, Inc, 2019, 51 (3), pp.1085-1101. ⟨10.3758/s13428-018-01194-x⟩ (2019)
|
|
BASE
18 |
The Zero Resource Speech Challenge 2017
|
|
|
|
In: ASRU 2017 ; https://hal.inria.fr/hal-01687504 ; ASRU 2017, Dec 2017, Okinawa, Japan (2017)
|
|
BASE
19 |
Learning Weakly Supervised Multimodal Phoneme Embeddings
|
|
|
|
In: Interspeech 2017 ; https://hal.inria.fr/hal-01687415 ; Interspeech 2017, 2017, Stockholm, Sweden. ⟨10.21437/Interspeech.2017-1689⟩ (2017)
|
|
BASE
20 |
Classification and automatic transcription of primate calls
|
|
|
|
In: ISSN: 0001-4966 ; EISSN: 1520-8524 ; Journal of the Acoustical Society of America ; https://hal.archives-ouvertes.fr/hal-02474093 ; Journal of the Acoustical Society of America, Acoustical Society of America, 2016, 140 (1), pp.EL26-EL30. ⟨10.1121/1.4954887⟩ (2016)
|
|
BASE