DE eng

Search in the Catalogues and Directories

Hits 1 – 15 of 15

1
The Zero Resource Speech Challenge 2021: Spoken language modelling
In: ISSN: 0162-8828 ; IEEE Transactions on Pattern Analysis and Machine Intelligence ; https://hal.inria.fr/hal-03329301 ; IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2021, pp.1-1. ⟨10.1109/TPAMI.2021.3083839⟩ (2021)
BASE
Show details
2
The Zero Resource Speech Challenge 2021: Spoken language modelling
In: Interspeech 2021 - Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-03329301 ; Interspeech 2021 - Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. ⟨10.1109/TPAMI.2021.3083839⟩ (2021)
BASE
Show details
3
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
In: INTERSPEECH 2021 - Annual Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-03329245 ; INTERSPEECH 2021 - Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
4
Communicating artificial neural networks develop efficient color-naming systems
In: ISSN: 0027-8424 ; EISSN: 1091-6490 ; Proceedings of the National Academy of Sciences of the United States of America ; https://hal.inria.fr/hal-03329084 ; Proceedings of the National Academy of Sciences of the United States of America , National Academy of Sciences, 2021, 118 (12), ⟨10.1073/pnas.2016569118⟩ (2021)
BASE
Show details
5
How BPE Affects Memorization in Transformers ...
BASE
Show details
6
Generative Spoken Language Modeling from Raw Audio ...
BASE
Show details
7
The Zero Resource Speech Challenge 2021: Spoken language modelling ...
BASE
Show details
8
Textless Speech Emotion Conversion using Discrete and Decomposed Representations ...
BASE
Show details
9
Communicating artificial neural networks develop efficient color-naming systems
In: Proc Natl Acad Sci U S A (2021)
BASE
Show details
10
Compositionality and Generalization in Emergent Languages
In: ACL 2020 - 8th annual meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02959466 ; ACL 2020 - 8th annual meeting of the Association for Computational Linguistics, Jul 2020, Seattle / Virtual, United States (2020)
BASE
Show details
11
LIBRI-LIGHT: a benchmark for asr with limited or no supervision
In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-02959460 ; ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, May 2020, Barcelona / Virtual, Spain. pp.7669-7673, ⟨10.1109/ICASSP40776.2020.9052942⟩ (2020)
BASE
Show details
12
Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
In: SLT 2020 - IEEE Spoken Language Technology Workshop ; https://hal.archives-ouvertes.fr/hal-03070321 ; SLT 2020 - IEEE Spoken Language Technology Workshop, Dec 2020, Shenzhen / Virtual, China (2020)
Abstract: International audience ; Contrastive Predictive Coding (CPC), based on predicting future segments of speech based on past segments is emerging as a powerful algorithm for representation learning of speech signal. However, it still under-performs other methods on unsupervised evaluation benchmarks. Here, we introduce WavAugment, a time-domain data augmentation library and find that applying augmentation in the past is generally more efficient and yields better performances than other methods. We find that a combination of pitch modification, additive noise and reverberation substantially increase the performance of CPC (relative improvement of 18-22%), beating the reference Libri-light results with 600 times less data. Using an out-of-domain dataset, time-domain data augmentation can push CPC to be on par with the state of the art on the Zero Speech Benchmark 2017. We also show that time-domain data augmentation consistently improves downstream limited-supervision phoneme classification tasks by a factor of 12-15% relative.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; Contrastive predictive coding; Data augmentation; Speech recognition; Unsupervised representation learning
URL: https://hal.archives-ouvertes.fr/hal-03070321
https://hal.archives-ouvertes.fr/hal-03070321/file/2007.00991.pdf
https://hal.archives-ouvertes.fr/hal-03070321/document
BASE
Hide details
13
Anti-efficient encoding in emergent communication
In: https://hal.archives-ouvertes.fr/hal-02274205 ; 2019 (2019)
BASE
Show details
14
Word-order biases in deep-agent emergent communication
In: ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02274157 ; ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Jul 2019, Florence, Italy (2019)
BASE
Show details
15
EGG: a toolkit for research on Emergence of lanGuage in Games
In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations ; https://hal.archives-ouvertes.fr/hal-02274229 ; Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations, Nov 2019, Hong Kong, China. ⟨10.18653/v1/D19-3010⟩ (2019)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
15
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern