Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
- Medium:
  - Online (15)
- Type
- BLLDB-Access:
  - free (15)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 15 of 15

1	The Zero Resource Speech Challenge 2021: Spoken language modelling
	Dunbar, Ewan; Bernard, Mathieu; Hamilakis, Nicolas...
	In: ISSN: 0162-8828 ; IEEE Transactions on Pattern Analysis and Machine Intelligence ; https://hal.inria.fr/hal-03329301 ; IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2021, pp.1-1. ⟨10.1109/TPAMI.2021.3083839⟩ (2021)
	BASE
	Show details

2	The Zero Resource Speech Challenge 2021: Spoken language modelling
	Dunbar, Ewan; Bernard, Mathieu; Hamilakis, Nicolas...
	In: Interspeech 2021 - Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-03329301 ; Interspeech 2021 - Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. ⟨10.1109/TPAMI.2021.3083839⟩ (2021)
	BASE
	Show details

3	Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
	Polyak, Adam; Adi, Yossi; Copet, Jade...
	In: INTERSPEECH 2021 - Annual Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-03329245 ; INTERSPEECH 2021 - Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

4	Communicating artificial neural networks develop efficient color-naming systems
	Chaabouni, Rahma; Kharitonov, Eugene; Dupoux, Emmanuel...
	In: ISSN: 0027-8424 ; EISSN: 1091-6490 ; Proceedings of the National Academy of Sciences of the United States of America ; https://hal.inria.fr/hal-03329084 ; Proceedings of the National Academy of Sciences of the United States of America , National Academy of Sciences, 2021, 118 (12), ⟨10.1073/pnas.2016569118⟩ (2021)
	BASE
	Show details

5	How BPE Affects Memorization in Transformers ...
	Kharitonov, Eugene; Baroni, Marco; Hupkes, Dieuwke. - : arXiv, 2021
	BASE
	Show details

6	Generative Spoken Language Modeling from Raw Audio ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Adi, Yossi; Baevski, Alexei. - : Underline Science Inc., 2021
	BASE
	Show details

7	The Zero Resource Speech Challenge 2021: Spoken language modelling ...
	Dunbar, Ewan; Bernard, Mathieu; Hamilakis, Nicolas. - : arXiv, 2021
	BASE
	Show details

8	Textless Speech Emotion Conversion using Discrete and Decomposed Representations ...
	Kreuk, Felix; Polyak, Adam; Copet, Jade. - : arXiv, 2021
	BASE
	Show details

9	Communicating artificial neural networks develop efficient color-naming systems
	Chaabouni, Rahma; Kharitonov, Eugene; Dupoux, Emmanuel...
	In: Proc Natl Acad Sci U S A (2021)
	BASE
	Show details

10	Compositionality and Generalization in Emergent Languages
	Chaabouni, Rahma; Kharitonov, Eugene; Bouchacourt, Diane...
	In: ACL 2020 - 8th annual meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02959466 ; ACL 2020 - 8th annual meeting of the Association for Computational Linguistics, Jul 2020, Seattle / Virtual, United States (2020)
	BASE
	Show details

11	LIBRI-LIGHT: a benchmark for asr with limited or no supervision
	Kahn, Jacob; Rivière, Morgane; Zheng, Weiyi...
	In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-02959460 ; ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, May 2020, Barcelona / Virtual, Spain. pp.7669-7673, ⟨10.1109/ICASSP40776.2020.9052942⟩ (2020)
	BASE
	Show details

12	Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
	Kharitonov, Eugene; Rivière, Morgane; Synnaeve, Gabriel; Wolf, Lior; Mazaré, Pierre-Emmanuel; Douze, Matthijs; Dupoux, Emmanuel
	In: SLT 2020 - IEEE Spoken Language Technology Workshop ; https://hal.archives-ouvertes.fr/hal-03070321 ; SLT 2020 - IEEE Spoken Language Technology Workshop, Dec 2020, Shenzhen / Virtual, China (2020)
	Abstract: International audience ; Contrastive Predictive Coding (CPC), based on predicting future segments of speech based on past segments is emerging as a powerful algorithm for representation learning of speech signal. However, it still under-performs other methods on unsupervised evaluation benchmarks. Here, we introduce WavAugment, a time-domain data augmentation library and find that applying augmentation in the past is generally more efficient and yields better performances than other methods. We find that a combination of pitch modification, additive noise and reverberation substantially increase the performance of CPC (relative improvement of 18-22%), beating the reference Libri-light results with 600 times less data. Using an out-of-domain dataset, time-domain data augmentation can push CPC to be on par with the state of the art on the Zero Speech Benchmark 2017. We also show that time-domain data augmentation consistently improves downstream limited-supervision phoneme classification tasks by a factor of 12-15% relative.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; Contrastive predictive coding; Data augmentation; Speech recognition; Unsupervised representation learning
	URL: https://hal.archives-ouvertes.fr/hal-03070321 https://hal.archives-ouvertes.fr/hal-03070321/file/2007.00991.pdf https://hal.archives-ouvertes.fr/hal-03070321/document
	BASE
	Hide details

13	Anti-efficient encoding in emergent communication
	Chaabouni, Rahma; Kharitonov, Eugene; Dupoux, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-02274205 ; 2019 (2019)
	BASE
	Show details

14	Word-order biases in deep-agent emergent communication
	Chaabouni, Rahma; Kharitonov, Eugene; Lazaric, Alessandro...
	In: ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02274157 ; ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Jul 2019, Florence, Italy (2019)
	BASE
	Show details

15	EGG: a toolkit for research on Emergence of lanGuage in Games
	Kharitonov, Eugene; Chaabouni, Rahma; Bouchacourt, Diane...
	In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations ; https://hal.archives-ouvertes.fr/hal-02274229 ; Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations, Nov 2019, Hong Kong, China. ⟨10.18653/v1/D19-3010⟩ (2019)
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern