Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 8 of 8

1	Learning spectro-temporal representations of complex sounds with parameterized neural networks
	Riad, Rachid; Karadayi, Julien; Bachoud-Lévi, Anne-Catherine...
	In: ISSN: 0001-4966 ; EISSN: 1520-8524 ; Journal of the Acoustical Society of America ; https://hal.inria.fr/hal-03329261 ; Journal of the Acoustical Society of America, Acoustical Society of America, 2021, 150 (1), pp.353-366. ⟨10.1121/10.0005482⟩ (2021)
	BASE
	Show details

2	The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
	Dunbar, Ewan; Karadayi, Julien; Bernard, Mathieu...
	In: Interspeech 2020 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02962224 ; Interspeech 2020 - Conference of the International Speech Communication Association, Oct 2020, Shangai / Virtual, China (2020)
	BASE
	Show details

3	LIBRI-LIGHT: a benchmark for asr with limited or no supervision
	Kahn, Jacob; Rivière, Morgane; Zheng, Weiyi; Kharitonov, Eugene; Xu, Qiantong; Mazaré, Pierre-Emmanuel; Karadayi, Julien; Liptchinsky, Vitaliy; Collobert, Ronan; Fügen, Christian; Likhomanenko, Tatiana; Synnaeve, Gabriel; Joulin, Armand; Abdelrahman, Mohamed,; Dupoux, Emmanuel
	In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-02959460 ; ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, May 2020, Barcelona / Virtual, Spain. pp.7669-7673, ⟨10.1109/ICASSP40776.2020.9052942⟩ (2020)
	Abstract: International audience ; We introduce a new collection of spoken English audio suitable for training speech recognition systems under limited or no supervision. It is derived from open-source audio books from the LibriVox project. It contains over 60K hours of audio , which is, to our knowledge, the largest freely-available corpus of speech. The audio has been segmented using voice activity detection and is tagged with SNR, speaker ID and genre descriptions. Additionally, we provide baseline systems and evaluation metrics working under three settings: (1) the zero resource/unsupervised setting (ABX), (2) the semi-supervised setting (PER, CER) and (3) the distant supervision setting (WER). Settings (2) and (3) use limited textual resources (10 minutes to 10 hours) aligned with the speech. Setting (3) uses large amounts of unaligned text. They are evaluated on the standard LibriSpeech dev and test sets for comparison with the supervised state-of-the-art. Index Terms-unsupervised and semi-supervised learning , distant supervision, dataset, zero-and low resource ASR.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
	URL: https://doi.org/10.1109/ICASSP40776.2020.9052942 https://hal.archives-ouvertes.fr/hal-02959460/document https://hal.archives-ouvertes.fr/hal-02959460 https://hal.archives-ouvertes.fr/hal-02959460/file/1912.07875.pdf
	BASE
	Hide details

4	The Zero Resource Speech Challenge 2019: TTS without T
	Dunbar, Ewan; Algayres, Robin; Karadayi, Julien...
	In: Interspeech 2019 - 20th Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02274112 ; Interspeech 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria (2019)
	BASE
	Show details

5	Learning Word Embeddings: Unsupervised Methods for Fixed-size Representations of Variable-length Speech Segments
	Holzenberger, Nils; Du, Mingxing; Karadayi, Julien...
	In: Interspeech 2018 ; https://hal.archives-ouvertes.fr/hal-01888708 ; Interspeech 2018, Sep 2018, Hyderabad, India. ⟨10.21437/Interspeech.2018-2364⟩ (2018)
	BASE
	Show details

6	Sampling strategies in Siamese Networks for unsupervised speech representation learning
	Riad, Rachid; Dancette, Corentin; Karadayi, Julien...
	In: Interspeech 2018 ; https://hal.archives-ouvertes.fr/hal-01888725 ; Interspeech 2018, Sep 2018, Hyderabad, India (2018)
	BASE
	Show details

7	A K-nearest neighbours approach to unsupervised spoken term discovery
	Thual, Alexis; Dancette, Corentin; Karadayi, Julien...
	In: IEEE Spoken Language Technology SLT-2018 ; https://hal.archives-ouvertes.fr/hal-01947953 ; IEEE Spoken Language Technology SLT-2018, Dec 2018, Athènes, Greece (2018)
	BASE
	Show details

8	The Zero Resource Speech Challenge 2017
	Dunbar, Ewan; Cao, Xuan-Nga; Benjumea, Juan...
	In: ASRU 2017 ; https://hal.inria.fr/hal-01687504 ; ASRU 2017, Dec 2017, Okinawa, Japan (2017)
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern