Hits 1 – 20 of 26

1	Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
	Sankar, Sanjana; Beautemps, Denis; Hueber, Thomas
	In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore (2022)
	BASE

2	Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
	Sankar, Sanjana; Beautemps, Denis; Hueber, Thomas
	In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore (2022)
	BASE

3	Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation ...
	Georges, Marc-Antoine; Diard, Julien; Girin, Laurent. - : arXiv, 2022
	BASE

4	Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding ...
	Sankar, Sanjana; Beautemps, Denis; Hueber, Thomas. - : arXiv, 2022
	Abstract: This paper proposes a simple and effective approach for automatic recognition of Cued Speech (CS), a visual communication tool that helps people with hearing impairment to understand spoken language with the help of hand gestures that can uniquely identify the uttered phonemes in complement to lipreading. The proposed approach is based on a pre-trained hand and lips tracker used for visual feature extraction and a phonetic decoder based on a multistream recurrent neural network trained with connectionist temporal classification loss and combined with a pronunciation lexicon. The proposed system is evaluated on an updated version of the French CS dataset CSF18 for which the phonetic transcription has been manually checked and corrected. With a decoding accuracy at the phonetic level of 70.88%, the proposed system outperforms our previous CNN-HMM decoder and competes with more complex baselines. ...
	Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://arxiv.org/abs/2204.04965 https://dx.doi.org/10.48550/arxiv.2204.04965
	BASE

5	Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input
	Stephenson, Brooke; Hueber, Thomas; Girin, Laurent...
	In: Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03372802 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3865-3869, ⟨10.21437/Interspeech.2021-275⟩ (2021)
	BASE

6	Speech rehabilitation in post-stroke aphasia using visual illustration of speech articulators. A case report study
	Haldin, Célise; Loevenbruck, Hélène; Hueber, Thomas...
	In: ISSN: 0269-9206 ; EISSN: 1464-5076 ; Clinical Linguistics & Phonetics ; https://hal.archives-ouvertes.fr/hal-02879182 ; Clinical Linguistics & Phonetics, Taylor & Francis, 2021, 35 (3), pp.253-276. ⟨10.1080/02699206.2020.1780473⟩ (2021)
	BASE

7	Learning robust speech representation with an articulatory-regularized variational autoencoder
	Georges, Marc-Antoine; Girin, Laurent; Schwartz, Jean-Luc...
	In: Proceedings of Interspeech 2021 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03373252 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE

8	Learning robust speech representation with an articulatory-regularized variational autoencoder ...
	Georges, Marc-Antoine; Girin, Laurent; Schwartz, Jean-Luc. - : arXiv, 2021
	BASE

9	Speech rehabilitation in chronic post-stroke aphasia using visual illustration of speech articulators. A case report study
	Haldin, Célise; Loevenbruck, Hélène; Hueber, Thomas...
	In: WFN 2020 - 11th World Congress for NeuroRehabilitation ; https://hal.archives-ouvertes.fr/hal-03098915 ; WFN 2020 - 11th World Congress for NeuroRehabilitation, Oct 2020, Lyon (online), France ; https://www.wcnr-congress.org/ (2020)
	BASE

10	Towards an articulatory-driven neural vocoder for speech synthesis
	Georges, Marc-Antoine; Badin, Pierre; Diard, Julien...
	In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.archives-ouvertes.fr/hal-03184762 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence (virtual), United States (2020)
	BASE

11	Rehabilitation of speech disorders following glossectomy, based on ultrasound visual illustration and feedback
	Girod-Roux, Marion; Hueber, Thomas; Fabre, Diandra...
	In: ISSN: 0269-9206 ; EISSN: 1464-5076 ; Clinical Linguistics & Phonetics ; https://hal.archives-ouvertes.fr/hal-01977670 ; Clinical Linguistics & Phonetics, Taylor & Francis, 2020, 34 (9), pp.826-843. ⟨10.1080/02699206.2019.1700310⟩ ; https://www.tandfonline.com/doi/abs/10.1080/02699206.2019.1700310?tab=permissions&scroll=top (2020)
	BASE

12	Evaluating the Potential Gain of Auditory and Audiovisual Speech-Predictive Coding Using Deep Learning
	Hueber, Thomas; Tatulli, Eric; Girin, Laurent...
	In: ISSN: 0899-7667 ; EISSN: 1530-888X ; Neural Computation ; https://hal.archives-ouvertes.fr/hal-03016083 ; Neural Computation, Massachusetts Institute of Technology Press (MIT Press), 2020, 32 (3), pp.596-625. ⟨10.1162/neco_a_01264⟩ (2020)
	BASE

13	CSF18 ...
	Liu, Li; Hueber, Thomas; Feng, Gang. - : Zenodo, 2018
	BASE

14	CSF18 ...
	Liu, Li; Hueber, Thomas; Feng, Gang. - : Zenodo, 2018
	BASE

15	Deeppredspeech: Computational Models Of Predictive Speech Coding Based On Deep Learning ...
	Hueber, Thomas; Tatulli, Eric; Girin, Laurent. - : Zenodo, 2018
	BASE

16	DeepPredSpeech: computational models of predictive speech coding based on deep learning ...
	Hueber, Thomas; Tatulli, Eric; Girin, Laurent. - : Zenodo, 2018
	BASE

17	DeepPredSpeech: computational models of predictive speech coding based on deep learning ...
	Hueber, Thomas; Tatulli, Eric; Girin, Laurent. - : Zenodo, 2018
	BASE

18	Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping
	Girin, Laurent; Hueber, Thomas; Alameda-Pineda, Xavier
	In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01485540 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩ (2017)
	BASE

19	Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract
	Fabre, Diandra; Hueber, Thomas; Girin, Laurent...
	In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-01578315 ; Speech Communication, Elsevier : North-Holland, 2017, 93, pp.63-75. ⟨10.1016/j.specom.2017.08.002⟩ (2017)
	BASE

20	Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces
	Bocquelet, Florent; Hueber, Thomas; Girin, Laurent...
	In: ISSN: 1553-734X ; EISSN: 1553-7358 ; PLoS Computational Biology ; https://hal.archives-ouvertes.fr/hal-01459706 ; PLoS Computational Biology, Public Library of Science, 2016, 12 (11), pp.e1005119. ⟨10.1371/journal.pcbi.1005119⟩ (2016)
	BASE
