DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 26

1
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapour, Singapore (2022)
BASE
Show details
2
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapour, Singapore (2022)
BASE
Show details
3
Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation ...
BASE
Show details
4
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding ...
Abstract: This paper proposes a simple and effective approach for automatic recognition of Cued Speech (CS), a visual communication tool that helps people with hearing impairment to understand spoken language with the help of hand gestures that can uniquely identify the uttered phonemes in complement to lipreading. The proposed approach is based on a pre-trained hand and lips tracker used for visual feature extraction and a phonetic decoder based on a multistream recurrent neural network trained with connectionist temporal classification loss and combined with a pronunciation lexicon. The proposed system is evaluated on an updated version of the French CS dataset CSF18 for which the phonetic transcription has been manually checked and corrected. With a decoding accuracy at the phonetic level of 70.88%, the proposed system outperforms our previous CNN-HMM decoder and competes with more complex baselines. ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://arxiv.org/abs/2204.04965
https://dx.doi.org/10.48550/arxiv.2204.04965
BASE
Hide details
5
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input
In: Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03372802 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3865-3869, ⟨10.21437/Interspeech.2021-275⟩ (2021)
BASE
Show details
6
Speech rehabilitation in post-stroke aphasia using visual illustration of speech articulators. A case report study
In: ISSN: 0269-9206 ; EISSN: 1464-5076 ; Clinical Linguistics & Phonetics ; https://hal.archives-ouvertes.fr/hal-02879182 ; Clinical Linguistics & Phonetics, Taylor & Francis, 2021, 35 (3), pp.253-276. ⟨10.1080/02699206.2020.1780473⟩ (2021)
BASE
Show details
7
Learning robust speech representation with an articulatory-regularized variational autoencoder
In: Proccedings of Interspeech 2021 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03373252 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
8
Learning robust speech representation with an articulatory-regularized variational autoencoder ...
BASE
Show details
9
Speech rehabilitation in chronic post-stroke aphasia using visual illustration of speech articulators.A case report study
In: WFN 2020 - 11th World Congress for NeuroRehabilitation ; https://hal.archives-ouvertes.fr/hal-03098915 ; WFN 2020 - 11th World Congress for NeuroRehabilitation, Oct 2020, Lyon (online), France ; https://www.wcnr-congress.org/ (2020)
BASE
Show details
10
Towards an articulatory-driven neural vocoder for speech synthesis
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.archives-ouvertes.fr/hal-03184762 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence (virtual), United States (2020)
BASE
Show details
11
Rehabilitation of speech disorders following glossectomy, based on ultrasound visual illustration and feedback
In: ISSN: 0269-9206 ; EISSN: 1464-5076 ; Clinical Linguistics & Phonetics ; https://hal.archives-ouvertes.fr/hal-01977670 ; Clinical Linguistics & Phonetics, Taylor & Francis, 2020, 34 (9), pp.826-843. ⟨10.1080/02699206.2019.1700310⟩ ; https://www.tandfonline.com/doi/abs/10.1080/02699206.2019.1700310?tab=permissions&scroll=top (2020)
BASE
Show details
12
Evaluating the Potential Gain of Auditory and Audiovisual Speech-Predictive Coding Using Deep Learning
In: ISSN: 0899-7667 ; EISSN: 1530-888X ; Neural Computation ; https://hal.archives-ouvertes.fr/hal-03016083 ; Neural Computation, Massachusetts Institute of Technology Press (MIT Press), 2020, 32 (3), pp.596-625. ⟨10.1162/neco_a_01264⟩ (2020)
BASE
Show details
13
Csf18 ...
Liu, Li; Hueber, Thomas; Feng, Gang. - : Zenodo, 2018
BASE
Show details
14
Csf18 ...
Liu, Li; Hueber, Thomas; Feng, Gang. - : Zenodo, 2018
BASE
Show details
15
Deeppredspeech: Computational Models Of Predictive Speech Coding Based On Deep Learning ...
BASE
Show details
16
DeepPredSpeech: computational models of predictive speech coding based on deep learning ...
BASE
Show details
17
DeepPredSpeech: computational models of predictive speech coding based on deep learning ...
BASE
Show details
18
Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping
In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01485540 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩ (2017)
BASE
Show details
19
Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-01578315 ; Speech Communication, Elsevier : North-Holland, 2017, 93, pp.63 - 75. ⟨10.1016/j.specom.2017.08.002⟩ (2017)
BASE
Show details
20
Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces
In: ISSN: 1553-734X ; EISSN: 1553-7358 ; PLoS Computational Biology ; https://hal.archives-ouvertes.fr/hal-01459706 ; PLoS Computational Biology, Public Library of Science, 2016, 12 (11), pp.e1005119. ⟨10.1371/journal.pcbi.1005119⟩ (2016)
BASE
Show details

Page: 1 2

Catalogues
0
0
2
0
0
0
0
Bibliographies
2
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
24
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern