1 |
Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated
|
|
|
|
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.univ-lorraine.fr/hal-03650212 ; Speech Communication, Elsevier : North-Holland, 2022, ⟨10.1016/j.specom.2022.04.004⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
MRI Vocal Tract Sagittal Slices Estimation during Speech Production of CV
|
|
|
|
In: EUSIPCO 2020 - 28th European Signal Processing Conference ; https://hal.inria.fr/hal-03090824 ; EUSIPCO 2020 - 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands ; https://eusipco2020.org/ (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers
|
|
|
|
In: ISSN: 2052-4463 ; EISSN: 2052-4463 ; Scientific Data ; https://hal.archives-ouvertes.fr/hal-03507532 ; Scientific Data , Nature Publishing Group, 2021, 8 (1), pp.258. ⟨10.1038/s41597-021-01041-3⟩ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Towards the prediction of the vocal tract shape from the sequence of phonemes to be articulated
|
|
|
|
In: iNTERSPEECH 2021 ; https://hal.inria.fr/hal-03360113 ; iNTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
|
|
Abstract:
International audience ; In this work, we address the prediction of speech articulators' temporal geometric position from the sequence of phonemes to be articulated. We start from a set of real-time MRI sequences uttered by a female French speaker. The contours of five articulators were tracked automatically in each of the frames in the MRI video. Then, we explore the capacity of a bidirectional GRU to correctly predict each articulator's shape and position given the sequence of phonemes and their duration. We propose a 5-fold cross-validation experiment to evaluate the generalization capacity of the model. In a second experiment, we evaluate our model's data efficiency by reducing training data. We evaluate the point-to-point Euclidean distance and the Pearson's correlations along time between the predicted and the target shapes. We also evaluate produced shapes of the critical articulators of specific phonemes. We show that our model can achieve good results with minimal data, producing very realistic vocal tract shapes.
|
|
Keyword:
[INFO]Computer Science [cs]; neural networks; phoneme-to-articulatory; speech production
|
|
URL: https://hal.inria.fr/hal-03360113/file/INTERSPEECH_2021_Phoneme_to_Articulatory.pdf https://hal.inria.fr/hal-03360113 https://hal.inria.fr/hal-03360113/document
|
|
BASE
|
|
Hide details
|
|
5 |
Measurement of Tongue Tip Velocity from Real-Time MRI and Phase-Contrast Cine-MRI in Consonant Production
|
|
|
|
In: ISSN: 2313-433X ; Journal of Imaging ; https://hal.univ-lorraine.fr/hal-02923466 ; Journal of Imaging, MDPI, 2020, 6 (5), pp.31. ⟨10.3390/jimaging6050031⟩ (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Using Silence MR Image to Synthesise Dynamic MRI Vocal Tract Data of CV
|
|
|
|
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-03090808 ; INTERSPEECH 2020, Oct 2020, Shangaï / Virtual, China ; http://www.interspeech2020.org/ (2020)
|
|
BASE
|
|
Show details
|
|
7 |
DNN-Based Parametric Speech Synthesis Enhanced With Articulatory Information
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090869 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Tracking the tongue contours in rt-MRI films with an autoencoder DNN approach
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090859 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
|
|
BASE
|
|
Show details
|
|
9 |
Vocal tract sagittal slices estimation from MRI midsagittal slices during speech production of CV
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090865 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/program.html (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Synthesize MRI vocal tract data during CV production
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090873 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
|
|
BASE
|
|
Show details
|
|
11 |
F1 and F2 measurements for French oral vowel with a new pneumotachograph mask
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090851 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
|
|
BASE
|
|
Show details
|
|
12 |
Acoustic impacts of geometric approximation at the level of velum and epiglottis on French vowels
|
|
|
|
In: ICPhS 2019 - International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02180566 ; ICPhS 2019 - International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
|
|
BASE
|
|
Show details
|
|
13 |
A Multimodal Real-Time MRI Articulatory Corpus of French for Speech Research
|
|
|
|
In: INTERSPEECH 2019 - 20th Annual Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-02167756 ; INTERSPEECH 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Can static vocal tract positions represent articulatory targets in continuous speech? Matching static MRI captures against real-time MRI for the French language
|
|
|
|
In: ICPhS 2019 - International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02181314 ; ICPhS 2019 - International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Comparison between 2D and 3D models for speech production: a study of French vowels
|
|
|
|
In: ICPhS 2019 - International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02180606 ; ICPhS 2019 - International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
|
|
BASE
|
|
Show details
|
|
16 |
Glottal Opening Measurements in VCV and VCCV Sequences
|
|
|
|
In: ICA 2019 - 23rd International Congress on Acoustics ; https://hal.inria.fr/hal-02180626 ; ICA 2019 - 23rd International Congress on Acoustics, Sep 2019, Aachen, Germany (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Effect of head posture on phonation of French vowels
|
|
|
|
In: ICPhS 2019 - Proceedings of International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02180486 ; ICPhS 2019 - Proceedings of International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
|
|
BASE
|
|
Show details
|
|
18 |
Simulating alveolar trills using a two-mass model of the tongue tip
|
|
|
|
In: ISSN: 0001-4966 ; EISSN: 1520-8524 ; Journal of the Acoustical Society of America ; https://hal.archives-ouvertes.fr/hal-01525882 ; Journal of the Acoustical Society of America, Acoustical Society of America, In press, 142 (5), ⟨10.1121/1.5012688⟩ (2017)
|
|
BASE
|
|
Show details
|
|
19 |
Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets
|
|
|
|
In: ISSP 2017 - 11th International Seminar on Speech Production ; https://hal.archives-ouvertes.fr/hal-01643487 ; ISSP 2017 - 11th International Seminar on Speech Production, Oct 2017, Tianjin, China (2017)
|
|
BASE
|
|
Show details
|
|
20 |
End-to-End Acoustic Feedback in Language Learning for Correcting Devoiced French Final-Fricatives
|
|
|
|
In: Interspeech 2017 ; https://hal.inria.fr/hal-01721562 ; Interspeech 2017, Aug 2017, Stockholm, Sweden. pp.1-5, ⟨10.21437/Interspeech.2017-1031⟩ (2017)
|
|
BASE
|
|
Show details
|
|
|
|