DE eng

Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.univ-lorraine.fr/hal-03650212 ; Speech Communication, Elsevier : North-Holland, 2022, ⟨10.1016/j.specom.2022.04.004⟩ (2022)
Abstract: International audience ; Articulatory speech synthesis requires generating realistic vocal tract shapes from thesequence of phonemes to be articulated. This work proposes the first model trained fromrt-MRI films to automatically predict all of the vocal tract articulators’ contours. The dataare the contours tracked in the rt-MRI database recorded for one speaker. Those contourswere exploited to train an encoder-decoder network to map the sequence of phonemes andtheir durations to the exact gestures performed by the speaker. Different from other works,all the individual articulator contours are predicted separately, allowing the investigation oftheir interactions. We measure four tract variables closely coupled with critical articulatorsand observe their variations over time. The test demonstrates that our model can producehigh-quality shapes of the complete vocal tract with a good correlation between the predictedand the target variables observed in rt-MRI films, even though the tract variables are notincluded in the optimization procedure.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [SDV.IB.IMA]Life Sciences [q-bio]/Bioengineering/Imaging; [SHS.INFO]Humanities and Social Sciences/Library and information sciences; Phonetic-to-articulatory; Speech production; Vocal tract shape
URL: https://doi.org/10.1016/j.specom.2022.04.004
https://hal.univ-lorraine.fr/hal-03650212
BASE
Hide details
2
Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers
In: ISSN: 2052-4463 ; EISSN: 2052-4463 ; Scientific Data ; https://hal.archives-ouvertes.fr/hal-03507532 ; Scientific Data , Nature Publishing Group, 2021, 8 (1), pp.258. ⟨10.1038/s41597-021-01041-3⟩ (2021)
BASE
Show details
3
Towards the prediction of the vocal tract shape from the sequence of phonemes to be articulated
In: iNTERSPEECH 2021 ; https://hal.inria.fr/hal-03360113 ; iNTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
3
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern