1 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Differentially private speaker anonymization
|
|
|
|
In: https://hal.inria.fr/hal-03588932 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender
|
|
|
|
In: PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02995862 ; PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence, Feb 2021, Virtual, China (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
|
|
|
|
In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.archives-ouvertes.fr/hal-03232723 ; International Journal of Speech Technology, Springer Verlag, In press, ⟨10.1007/s10772-021-09862-8⟩ (2021)
|
|
BASE
|
|
Show details
|
|
5 |
MRI Vocal Tract Sagittal Slices Estimation during Speech Production of CV
|
|
|
|
In: EUSIPCO 2020 - 28th European Signal Processing Conference ; https://hal.inria.fr/hal-03090824 ; EUSIPCO 2020 - 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands ; https://eusipco2020.org/ (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Learning emotions latent representation with CVAE for Text-Driven Expressive AudioVisual Speech Synthesis
|
|
|
|
In: ISSN: 0893-6080 ; Neural Networks ; https://hal.inria.fr/hal-03204193 ; Neural Networks, Elsevier, 2021, 141, pp.315-329. ⟨10.1016/j.neunet.2021.04.021⟩ (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Automated audio captioning by fine-tuning bart with audioset tags
|
|
|
|
In: DCASE 2021 - 6th Workshop on Detection and Classification of Acoustic Scenes and Events ; https://hal.inria.fr/hal-03522488 ; DCASE 2021 - 6th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2021, Virtual, Spain (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Audio-driven speech animation using recurrent neutral network
|
|
|
|
In: https://hal.inria.fr/hal-03167213 ; United States, Patent n° : WO2021023861. 2021 (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Measurement of Tongue Tip Velocity from Real-Time MRI and Phase-Contrast Cine-MRI in Consonant Production
|
|
|
|
In: ISSN: 2313-433X ; Journal of Imaging ; https://hal.univ-lorraine.fr/hal-02923466 ; Journal of Imaging, MDPI, 2020, 6 (5), pp.31. ⟨10.3390/jimaging6050031⟩ (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Using Silence MR Image to Synthesise Dynamic MRI Vocal Tract Data of CV
|
|
|
|
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-03090808 ; INTERSPEECH 2020, Oct 2020, Shangaï / Virtual, China ; http://www.interspeech2020.org/ (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Some consideration on expressive audiovisual speech corpus acquisition using a multimodal platform
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02907046 ; Language Resources and Evaluation, Springer Verlag, 2020, ⟨10.1007/s10579-020-09500-w⟩ ; https://link.springer.com/article/10.1007%2Fs10579-020-09500-w (2020)
|
|
BASE
|
|
Show details
|
|
12 |
DNN-Based Parametric Speech Synthesis Enhanced With Articulatory Information
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090869 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Tracking the tongue contours in rt-MRI films with an autoencoder DNN approach
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090859 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Vocal tract sagittal slices estimation from MRI midsagittal slices during speech production of CV
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090865 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/program.html (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Synthesize MRI vocal tract data during CV production
|
|
|
|
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090873 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Parametric synthesis of Arabic speech ; Synthèse paramétrique de la parole Arabe
|
|
|
|
In: https://hal.univ-lorraine.fr/tel-03050597 ; Traitement du signal et de l'image [eess.SP]. Université de Lorraine; Université de Tunis El Manar (Tunisie), 2020. Français. ⟨NNT : 2020LORR0116⟩ (2020)
|
|
BASE
|
|
Show details
|
|
17 |
Introducing the VoicePrivacy initiative
|
|
|
|
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02562199 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
18 |
A comparative study of speech anonymization metrics
|
|
|
|
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02907918 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
19 |
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
|
|
|
|
In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
|
|
BASE
|
|
Show details
|
|
20 |
Duration modelling and evaluation for Arabic statistical parametric speech synthesis
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.inria.fr/hal-03007287 ; Multimedia Tools and Applications, Springer Verlag, 2020, ⟨10.1007/s11042-020-09901-7⟩ (2020)
|
|
BASE
|
|
Show details
|
|
|
|