1 |
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.archives-ouvertes.fr/hal-03232723 ; International Journal of Speech Technology, Springer Verlag, In press, ⟨10.1007/s10772-021-09862-8⟩ (2021)
BASE
2 |
MRI Vocal Tract Sagittal Slices Estimation during Speech Production of CV
In: EUSIPCO 2020 - 28th European Signal Processing Conference ; https://hal.inria.fr/hal-03090824 ; EUSIPCO 2020 - 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands ; https://eusipco2020.org/ (2021)
Abstract:
In this paper we propose an algorithm for estimating parasagittal slices of the vocal tract in order to give a better overview of the behaviour of the articulators during speech production. The first step is to align the consonant-vowel (CV) data across the sagittal planes for the training speaker. Sets of transformations that connect the midsagittal frames with the neighbouring ones are then acquired for the training speaker. Another set of transformations, which maps the midsagittal frames of the training speaker to the corresponding midsagittal frames of the test speaker, is calculated and used to adapt the previously computed sets of transformations to the test speaker domain. The newly adapted transformations are applied to the midsagittal frames of the test speaker in order to estimate the neighbouring sagittal frames. Several single-speaker models are combined to produce the final frame estimate. To evaluate the results, image cross-correlation between the original and the estimated frames was used. Results show good agreement between the original and the estimated frames.
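The evaluation step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which form of image cross-correlation was used, so this example assumes a zero-mean normalized cross-correlation at zero lag between two equally sized grayscale frames. The function name `normalized_cross_correlation` and the toy frames are hypothetical and are not taken from the paper.

```python
import math


def normalized_cross_correlation(original, estimated):
    """Zero-mean normalized cross-correlation of two grayscale frames.

    Each frame is a list of rows of pixel intensities. This is an assumed
    formulation for illustration, not the paper's stated metric.
    The result lies in [-1, 1]; 1 means the frames agree up to a positive
    gain and an offset in intensity.
    """
    a = [float(p) for row in original for p in row]
    b = [float(p) for row in estimated for p in row]
    if len(a) != len(b) or not a:
        raise ValueError("frames must be non-empty and have the same size")

    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [x - mean_a for x in a]
    db = [y - mean_b for y in b]

    numerator = sum(x * y for x, y in zip(da, db))
    denominator = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denominator == 0.0:
        raise ValueError("correlation is undefined for a constant frame")
    return numerator / denominator


# Toy frames (hypothetical data): the estimate is a brightened,
# contrast-stretched copy of the original, so the score is close to 1.
original = [[0, 10, 20], [30, 40, 50]]
estimated = [[5, 25, 45], [65, 85, 105]]
score = normalized_cross_correlation(original, estimated)
```

Subtracting the mean and dividing by the norms makes the score insensitive to global brightness and contrast differences, which is one plausible reason to prefer it over a raw pixel difference when comparing MRI frames.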
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; Image transformation; RtMRI data; speech resources enrichment; vocal tract
URL: https://hal.inria.fr/hal-03090824/document https://hal.inria.fr/hal-03090824 https://hal.inria.fr/hal-03090824/file/3D_EUSIPCO_2020.pdf
BASE
3 |
Some consideration on expressive audiovisual speech corpus acquisition using a multimodal platform
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02907046 ; Language Resources and Evaluation, Springer Verlag, 2020, ⟨10.1007/s10579-020-09500-w⟩ ; https://link.springer.com/article/10.1007%2Fs10579-020-09500-w (2020)
BASE
4 |
DNN-Based Parametric Speech Synthesis Enhanced With Articulatory Information
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090869 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
BASE
5 |
Synthesize MRI vocal tract data during CV production
In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090873 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
BASE
6 |
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
BASE
7 |
Duration modelling and evaluation for Arabic statistical parametric speech synthesis
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.inria.fr/hal-03007287 ; Multimedia Tools and Applications, Springer Verlag, 2020, ⟨10.1007/s11042-020-09901-7⟩ (2020)
BASE
8 |
Comparison between 2D and 3D models for speech production: a study of French vowels
In: ICPhS 2019 - International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02180606 ; ICPhS 2019 - International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
BASE
9 |
Glottal Opening Measurements in VCV and VCCV Sequences
In: ICA 2019 - 23rd International Congress on Acoustics ; https://hal.inria.fr/hal-02180626 ; ICA 2019 - 23rd International Congress on Acoustics, Sep 2019, Aachen, Germany (2019)
BASE
10 |
A French-Spanish Multimodal Speech Communication Corpus Incorporating Acoustic Data, Facial, Hands and Arms Gestures Information
In: Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-01862585 ; Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India (2018)
BASE
11 |
Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic
In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.inria.fr/hal-01936963 ; International Journal of Speech Technology, Springer Verlag, 2018, pp.1-12. ⟨10.1007/s10772-018-09558-6⟩ (2018)
BASE
12 |
About vocabulary adaptation for automatic speech recognition of video data
In: ICNLSSP'2017 - International Conference on Natural Language, Signal and Speech Processing ; https://hal.inria.fr/hal-01649057 ; ICNLSSP'2017 - International Conference on Natural Language, Signal and Speech Processing, Dec 2017, Casablanca, Morocco. pp.1-5 (2017)
BASE
13 |
An analysis of environment, microphone and data simulation mismatches in robust speech recognition
In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.inria.fr/hal-01399180 ; Computer Speech and Language, Elsevier, 2017, 46, pp.535-557. ⟨10.1016/j.csl.2016.11.005⟩ (2017)
BASE
14 |
Prosodic Parameters and Prosodic Structures of French Emotional Data
In: Speech Prosody 2016 ; https://hal.inria.fr/hal-01293516 ; Speech Prosody 2016, May 2016, Boston, United States (2016)
BASE
15 |
A French corpus for distant-microphone speech processing in real homes
In: Interspeech 2016 ; https://hal.inria.fr/hal-01343060 ; Interspeech 2016, Sep 2016, San Francisco, United States (2016)
BASE
16 |
The IFCASL Corpus of French and German Non-native and Native Read Speech
In: LREC'2016, 10th edition of the Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-01293935 ; LREC'2016, 10th edition of the Language Resources and Evaluation Conference, May 2016, Portorož, Slovenia (2016)
BASE
17 |
Adding new words into a language model using parameters of known words with similar behavior
In: Proceedings ICNLSP'2015, International Conference on Natural Language and Speech Processing ; International Conference on Natural Language and Speech Processing ; https://hal.inria.fr/hal-01184194 ; International Conference on Natural Language and Speech Processing, Oct 2015, Alger, Algeria (2015)
BASE
18 |
Sound synchronization and motion compensated reconstruction for speech Cine MRI
In: ISMRM 2015 Annual Meeting ; https://hal.inria.fr/hal-01183504 ; ISMRM 2015 Annual Meeting, May 2015, Toronto, Canada (2015)
BASE
19 |
The third 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
In: 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015) ; https://hal.inria.fr/hal-01211376 ; 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), Dec 2015, Scottsdale, AZ, United States (2015)
BASE
20 |
Textual Data Selection for Language Modelling in the Scope of Automatic Speech Recognition
In: Proceedings ICNLSP'2015, International Conference on Natural Language and Speech Processing ; International Conference on Natural Language and Speech Processing ; https://hal.inria.fr/hal-01184192 ; International Conference on Natural Language and Speech Processing, Oct 2015, Alger, Algeria (2015)
BASE