Hits 1 – 20 of 52

1	Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
	Sen, Nirmalya; Sahidullah, Md; Patil, Hemant...
	In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.archives-ouvertes.fr/hal-03232723 ; International Journal of Speech Technology, Springer Verlag, In press, ⟨10.1007/s10772-021-09862-8⟩ (2021)
	BASE

2	MRI Vocal Tract Sagittal Slices Estimation during Speech Production of CV
	Douros, Ioannis; Kulkarni, Ajinkya; Xie, Yu; Dourou, Chrysanthi; Felblinger, Jacques; Isaieva, Karyna; Vuissoz, Pierre-André; Laprie, Yves
	In: EUSIPCO 2020 - 28th European Signal Processing Conference ; https://hal.inria.fr/hal-03090824 ; EUSIPCO 2020 - 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands ; https://eusipco2020.org/ (2021)
	Abstract: In this paper we propose an algorithm for estimating parasagittal slices of the vocal tract, in order to give a better overview of the behaviour of the articulators during speech production. The first step is to align the consonant-vowel (CV) data across the sagittal planes for the training speaker. Sets of transformations that connect the midsagittal frames with the neighbouring ones are then acquired for the training speaker. Another set of transformations, which maps the midsagittal frames of the training speaker to the corresponding midsagittal frames of the test speaker, is calculated and used to adapt the previously computed sets of transformations to the test speaker domain. The newly adapted transformations are applied to the midsagittal frames of the test speaker in order to estimate the neighbouring sagittal frames. Several single-speaker models are combined to produce the final frame estimation. To evaluate the results, image cross-correlation between the original and the estimated frames was used. Results show good agreement between the original and the estimated frames.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; Image transformation; RtMRI data; speech resources enrichment; vocal tract
	URL: https://hal.inria.fr/hal-03090824/document https://hal.inria.fr/hal-03090824 https://hal.inria.fr/hal-03090824/file/3D_EUSIPCO_2020.pdf
	BASE

3	Some consideration on expressive audiovisual speech corpus acquisition using a multimodal platform
	Dahmani, Sara; Colotte, Vincent; Ouni, Slim
	In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02907046 ; Language Resources and Evaluation, Springer Verlag, 2020, ⟨10.1007/s10579-020-09500-w⟩ ; https://link.springer.com/article/10.1007%2Fs10579-020-09500-w (2020)
	BASE

4	DNN-Based Parametric Speech Synthesis Enhanced With Articulatory Information
	Tsukanova, Anastasiia; Douros, Ioannis; Laprie, Yves
	In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090869 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
	BASE

5	Synthesize MRI vocal tract data during CV production
	Douros, Ioannis; Dourou, Chrysanthi; Xie, Yu...
	In: ISSP 2020 - 12th International Seminar on Speech Production ; https://hal.inria.fr/hal-03090873 ; ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence / Virtual, United States ; https://issp2020.yale.edu/ (2020)
	BASE

6	CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
	Watanabe, Shinji; Mandel, Michael; Barker, Jon...
	In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
	BASE

7	Duration modelling and evaluation for Arabic statistical parametric speech synthesis
	Zangar, Imene; Mnasri, Zied; Colotte, Vincent...
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.inria.fr/hal-03007287 ; Multimedia Tools and Applications, Springer Verlag, 2020, ⟨10.1007/s11042-020-09901-7⟩ (2020)
	BASE

8	Comparison between 2D and 3D models for speech production: a study of French vowels
	Douros, Ioannis; Vuissoz, Pierre-André; Laprie, Yves
	In: ICPhS 2019 - International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02180606 ; ICPhS 2019 - International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
	BASE

9	Glottal Opening Measurements in VCV and VCCV Sequences
	Elie, Benjamin; Amelot, Angelique; Laprie, Yves...
	In: ICA 2019 - 23rd International Congress on Acoustics ; https://hal.inria.fr/hal-02180626 ; ICA 2019 - 23rd International Congress on Acoustics, Sep 2019, Aachen, Germany (2019)
	BASE

10	A French-Spanish Multimodal Speech Communication Corpus Incorporating Acoustic Data, Facial, Hands and Arms Gestures Information
	Terissi, Lucas; Sad, Gonzalo; Cerda, Mauricio...
	In: Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-01862585 ; Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India (2018)
	BASE

11	Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic
	Houidhek, Amal; Colotte, Vincent; Mnasri, Zied...
	In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.inria.fr/hal-01936963 ; International Journal of Speech Technology, Springer Verlag, 2018, pp.1-12. ⟨10.1007/s10772-018-09558-6⟩ (2018)
	BASE

12	About vocabulary adaptation for automatic speech recognition of video data
	Jouvet, Denis; Langlois, David; Menacer, Mohamed Amine...
	In: ICNLSSP'2017 - International Conference on Natural Language, Signal and Speech Processing ; https://hal.inria.fr/hal-01649057 ; ICNLSSP'2017 - International Conference on Natural Language, Signal and Speech Processing, Dec 2017, Casablanca, Morocco. pp.1-5 (2017)
	BASE

13	An analysis of environment, microphone and data simulation mismatches in robust speech recognition
	Vincent, Emmanuel; Watanabe, Shinji; Nugraha, Aditya Arie...
	In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.inria.fr/hal-01399180 ; Computer Speech and Language, Elsevier, 2017, 46, pp.535-557. ⟨10.1016/j.csl.2016.11.005⟩ (2017)
	BASE

14	Prosodic Parameters and Prosodic Structures of French Emotional Data
	Bartkova, Katarina; Jouvet, Denis; Delais-Roussarie, Elisabeth
	In: Speech Prosody 2016 ; https://hal.inria.fr/hal-01293516 ; Speech Prosody 2016, May 2016, Boston, United States (2016)
	BASE

15	A French corpus for distant-microphone speech processing in real homes
	Bertin, Nancy; Camberlein, Ewen; Vincent, Emmanuel...
	In: Interspeech 2016 ; https://hal.inria.fr/hal-01343060 ; Interspeech 2016, Sep 2016, San Francisco, United States (2016)
	BASE

16	The IFCASL Corpus of French and German Non-native and Native Read Speech
	Trouvain, Jürgen; Bonneau, Anne; Colotte, Vincent...
	In: LREC'2016, 10th edition of the Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-01293935 ; LREC'2016, 10th edition of the Language Resources and Evaluation Conference, May 2016, Portorož, Slovenia (2016)
	BASE

17	Adding new words into a language model using parameters of known words with similar behavior
	Orosanu, Luiza; Jouvet, Denis
	In: Proceedings ICNLSP'2015, International Conference on Natural Language and Speech Processing ; International Conference on Natural Language and Speech Processing ; https://hal.inria.fr/hal-01184194 ; International Conference on Natural Language and Speech Processing, Oct 2015, Alger, Algeria (2015)
	BASE

18	Sound synchronization and motion compensated reconstruction for speech Cine MRI
	Vuissoz, Pierre-André; Odille, Freddy; Laprie, Yves...
	In: ISMRM 2015 Annual Meeting ; https://hal.inria.fr/hal-01183504 ; ISMRM 2015 Annual Meeting, May 2015, Toronto, Canada (2015)
	BASE

19	The third 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
	Barker, Jon; Marxer, Ricard; Vincent, Emmanuel...
	In: 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015) ; https://hal.inria.fr/hal-01211376 ; 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), Dec 2015, Scottsdale, AZ, United States (2015)
	BASE

20	Textual Data Selection for Language Modelling in the Scope of Automatic Speech Recognition
	Mezzoudj, Freha; Langlois, David; Jouvet, Denis...
	In: Proceedings ICNLSP'2015, International Conference on Natural Language and Speech Processing ; International Conference on Natural Language and Speech Processing ; https://hal.inria.fr/hal-01184192 ; International Conference on Natural Language and Speech Processing, Oct 2015, Alger, Algeria (2015)
	BASE
