Hits 1 – 7 of 7

1	Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition
	Sivasankaran, Sunit; Vincent, Emmanuel; Fohr, Dominique
	In: EUSIPCO 2020 - 28th European Signal Processing Conference ; https://hal.inria.fr/hal-02355669 ; EUSIPCO 2020 - 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands. ⟨10.23919/Eusipco47968.2020.9287541⟩ ; https://eusipco2020.org/ (2021)
	BASE

2	SLOGD: Speaker Location Guided Deflation Approach to Speech Separation
	Sivasankaran, Sunit; Vincent, Emmanuel; Fohr, Dominique
	In: ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing ; https://hal.inria.fr/hal-02355613 ; ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain (2020)
	BASE

3	VoiceHome-2, an extended corpus for multichannel speech processing in real homes
	Bertin, Nancy; Camberlein, Ewen; Lebarbenchon, Romain...
	In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.inria.fr/hal-01923108 ; Speech Communication, Elsevier : North-Holland, 2019, 106, pp.68-78. ⟨10.1016/j.specom.2018.11.002⟩ (2019)
	BASE

4	Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment
	Sivasankaran, Sunit; Vincent, Emmanuel; Fohr, Dominique
	In: Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01817519 ; Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India (2018)
	BASE

5	Phone Merging for Code-switched Speech Recognition
	Sivasankaran, Sunit; Srivastava, Brij Mohan Lal; Sitaram, Sunayana...
	In: Third Workshop on Computational Approaches to Linguistic Code-switching ; https://hal.inria.fr/hal-01800466 ; Third Workshop on Computational Approaches to Linguistic Code-switching, collocated with ACL 2018 Jul 2018, Melbourne, Australia (2018)
	BASE

6	A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions
	Sivasankaran, Sunit; Vincent, Emmanuel; Illina, Irina
	In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.inria.fr/hal-01461382 ; Computer Speech and Language, Elsevier, 2017, 46, pp.444-460. ⟨10.1016/j.csl.2017.02.003⟩ (2017)
	Abstract: Robustness to reverberation is a key concern for distant-microphone automatic speech recognition (ASR). Various approaches have been proposed, including single-channel or multichannel dereverberation, robust feature extraction, alternative acoustic models, and acoustic model adaptation. However, to the best of our knowledge, a detailed study of these techniques in varied reverberation conditions is still missing in the literature. In this paper, we conduct a series of experiments to assess the impact of various dereverberation and acoustic model adaptation approaches on ASR performance in the range of reverberation conditions found in real domestic environments. We consider both established approaches such as weighted prediction error (WPE) dereverberation and newer approaches such as learning hidden unit contribution (LHUC) adaptation, whose performance has not been reported before in this context, and we employ them in combination. Our results indicate that performing WPE dereverberation on a reverberated test speech utterance and decoding using a deep neural network (DNN) acoustic model trained with multi-condition reverberated speech with feature-space maximum likelihood linear regression (fMLLR) transformed features outperforms more recent approaches and helps significantly reduce the word error rate (WER).
	Keyword: [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; acoustic model adaptation; dereverberation; evaluation; robust ASR
	URL: https://hal.inria.fr/hal-01461382/file/sivasankaran_CSL17.pdf ; https://hal.inria.fr/hal-01461382 ; https://doi.org/10.1016/j.csl.2017.02.003 ; https://hal.inria.fr/hal-01461382/document
	BASE

7	A French corpus for distant-microphone speech processing in real homes
	Bertin, Nancy; Camberlein, Ewen; Vincent, Emmanuel...
	In: Interspeech 2016 ; https://hal.inria.fr/hal-01343060 ; Interspeech 2016, Sep 2016, San Francisco, United States (2016)
	BASE
