Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4

Hits 1 – 20 of 68

1	Influence of Highly Inflected Word Forms and Acoustic Background on the Robustness of Automatic Speech Recognition for Human–Computer Interaction
	Andrej Zgank
	In: Mathematics; Volume 10; Issue 5; Pages: 711 (2022)
	BASE
	Show details

2	Discriminative feature modeling for statistical speech recognition ...
	Tüske, Zoltán. - : RWTH Aachen University, 2021
	BASE
	Show details

3	Cross-lingual acoustic modeling in upper sorbian - preliminary study
	Duckhorn, Frank; Rjelka, Marek; Wolff, Matthias...
	In: Fraunhofer IKTS (2021)
	BASE
	Show details

4	Glottal Stops in Upper Sorbian: A Data-Driven Approach
	Wolff, Matthias; Tschöpe, Constanze; Duckhorn, Frank...
	In: Fraunhofer IKTS (2021)
	BASE
	Show details

5	Estimating the Degree of Sleepiness by Integrating Articulatory Feature Knowledge in Raw Waveform Based CNNS ...
	Fritsch, Julian; S. Pavankumar Dubagunta; Magimai.-Doss, Mathew. - : Zenodo, 2020
	BASE
	Show details

6	Estimating the Degree of Sleepiness by Integrating Articulatory Feature Knowledge in Raw Waveform Based CNNS ...
	Fritsch, Julian; S. Pavankumar Dubagunta; Magimai.-Doss, Mathew. - : Zenodo, 2020
	BASE
	Show details

7	Dealing with linguistic mismatches for automatic speech recognition
	Yang, Xuesong. - 2019
	Abstract: Recent breakthroughs in automatic speech recognition (ASR) have resulted in a word error rate (WER) on par with human transcribers on the English Switchboard benchmark. However, dealing with linguistic mismatches between the training and testing data is still a significant challenge that remains unsolved. Under the monolingual environment, it is well-known that the performance of ASR systems degrades significantly when presented with the speech from speakers with different accents, dialects, and speaking styles than those encountered during system training. Under the multi-lingual environment, ASR systems trained on a source language achieve even worse performance when tested on another target language because of mismatches in terms of the number of phonemes, lexical ambiguity, and power of phonotactic constraints provided by phone-level n-grams. In order to address the issues of linguistic mismatches for current ASR systems, my dissertation investigates both knowledge-gnostic and knowledge-agnostic solutions. In the first part, classic theories relevant to acoustics and articulatory phonetics that present capability of being transferred across a dialect continuum from local dialects to another standardized language are re-visited. Experiments demonstrate the potentials that acoustic correlates in the vicinity of landmarks could help to build a bridge for dealing with mismatches across difference local or global varieties in a dialect continuum. In the second part, we design an end-to-end acoustic modeling approach based on connectionist temporal classification loss and propose to link the training of acoustics and accent altogether in a manner similar to the learning process in human speech perception. This joint model not only performed well on ASR with multiple accents but also boosted accuracies of accent identification task in comparison to separately-trained models.
	Keyword: Acoustic Landmarks; Acoustic Modeling; Acoustic Phonetics; Automatic Speech Recognition; Connectionist Temporal Classification; Deep Learning; Distinctive Features; End-to-End; Model Compression; Multi-Accents; Multi-Lingual; Multi-Task Learning; Pronunciation Error Detection
	URL: http://hdl.handle.net/2142/105187
	BASE
	Hide details

8	Speech recognition with probabilistic transcriptions and end-to-end systems using deep learning
	Das, Amit. - 2018
	BASE
	Show details

9	Phonetic Context Embeddings for DNN-HMM Phone Recognition
	Badino, Leonardo
	In: Interspeech 2016 ; https://hal.sorbonne-universite.fr/hal-02166078 ; Interspeech 2016, Sep 2016, SAN FRANCISCO, United States. pp.405-409, ⟨10.21437/Interspeech.2016-1036⟩ (2016)
	BASE
	Show details

10	Robust automatic speech recognition for children ...
	Gurunath Shivakumar, Prashanth. - : University of Southern California Digital Library (USC.DL), 2015
	BASE
	Show details

11	Modeling of a rise-fall intonation pattern in the language of young Paris Speakers
	Paternostro, Roberto; Goldman, Jean-Philippe
	In: Speech Prosody ; https://halshs.archives-ouvertes.fr/halshs-01069584 ; Speech Prosody, 2014, 7, pp.814-818 (2014)
	BASE
	Show details

12	Vers une modélisation acoustique de l'intonation des jeunes en région parisienne : une question de " proximité " ?
	Paternostro, Roberto; Goldman, Jean-Philippe
	In: ISSN: 1661-8246 ; EISSN: 1661-8246 ; Nouveaux Cahiers de Linguistique Française ; https://halshs.archives-ouvertes.fr/halshs-01069593 ; Nouveaux Cahiers de Linguistique Française, Université de Genève, 2014, 31, pp.257-171 (2014)
	BASE
	Show details

13	Towards the automatic processing of Yongning Na (Sino-Tibetan): developing a 'light' acoustic model of the target language and testing 'heavyweight' models from five national languages
	Do, Thi-Ngoc-Diep; Michaud, Alexis; Castelli, Eric
	In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU 2014) ; 4th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU 2014) ; https://halshs.archives-ouvertes.fr/halshs-00980431 ; 4th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU 2014), May 2014, St Petersburg, Russia. pp.153-160 (2014)
	BASE
	Show details

14	Modélisation acoustico-phonétique de langues peu dotées : Études phonétiques et travaux de reconnaissance automatique en luxembourgois
	Adda-Decker, Martine; Lamel, Lori; Adda, Gilles
	In: Journées d'Etude sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843399 ; Journées d'Etude sur la Parole, Jan 2014, Le Mans, France (2014)
	BASE
	Show details

15	Speech Alignment and Recognition Experiments for Luxembourgish
	Adda-Decker, Martine; Lamel, Lori; Adda, Gilles
	In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Underresourced Languages ; 4th International Workshop on Spoken Language Technologies for Underresourced Languages ; https://hal.archives-ouvertes.fr/hal-01134824 ; 4th International Workshop on Spoken Language Technologies for Underresourced Languages, May 2014, Saint-Petersbourg, Russia. pp.53-60 ; http://www.mica.edu.vn/sltu2014/ (2014)
	BASE
	Show details

16	A First LVCSR System for Luxembourgish, a Low-Resourced European Language
	Adda-Decker, Martine; Lamel, Lori; Adda, Gilles...
	In: Human Language Technology Challenges for Computer Science and Linguistics ; https://hal.archives-ouvertes.fr/hal-01135103 ; Zygmunt Vetulani; Joseph Mariani. Human Language Technology Challenges for Computer Science and Linguistics, 8387, Springer International Publishing, pp.479-490, 2014, 5th Language and Technology Conference, LTC 2011, Poznań, Poland, November 25--27, 2011, Revised Selected Papers, 978-3-319-08957-7. ⟨10.1007/978-3-319-08958-4_39⟩ (2014)
	BASE
	Show details

17	Impact of Video Modeling Techniques on Efficiency and Effectiveness of Clinical Voice Assessment
	Bowyer, Samantha Lauren
	In: http://rave.ohiolink.edu/etdc/view?acc_num=miami1398686540 (2014)
	BASE
	Show details

18	Anger Recognition in Speech Using Acoustic and Linguistic Cues
	: Elsevier, 2013
	BASE
	Show details

19	Detection of acoustic-phonetic landmarks in mismatched conditions using a biomimetic model of human auditory processing
	Sarah King
	In: http://www.isle.uiuc.edu/%7Esborys/king_coling12.pdf (2012)
	BASE
	Show details

20	Detection of acoustic-phonetic landmarks in mismatched conditions using a biomimetic model of human auditory processing
	Sarah King; M Johnson
	In: http://aclweb.org/anthology/C/C12/C12-2058.pdf (2012)
	BASE
	Show details

Page: 1 2 3 4

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern