Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 9 of 9

1	Multitask Learning for Grapheme-to-Phoneme Conversion of Anglicisms in German Speech Recognition ...
	Pritzen, Julia; Gref, Michael; Zühlke, Dietlind. - : arXiv, 2021
	BASE
	Show details

2	Improving Grapheme-to-Phoneme Conversion for Anglicisms in German Speech Recognition
	Pritzen, Julia Maria. - 2021
	In: Fraunhofer IAIS (2021)
	Abstract: This work designs and evaluates methods for improving the recognition of anglicisms in German speech recognition. Focusing on the pronunciation dictionary of an ASR system, three approaches were designed and implemented for creating supplementary anglicism pronunciation dictionaries. In the first approach, anglicism pronunciations were directly derived from the German Wiktionary. In the second approach, anglicism pronunciations were generated with both a German and an English G2P model. By comparing the confidence measures, the respective best pronunciation was chosen to be added to the resulting anglicism pronunciation dictionary. An additional P2P model was created for this approach that maps English phonemes to their German equivalents. In the third approach, multitask learning was util ized by adding an additional anglicism classification task to a German Seq2Seq G2P model. By distinguishing anglicisms and native German words, the G2P model was able to generate different pronunciations for each respective case. For each resulting anglicism pronunciation dictionary, a dedicated ASR model was created with similar settings. All ASR models including a baseline model were evaluated on a dedicated anglicism test set and two additional German test sets from the broadcast domain to prevent performance issues in other use cases. Ten out of thirteen models performed better than the baseline. The best model resulted from the comparative approach. For the anglicism test set, the WER could be decreased by 0.21 percentage points with 22 more anglicism being recognized compared to the baseline model. The mean WER based on all test sets was decreased by 0.08 percentage points. More anglicism data of better quality and refined model implementations are needed to further improve the anglicism recognition results.
	Keyword: Anglicisms; ASR; automatic speech recognition; G2P; grapheme-to-phoneme; loanwords; MTL; multitask learning; P2P; phoneme-to-phoneme; Seq2Seq; sequence-to-sequence
	URL: http://publica.fraunhofer.de/documents/N-634677.html
	BASE
	Hide details

3	Using Automatic Speech Recognition in Spoken Corpus Curation
	Gorisch, Jan [Verfasser]; Gref, Michael [Verfasser]; Schmidt, Thomas [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language
	Show details

4	Using Automatic Speech Recognition in Spoken Corpus Curation
	Calzolari, Nicoletta [Herausgeber]; Mazo, Hélène [Herausgeber]; Gorisch, Jan [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language
	Show details

5	Using Automatic Speech Recognition in Spoken Corpus Curation
	Gorisch, Jan; Gref, Michael; Schmidt, Thomas
	In: Fraunhofer IAIS (2020)
	BASE
	Show details

6	Speech Analytics in Research Based on Qualitative Interviews. Experiences from KA3
	Leh, Almut; Köhler, Joachim; Gref, Michael...
	In: Fraunhofer IAIS (2018)
	BASE
	Show details

7	KA3. Weiterentwicklung von Sprachtechnologien im Kontext der Oral History
	Köhler, Joachim; Gref, Michael; Leh, Almut
	In: Fraunhofer IAIS (2017)
	BASE
	Show details

8	Using Automatic Speech Recognition in Spoken Corpus Curation [Online resource]
	Gorisch, Jan; Gref, Michael; Schmidt, Thomas
	IDS-Repository
	Show details

9	Using Automatic Speech Recognition in Spoken Corpus Curation [Online resource]
	Gorisch, Jan; Gref, Michael; Schmidt, Thomas
	IDS-Repository
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern