DE eng

Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Multitask Learning for Grapheme-to-Phoneme Conversion of Anglicisms in German Speech Recognition ...
BASE
Show details
2
Improving Grapheme-to-Phoneme Conversion for Anglicisms in German Speech Recognition
In: Fraunhofer IAIS (2021)
Abstract: This work designs and evaluates methods for improving the recognition of anglicisms in German speech recognition. Focusing on the pronunciation dictionary of an ASR system, three approaches were designed and implemented for creating supplementary anglicism pronunciation dictionaries. In the first approach, anglicism pronunciations were directly derived from the German Wiktionary. In the second approach, anglicism pronunciations were generated with both a German and an English G2P model. By comparing the confidence measures, the respective best pronunciation was chosen to be added to the resulting anglicism pronunciation dictionary. An additional P2P model was created for this approach that maps English phonemes to their German equivalents. In the third approach, multitask learning was util ized by adding an additional anglicism classification task to a German Seq2Seq G2P model. By distinguishing anglicisms and native German words, the G2P model was able to generate different pronunciations for each respective case. For each resulting anglicism pronunciation dictionary, a dedicated ASR model was created with similar settings. All ASR models including a baseline model were evaluated on a dedicated anglicism test set and two additional German test sets from the broadcast domain to prevent performance issues in other use cases. Ten out of thirteen models performed better than the baseline. The best model resulted from the comparative approach. For the anglicism test set, the WER could be decreased by 0.21 percentage points with 22 more anglicism being recognized compared to the baseline model. The mean WER based on all test sets was decreased by 0.08 percentage points. More anglicism data of better quality and refined model implementations are needed to further improve the anglicism recognition results.
Keyword: Anglicisms; ASR; automatic speech recognition; G2P; grapheme-to-phoneme; loanwords; MTL; multitask learning; P2P; phoneme-to-phoneme; Seq2Seq; sequence-to-sequence
URL: http://publica.fraunhofer.de/documents/N-634677.html
BASE
Hide details
3
Using Automatic Speech Recognition in Spoken Corpus Curation
Gorisch, Jan [Verfasser]; Gref, Michael [Verfasser]; Schmidt, Thomas [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
4
Using Automatic Speech Recognition in Spoken Corpus Curation
Calzolari, Nicoletta [Herausgeber]; Mazo, Hélène [Herausgeber]; Gorisch, Jan [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
5
Using Automatic Speech Recognition in Spoken Corpus Curation
In: Fraunhofer IAIS (2020)
BASE
Show details
6
Speech Analytics in Research Based on Qualitative Interviews. Experiences from KA3
In: Fraunhofer IAIS (2018)
BASE
Show details
7
KA3. Weiterentwicklung von Sprachtechnologien im Kontext der Oral History
In: Fraunhofer IAIS (2017)
BASE
Show details
8
Using Automatic Speech Recognition in Spoken Corpus Curation [Online resource]
IDS-Repository
Show details
9
Using Automatic Speech Recognition in Spoken Corpus Curation [Online resource]
IDS-Repository
Show details

Catalogues
0
0
0
0
2
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
5
0
2
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern