DE eng

Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
Automatic Pronunciation Generation by Utilizing a Semi-supervised Deep Neural Networks ...
Abstract: Phonemic or phonetic sub-word units are the most commonly used atomic elements to represent speech signals in modern ASRs. However they are not the optimal choice due to several reasons such as: large amount of effort required to handcraft a pronunciation dictionary, pronunciation variations, human mistakes and under-resourced dialects and languages. Here, we propose a data-driven pronunciation estimation and acoustic modeling method which only takes the orthographic transcription to jointly estimate a set of sub-word units and a reliable dictionary. Experimental results show that the proposed method which is based on semi-supervised training of a deep neural network largely outperforms phoneme based continuous speech recognition on the TIMIT dataset. ... : Proc. of 17th Interspeech (2016), San Francisco, California, USA ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG; Sound cs.SD
URL: https://arxiv.org/abs/1606.05007
https://dx.doi.org/10.48550/arxiv.1606.05007
BASE
Hide details
2
The SIWIS database: a multilingual speech database with acted emphasis
In: http://infoscience.epfl.ch/record/218855 (2016)
BASE
Show details
3
Syntactic language modeling with formal grammars
In: Speech communication. - Amsterdam [u.a.] : Elsevier 54 (2012) 6, 715-731
BLLDB
OLC Linguistik
Show details
4
Text analysis and language identification for polyglot text-to-speech synthesis
In: Speech communication. - Amsterdam [u.a.] : Elsevier 49 (2007) 9, 697-724
BLLDB
OLC Linguistik
Show details
5
Text analysis and language identification for polyglot text-to-speech synthesis
In: Speech Communication (2007) 9, 697-724
IDS Bibliografie zur deutschen Grammatik
Show details
6
Fundamentals of speech synthesis and speech recognition : basic concepts, state of the art and future challenges
Zellner, Brigitte (Mitarb.); Pfister, Beat (Mitarb.); Summerfield, Quentin (Mitarb.). - Chichester [u.a.] : Wiley, 1994
BLLDB
UB Frankfurt Linguistik
Show details
7
Text-to-speech synthesis : an introduction and a case study
In: Fundamentals of speech synthesis and speech recognition (Chichester [etc.], 1994), p. 87-108
MPI für Psycholinguistik
Show details

Catalogues
1
0
2
0
0
0
0
Bibliographies
3
0
1
0
0
0
0
0
1
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern