2 |
The Personal Pronouns in the Germanic Languages : A Study of Personal Pronoun Morphology and Change in the Germanic Languages from the First Records to the Present Day
UB Frankfurt Linguistik
5 |
New Proposals for the Design of Integrated Online Wine Industry Dictionaries
In: Lexikos. Journal of the African Association for Lexicography 23 (2013), 209-227
IDS OBELEX meta
6 |
IsiXhosa Lexicography: Past, Present and Future
In: Lexikos. Journal of the African Association for Lexicography 23 (2013), 348-370
IDS OBELEX meta
7 |
Some issues affecting the transcription of Hungarian broadcast audio
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843430 ; Annual Conference of the International Speech Communication Association, Aug 2013, Lyon, France (2013)
Abstract:
This paper reports on a speech-to-text (STT) transcription system for Hungarian broadcast audio developed for the 2012 Quaero evaluations. For this evaluation, no manually transcribed audio data were provided for model training; however, a small amount of development data was provided to assess system performance. As a consequence, the acoustic models were developed in an unsupervised manner, with the only supervision provided indirectly by the language model. The language models were trained on texts downloaded from various websites, also without any speech transcripts. This contrasts with other STT systems for Hungarian broadcast audio, which use at least 10 to 50 hours of manually transcribed data for acoustic training and typically include speech transcripts in the language models. Because morph-based approaches to agglutinative languages such as Hungarian have previously given mixed results, word-based language models were used. The initial Word Error Rate (WER) of 59.8% on the 3h development corpus, obtained with context-independent seed models from other languages, was reduced to 25.0% after successive training iterations and system refinement. The same system obtained a WER of 23.3% on the independent Quaero 2012 evaluation corpus (a mix of broadcast news and broadcast conversation data). These results compare well with previously reported systems on similar data. Various issues affecting system performance are discussed, such as the amount of training data, the acoustic features, and the choice of text sources for language model training.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; agglutinative languages; Bottleneck MLP features; broadcast news transcription; Hungarian language; Large vocabulary continuous speech recognition (LVCSR); unsupervised training
URL: https://hal.archives-ouvertes.fr/hal-01843430
BASE
8 |
Two-dimensional (2D) languages and application to handwritten graphical parsing
In: https://hal.archives-ouvertes.fr/hal-00861080 ; 2013 (2013)
BASE
9 |
Separating Regular Languages by Piecewise Testable and Unambiguous Languages
In: Mathematical Foundations of Computer Science 2013 ; Mathematical Foundations of Computer Science ; https://hal.archives-ouvertes.fr/hal-00948943 ; Mathematical Foundations of Computer Science, Aug 2013, Austria. pp.729-740, ⟨10.1007/978-3-642-40313-2_64⟩ (2013)
BASE
10 |
Popularity, Interoperability, and Impact of Programming Languages in 100,000 Open Source Projects
In: Proceedings of the 37th Annual International Computer Software & Applications Conference (COMPSAC 2013) ; 37th Annual International Computer Software & Applications Conference (COMPSAC 2013) ; https://hal.archives-ouvertes.fr/hal-00809451 ; 37th Annual International Computer Software & Applications Conference (COMPSAC 2013), Jul 2013, Kyoto, Japan. pp.1-10 (2013)
BASE
11 |
On languages of one-dimensional overlapping tiles
In: 39th International Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM) ; SOFSEM ; https://hal.archives-ouvertes.fr/hal-00659202 ; SOFSEM, Jan 2013, Špindlerův Mlýn, Czech Republic. pp.244-256, ⟨10.1007/978-3-642-35843-2_22⟩ (2013)
BASE
12 |
Chinese Language Learning: Immersion and Classroom Settings
In: 2013 New England Association for Asian Studies Conference (2013)
BASE
13 |
That Poor Little Thing: The Emotive Meanings of Diminutives in Polish and Russian Translations of Alice in Wonderland
BASE
14 |
Metaphoric Truth: Seeing and Saying in Merleau-Ponty and Ricoeur, and a Broader Ethics Via Zuidervaart
BASE
15 |
Effets de l’enseignement réciproque sur la compréhension en lecture d’élèves allophones immigrants nouvellement arrivés en situation de grand retard scolaire au secondaire
BASE
16 |
Translation of the Implicit: Tracing How Language Works Beyond Gendlin and Derrida
BASE
17 |
Speech as Metaphor of Human Becoming According to St. Augustine of Hippo
BASE
18 |
The use of email attachments to increase reading compliance in foreign language classes.
In: Scholarship and Professional Work - LAS (2013)
BASE
19 |
How to Ask for a Favor: An Exploration of Speech Act Pragmatics in Heritage Russian
In: Bryn Mawr College Dissertations and Theses (2013)
BASE
20 |
Observaciones Sobre el Estado del Sonido Fricativo Palatal Sordo en el Español Salvadoreño.
In: Scholarship and Professional Work - LAS (2013)
BASE