1 |
Xie, X., Liu, L., & Jaeger, T. F. (2021-JEP:G). Cross-talker generalization in the perception of non-native speech: a large-scale replication ...
2 |
Automatic Speech Recognition Performance Improvement for Mandarin Based on Optimizing Gain Control Strategy
In: Sensors; Volume 22; Issue 8; Pages: 3027 (2022)
3 |
Influence of Highly Inflected Word Forms and Acoustic Background on the Robustness of Automatic Speech Recognition for Human–Computer Interaction
In: Mathematics; Volume 10; Issue 5; Pages: 711 (2022)
Abstract:
Automatic speech recognition is essential for establishing natural communication with a human–computer interface. Speech recognition accuracy strongly depends on the complexity of language. Highly inflected word forms are a type of unit present in some languages. The acoustic background presents an additional important degradation factor influencing speech recognition accuracy. While the acoustic background has been studied extensively, the highly inflected word forms and their combined influence still present a major research challenge. Thus, a novel type of analysis is proposed, where a dedicated speech database comprised solely of highly inflected word forms is constructed and used for tests. Dedicated test sets with various acoustic backgrounds were generated and evaluated with the Slovenian UMB BN speech recognition system. The baseline word accuracy of 93.88% and 98.53% was reduced to as low as 23.58% and 15.14% for the various acoustic backgrounds. The analysis shows that the word accuracy degradation depends on and changes with the acoustic background type and level. The highly inflected word forms’ test sets without background decreased word accuracy from 93.3% to only 63.3% in the worst case. The impact of highly inflected word forms on speech recognition accuracy was reduced with the increased levels of acoustic background and was, in these cases, similar to the non-highly inflected test sets. The results indicate that alternative methods in constructing speech databases, particularly for low-resourced Slovenian language, could be beneficial.
Keywords:
acoustic background; acoustic modeling; automatic speech recognition; highly inflected word forms; human–computer interaction
URL: https://doi.org/10.3390/math10050711
5 |
What Do Cognitive Networks Do? Simulations of Spoken Word Recognition Using the Cognitive Network Science Approach
6 |
An examination of reading, reading development and disorder in a highly transparent orthography: the case of Turkish
7 |
The Effect of Orthographic Transparency on Auditory Word Recognition Across the Development of Reading Proficiency
In: ISSN: 1664-1078 ; Frontiers in Psychology ; https://hal.archives-ouvertes.fr/hal-03340208 ; Frontiers in Psychology, Frontiers, 2021, 12, ⟨10.3389/fpsyg.2021.691989⟩ (2021)
8 |
COSMO-Onset: A Neurally-Inspired Computational Model of Spoken Word Recognition, Combining Top-Down Prediction and Bottom-Up Detection of Syllabic Onsets
In: ISSN: 1662-5137 ; Frontiers in Systems Neuroscience ; https://hal.archives-ouvertes.fr/hal-03318691 ; Frontiers in Systems Neuroscience, Frontiers, 2021, 15, pp.653975. ⟨10.3389/fnsys.2021.653975⟩ (2021)
9 |
Online activation of L1 Danish orthography enhances spoken word recognition of Swedish
In: ISSN: 0332-5865 ; Nordic Journal of Linguistics ; https://hal-amu.archives-ouvertes.fr/hal-03283527 ; Nordic Journal of Linguistics, 2021, pp.1-19. ⟨10.1017/S0332586521000056⟩ (2021)
11 |
Perceptual vowel contrast reduction in Australian English /l/-final rimes
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 12, No 1 (2021); 9 ; 1868-6354 (2021)
12 |
The representation of variable tone sandhi patterns in Shanghai Wu
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 12, No 1 (2021); 15 ; 1868-6354 (2021)
13 |
Phonological co-activation in L2 visual word recognition : Cross-script phonological priming with bilingual readers of Chinese and English ...
14 |
Cohort-Selective Gamma Rhythms Support Hierarchical Visual Processing During Word Recognition ...
15 |
The inhibitory effect of a masked word-prime in a lexical decision task: effect of the relative lexical frequency and the previous exposure ...
16 |
The interplay between linguistic and embodied systems in conceptual processing ...
17 |
Phonological precision for word recognition in skilled readers ...
18 |
Reading proficiency and compound word processing in monolinguals and bilinguals ...
19 |
A Hybrid Speech Enhancement Algorithm for Voice Assistance Application
In: Sensors; Volume 21; Issue 21 (2021)
20 |
Is Developmental Dyslexia Due to a Visual and Not a Phonological Impairment?
In: Brain Sciences; Volume 11; Issue 10 (2021)