1 |
Xie, X., Liu, L., & Jaeger, T. F. (2021-JEP:G). Cross-talker generalization in the perception of non-nativespeech: a large-scale replication ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Automatic Speech Recognition Performance Improvement for Mandarin Based on Optimizing Gain Control Strategy
|
|
|
|
In: Sensors; Volume 22; Issue 8; Pages: 3027 (2022)
|
|
Abstract:
Automatic speech recognition (ASR) is an essential technique of human–computer interactions; gain control is a commonly used operation in ASR. However, inappropriate gain control strategies can lead to an increase in the word error rate (WER) of ASR. As there is a current lack of sufficient theoretical analyses and proof of the relationship between gain control and WER, various unconstrained gain control strategies have been adopted on realistic ASR systems, and the optimal gain control with respect to the lowest WER, is rarely achieved. A gain control strategy named maximized original signal transmission (MOST) is proposed in this study to minimize the adverse impact of gain control on ASR systems. First, by modeling the gain control strategy, the quantitative relationship between the gain control strategy and the ASR performance was established using the noise figure index. Second, through an analysis of the quantitative relationship, an optimal MOST gain control strategy with minimal performance degradation was theoretically deduced. Finally, comprehensive comparative experiments on a Mandarin dataset show that the proposed MOST gain control strategy can significantly reduce the WER of the experimental ASR system, with a 10% mean absolute WER reduction at −9 dB gain.
|
|
Keyword:
automatic speech recognition (ASR); gain control; human–computer interaction; maximized original signal transmission (MOST); noise figure; word error rate (WER)
|
|
URL: https://doi.org/10.3390/s22083027
|
|
BASE
|
|
Hide details
|
|
3 |
Influence of Highly Inflected Word Forms and Acoustic Background on the Robustness of Automatic Speech Recognition for Human–Computer Interaction
|
|
|
|
In: Mathematics; Volume 10; Issue 5; Pages: 711 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
What Do Cognitive Networks Do? Simulations of Spoken Word Recognition Using the Cognitive Network Science Approach
|
|
|
|
BASE
|
|
Show details
|
|
6 |
An examination of reading, reading development and disorder in a highly transparent orthography: the case of Turkish
|
|
|
|
BASE
|
|
Show details
|
|
7 |
The Effect of Orthographic Transparency on Auditory Word Recognition Across the Development of Reading Proficiency
|
|
|
|
In: ISSN: 1664-1078 ; Frontiers in Psychology ; https://hal.archives-ouvertes.fr/hal-03340208 ; Frontiers in Psychology, Frontiers, 2021, 12, ⟨10.3389/fpsyg.2021.691989⟩ (2021)
|
|
BASE
|
|
Show details
|
|
8 |
COSMO-Onset: A Neurally-Inspired Computational Model of Spoken Word Recognition, Combining Top-Down Prediction and Bottom-Up Detection of Syllabic Onsets
|
|
|
|
In: ISSN: 1662-5137 ; Frontiers in Systems Neuroscience ; https://hal.archives-ouvertes.fr/hal-03318691 ; Frontiers in Systems Neuroscience, Frontiers, 2021, 15, pp.653975. ⟨10.3389/fnsys.2021.653975⟩ (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Online activation of L1 Danish orthography enhances spoken word recognition of Swedish
|
|
|
|
In: ISSN: 0332-5865 ; Nordic Journal of Linguistics ; https://hal-amu.archives-ouvertes.fr/hal-03283527 ; Nordic Journal of Linguistics, 2021, pp.1-19. ⟨10.1017/S0332586521000056⟩ (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Perceptual vowel contrast reduction in Australian English /l/-final rimes
|
|
|
|
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 12, No 1 (2021); 9 ; 1868-6354 (2021)
|
|
BASE
|
|
Show details
|
|
12 |
The representation of variable tone sandhi patterns in Shanghai Wu
|
|
|
|
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 12, No 1 (2021); 15 ; 1868-6354 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Phonological co-activation in L2 visual word recognition : Cross-script phonological priming with bilingual readers of Chinese and English ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Cohort-Selective Gamma Rhythms Support Hierarchical Visual Processing During Word Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
The inhibitory effect of a masked word-prime in a lexical decision task: effect of the relative lexical frequency and the previous exposure ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
The interplay between linguistic and embodied systems in conceptual processing ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Phonological precision for word recognition in skilled readers ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Reading proficiency and compound word processing in monolinguals and bilinguals ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
A Hybrid Speech Enhancement Algorithm for Voice Assistance Application
|
|
|
|
In: Sensors ; Volume 21 ; Issue 21 (2021)
|
|
BASE
|
|
Show details
|
|
20 |
Is Developmental Dyslexia Due to a Visual and Not a Phonological Impairment?
|
|
|
|
In: Brain Sciences ; Volume 11 ; Issue 10 (2021)
|
|
BASE
|
|
Show details
|
|
|
|