DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...151
Hits 1 – 20 of 3.018

1
Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants
In: ISSN: 1662-4548 ; EISSN: 1662-453X ; Frontiers in Neuroscience ; https://hal.archives-ouvertes.fr/hal-03627441 ; Frontiers in Neuroscience, Frontiers, 2022, 16 (779062), ⟨10.3389/fnins.2022.779062⟩ ; https://www.frontiersin.org/articles/10.3389/fnins.2022.779062/full (2022)
BASE
Show details
2
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
BASE
Show details
3
Fine-tuning pre-trained models for Automatic Speech Recognition: experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)
In: https://halshs.archives-ouvertes.fr/halshs-03647315 ; 2022 (2022)
BASE
Show details
4
ASR training dataset for Croatian ParlaSpeech-HR v1.0
Ljubešić, Nikola; Koržinek, Danijel; Rupnik, Peter. - : Jožef Stefan Institute, 2022
BASE
Show details
5
Robust speech recognition for low-resource languages ...
Romanenko, Aleksei. - : Universität Ulm, 2022
BASE
Show details
6
Treasure Hunters 2: exploration of speech training efficacy ...
Ganzeboom, Mario; Bakker, Marjoke; Beijer, Lilian. - : Radboud University, 2022
BASE
Show details
7
MLLP-VRAIN Spanish ASR Systems for the Albayzín-RTVE 2020 Speech-to-Text Challenge: Extension
In: Applied Sciences; Volume 12; Issue 2; Pages: 804 (2022)
Abstract: This paper describes the automatic speech recognition (ASR) systems built by the MLLP-VRAIN research group of Universitat Politècnica de València for the Albayzín-RTVE 2020 Speech-to-Text Challenge, and includes an extension of the work consisting of building and evaluating equivalent systems under the closed data conditions from the 2018 challenge. The primary system (p-streaming_1500ms_nlt) was a hybrid ASR system using streaming one-pass decoding with a context window of 1.5 seconds. This system achieved 16.0% WER on the test-2020 set. We also submitted three contrastive systems. From these, we highlight the system c2-streaming_600ms_t which, following a similar configuration as the primary system with a smaller context window of 0.6 s, scored 16.9% WER points on the same test set, with a measured empirical latency of 0.81 ± 0.09 s (mean ± stdev). That is, we obtained state-of-the-art latencies for high-quality automatic live captioning with a small WER degradation of 6% relative. As an extension, the equivalent closed-condition systems obtained 23.3% WER and 23.5% WER, respectively. When evaluated with an unconstrained language model, we obtained 19.9% WER and 20.4% WER; i.e., not far behind the top-performing systems with only 5% of the full acoustic data and with the extra ability of being streaming-capable. Indeed, all of these streaming systems could be put into production environments for automatic captioning of live media streams.
Keyword: automatic speech recognition; natural language processing; streaming
URL: https://doi.org/10.3390/app12020804
BASE
Hide details
8
Evaluating Novel Speech Transcription Architectures on the Spanish RTVE2020 Database
In: Applied Sciences; Volume 12; Issue 4; Pages: 1889 (2022)
BASE
Show details
9
Automatic Speech Recognition Performance Improvement for Mandarin Based on Optimizing Gain Control Strategy
In: Sensors; Volume 22; Issue 8; Pages: 3027 (2022)
BASE
Show details
10
Automatic Speech Recognition (ASR) Systems for Children: A Systematic Literature Review
In: Applied Sciences; Volume 12; Issue 9; Pages: 4419 (2022)
BASE
Show details
11
An Empirical Performance Analysis of the Speak Correct Computerized Interface
In: Processes; Volume 10; Issue 3; Pages: 487 (2022)
BASE
Show details
12
Influence of Highly Inflected Word Forms and Acoustic Background on the Robustness of Automatic Speech Recognition for Human–Computer Interaction
In: Mathematics; Volume 10; Issue 5; Pages: 711 (2022)
BASE
Show details
13
Fine-tuning pre-trained models for Automatic Speech Recognition: experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)
In: https://halshs.archives-ouvertes.fr/halshs-03647315 ; 2022 (2022)
BASE
Show details
14
Applying phonetics : speech science in everyday life
Munro, Murray J.. - Chichester, West Sussex : Wiley Blackwell, 2021
BLLDB
UB Frankfurt Linguistik
Show details
15
Automatic Speech Recognition systems errors for accident-prone sleepiness detection through voice
In: EUSIPCO 2021 ; https://hal.archives-ouvertes.fr/hal-03324033 ; EUSIPCO 2021, Aug 2021, Dublin (en ligne), Ireland. ⟨10.23919/EUSIPCO54536.2021.9616299⟩ (2021)
BASE
Show details
16
Automatic Speech Recognition systems errors for objective sleepiness detection through voice
In: Proceedings Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03328827 ; Interspeech 2021, Aug 2021, Brno (virtual), Czech Republic. pp.2476-2480, ⟨10.21437/Interspeech.2021-291⟩ (2021)
BASE
Show details
17
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
18
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
19
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
20
Re-synchronization using the Hand Preceding Model for Multi-modal Fusion in Automatic Continuous Cued Speech Recognition
In: ISSN: 1520-9210 ; IEEE Transactions on Multimedia ; https://hal.archives-ouvertes.fr/hal-02433830 ; IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2021, 23, pp.292-305. ⟨10.1109/TMM.2020.2976493⟩ (2021)
BASE
Show details

Page: 1 2 3 4 5...151

Catalogues
189
0
864
0
0
20
4
Bibliographies
2.389
0
0
0
0
0
0
2
9
Linked Open Data catalogues
0
Online resources
2
0
0
0
Open access documents
601
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern