DE eng

Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
Modeling the effect of military oxygen masks on speech characteristics
In: Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03325087 ; Interspeech 2021, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
2
Simulating reading mistakes for child speech Transformer-based phone recognition
In: Annual Conference of the International Speech Communication Association (INTERSPEECH) ; https://hal.archives-ouvertes.fr/hal-03257870 ; Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2021, Brno, Czech Republic (2021)
Abstract: International audience ; Current performance of automatic speech recognition (ASR) for children is below that of the latest systems dedicated to adult speech. Child speech is particularly difficult to recognise, and substantial corpora are missing to train acoustic models. Furthermore, in the scope of our reading assistant for 5-8-year-old children learning to read, models need to cope with disfluencies and reading mistakes, which remain considerable challenges even for state-of-the-art ASR systems. In this paper, we adapt an end-to-end Transformer acoustic model to speech from children learning to read. Transfer learning (TL) with a small amount of child speech improves the phone error rate (PER) by 48.7% relative over an adult model and outperforms a TL-adapted DNN-HMM model by 21.0% relative PER. Multi-objective training with a Connectionist Temporal Classification (CTC) function further reduces the PER by 4.8% relative. We propose a method of reading mistakes data augmentation, where we simulate word-level repetitions and substitutions with phonetically or graphically close words. Combining these two types of reading mistakes reaches a 19.9% PER, with a 13.1% relative improvement over the baseline. A detailed analysis shows that both the CTC multi-objective training and the augmentation with synthetic repetitions help the attention mechanisms better detect children's disfluencies.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; child speech; connectionist temporal classification; data augmentation; synthetic reading mistakes; transformer
URL: https://hal.archives-ouvertes.fr/hal-03257870/document
https://hal.archives-ouvertes.fr/hal-03257870/file/Paper_Interspeech2021_LucileGelin.pdf
https://hal.archives-ouvertes.fr/hal-03257870
BASE
Hide details
3
A Data Augmentation Approach for Sign-Language-To-Text Translation In-The-Wild ...
Nunnari, Fabrizio; España-Bonet, Cristina; Avramidis, Eleftherios. - : Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021
BASE
Show details
4
Effekten av textaugmenteringsstrategier på träffsäkerhet, F1-värde och viktat F1-värde ; The effect of text data augmentation strategies on Accuracy, F1-score, and weighted F1-score
Shmas, George; Svedberg, Jonatan. - : KTH, Hälsoinformatik och logistik, 2021
BASE
Show details
5
Using Data Augmentation and Time-Scale Modification to Improve ASR of Children’s Speech in Noisy Environments
In: Applied Sciences ; Volume 11 ; Issue 18 (2021)
BASE
Show details
6
Generating Synthetic Disguised Faces with Cycle-Consistency Loss and an Automated Filtering Algorithm
In: Mathematics; Volume 10; Issue 1; Pages: 4 (2021)
BASE
Show details
7
Volumetric changes at implant sites: A systematic appraisal of traditional methods and optical scanning- based digital technologies
Tavelli, Lorenzo; Barootchi, Shayan; Majzoub, Jad. - : Wiley Periodicals, Inc., 2021
BASE
Show details
8
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach
Sánchez-Cartagena, Víctor M.; Sánchez-Martínez, Felipe; Pérez-Ortiz, Juan Antonio. - : Association for Computational Linguistics, 2021
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
8
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern