DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 29

1
Emotional Speech Recognition Using Deep Neural Networks
In: ISSN: 1424-8220 ; Sensors ; https://hal.archives-ouvertes.fr/hal-03632853 ; Sensors, MDPI, 2022, 22 (4), pp.1414. ⟨10.3390/s22041414⟩ (2022)
BASE
Show details
2
Prosodic Feature-Based Discriminatively Trained Low Resource Speech Recognition System
In: Sustainability; Volume 14; Issue 2; Pages: 614 (2022)
BASE
Show details
3
Text Data Augmentation for the Korean Language
In: Applied Sciences; Volume 12; Issue 7; Pages: 3425 (2022)
BASE
Show details
4
Emotional Speech Recognition Using Deep Neural Networks
In: Sensors; Volume 22; Issue 4; Pages: 1414 (2022)
BASE
Show details
5
A Study of Data Augmentation for ASR Robustness in Low Bit Rate Contact Center Recordings Including Packet Losses
In: Applied Sciences; Volume 12; Issue 3; Pages: 1580 (2022)
BASE
Show details
6
Modeling the effect of military oxygen masks on speech characteristics
In: Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03325087 ; Interspeech 2021, Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
7
Simulating reading mistakes for child speech Transformer-based phone recognition
In: Annual Conference of the International Speech Communication Association (INTERSPEECH) ; https://hal.archives-ouvertes.fr/hal-03257870 ; Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2021, Brno, Czech Republic (2021)
BASE
Show details
8
A Data Augmentation Approach for Sign-Language-To-Text Translation In-The-Wild ...
Nunnari, Fabrizio; España-Bonet, Cristina; Avramidis, Eleftherios. - : Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021
BASE
Show details
9
Effekten av textaugmenteringsstrategier på träffsäkerhet, F1-värde och viktat F1-värde ; The effect of text data augmentation strategies on Accuracy, F1-score, and weighted F1-score
Shmas, George; Svedberg, Jonatan. - : KTH, Hälsoinformatik och logistik, 2021
BASE
Show details
10
Using Data Augmentation and Time-Scale Modification to Improve ASR of Children’s Speech in Noisy Environments
In: Applied Sciences ; Volume 11 ; Issue 18 (2021)
Abstract: Current ASR systems show poor performance in recognition of children’s speech in noisy environments because recognizers are typically trained with clean adults’ speech and therefore there are two mismatches between training and testing phases (i.e., clean speech in training vs. noisy speech in testing and adult speech in training vs. child speech in testing). This article studies methods to tackle the effects of these two mismatches in recognition of noisy children’s speech by investigating two techniques: data augmentation and time-scale modification. In the former, clean training data of adult speakers are corrupted with additive noise in order to obtain training data that better correspond to the noisy testing conditions. In the latter, the fundamental frequency (F0) and speaking rate of children’s speech are modified in the testing phase in order to reduce differences in the prosodic characteristics between the testing data of child speakers and the training data of adult speakers. A standard ASR system based on DNN–HMM was built and the effects of data augmentation, F0 modification, and speaking rate modification on word error rate (WER) were evaluated first separately and then by combining all three techniques. The experiments were conducted using children’s speech corrupted with additive noise of four different noise types in four different signal-to-noise (SNR) categories. The results show that the combination of all three techniques yielded the best ASR performance. As an example, the WER value averaged over all four noise types in the SNR category of 5 dB dropped from 32.30% to 12.09% when the baseline system, in which no data augmentation or time-scale modification were used, was replaced with a recognizer that was built using a combination of all three techniques. In summary, in recognizing noisy children’s speech with ASR systems trained with clean adult speech, considerable improvements in the recognition performance can be achieved by combining data augmentation based on noise addition in the system training phase and time-scale modification based on modifying F0 and speaking rate of children’s speech in the testing phase.
Keyword: data augmentation; DNN; recognition of children’s speech; time-scale modification
URL: https://doi.org/10.3390/app11188420
BASE
Hide details
11
Generating Synthetic Disguised Faces with Cycle-Consistency Loss and an Automated Filtering Algorithm
In: Mathematics; Volume 10; Issue 1; Pages: 4 (2021)
BASE
Show details
12
Volumetric changes at implant sites: A systematic appraisal of traditional methods and optical scanning- based digital technologies
Tavelli, Lorenzo; Barootchi, Shayan; Majzoub, Jad. - : Wiley Periodicals, Inc., 2021
BASE
Show details
13
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach
Sánchez-Cartagena, Víctor M.; Sánchez-Martínez, Felipe; Pérez-Ortiz, Juan Antonio. - : Association for Computational Linguistics, 2021
BASE
Show details
14
Improving Short Text Classification Through Global Augmentation Methods
In: Lecture Notes in Computer Science ; 4th International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE) ; https://hal.inria.fr/hal-03414750 ; 4th International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2020, Dublin, Ireland. pp.385-399, ⟨10.1007/978-3-030-57321-8_21⟩ (2020)
BASE
Show details
15
Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
In: SLT 2020 - IEEE Spoken Language Technology Workshop ; https://hal.archives-ouvertes.fr/hal-03070321 ; SLT 2020 - IEEE Spoken Language Technology Workshop, Dec 2020, Shenzhen / Virtual, China (2020)
BASE
Show details
16
Characterization and classification of semantic image-text relations ...
Otto, Christian; Springstein, Matthias; Anand, Avishek. - : London : Springer, 2020
BASE
Show details
17
Characterization and classification of semantic image-text relations ...
Otto, C.; Springstein, M.; Anand, A.. - : Berlin : Springer Nature, 2020
BASE
Show details
18
Using Complexity-Identical Human- and Machine-Directed Utterances to Investigate Addressee Detection for Spoken Dialogue Systems
In: Sensors ; Volume 20 ; Issue 9 (2020)
BASE
Show details
19
NAT: Noise-Aware Training for Robust Neural Sequence Labeling
In: Fraunhofer IAIS (2020)
BASE
Show details
20
MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity
In: Proceedings of the Society for Computation in Linguistics (2020)
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
29
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern