1 |
Emotional Speech Recognition Using Deep Neural Networks
|
|
|
|
In: Sensors, MDPI, 2022, 22 (4), pp.1414. ISSN: 1424-8220. ⟨10.3390/s22041414⟩. https://hal.archives-ouvertes.fr/hal-03632853
|
|
Abstract:
The expression of emotions plays a very important role in human communication, shaping the information conveyed to the partner. Human emotions are expressed in many forms: body language, facial expressions, eye contact, laughter, and tone of voice. The world's languages differ, but even without understanding a language, people can grasp part of the message the other partner wants to convey through these emotional expressions. Among the forms of human emotional expression, expression through voice is perhaps the most studied. This article presents our research on speech emotion recognition using deep neural networks such as CNN, CRNN, and GRU. We used the Interactive Emotional Dyadic Motion Capture (IEMOCAP) corpus with four emotions: anger, happiness, sadness, and neutrality. The feature parameters used for recognition include the Mel spectral coefficients and other parameters related to the spectrum and the intensity of the speech signal. Data augmentation was performed by changing the voice and adding white noise. The results show that the GRU model gave the highest average recognition accuracy, 97.47%. This result is superior to existing studies on speech emotion recognition with the IEMOCAP corpus.
|
|
Keywords:
Computer Science [cs] / Artificial Intelligence [cs.AI]; CNN; CRNN; data augmentation; emotion; GRU; IEMOCAP; recognition; speech
|
|
URL: https://hal.archives-ouvertes.fr/hal-03632853/document https://hal.archives-ouvertes.fr/hal-03632853 https://doi.org/10.3390/s22041414 https://hal.archives-ouvertes.fr/hal-03632853/file/sensors-22-01414-v2.pdf
|
|
BASE
|
|
2 |
Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
|
|
|
|
In: International Journal of Intelligent Information and Database Systems, Inderscience, 2022, 15 (1), pp.78. ISSN: 1751-5858; EISSN: 1751-5866. ⟨10.1504/IJIIDS.2022.120146⟩. https://hal.inrae.fr/hal-03616243
|
|
3 |
Islands and Bridges of Language: Bio-Inspired Structural Analysis of Language Embedding Data
|
|
|
|
6 |
TIPD : Taiwan Indigenous Peoples open research Data 台灣原住民基礎開放研究資料庫 ...
|
|
|
|
7 |
FAIRsharing record for: Document management -- Electronic document file format for long-term preservation -- Part 1: Use of PDF 1.4 (PDF/A-1) ... : ISO 19005-1:2005 ...
|
|
|
|
8 |
FAIRsharing record for: Systems to manage terminology, knowledge and content -- Design, implementation and maintenance of terminology management systems ... : ISO 26162:2012 ...
|
|
|
|
9 |
FAIRsharing record for: General Ontology for Linguistic Description ... : GOLD ...
|
|
|
|
10 |
FAIRsharing record for: Information technology -- Hypermedia/Time-based Structuring Language (HyTime) ... : ISO/IEC 10744:1997 ...
|
|
|
|
11 |
FAIRsharing record for: Language resource management -- Feature structures -- Part 1: Feature structure representation ... : ISO 24610-1:2006 ...
|
|
|
|
12 |
Between Deterministic and Nondeterministic Quantitative Automata (Invited Talk)
|
|
Boker, Udi. In: LIPIcs - Leibniz International Proceedings in Informatics. 30th EACSL Annual Conference on Computer Science Logic (CSL 2022), 2022
|
|
14 |
Open access dataset of task-free hemodynamic activity in 4-month-old infants during sleep using fNIRS ...
|
|
|
|
15 |
Corpus of Political Speeches: Policy responses to the Great Recession in the United Kingdom and Spain (2008-2014) ...
|
|
|
|
16 |
Conditional Adversarial Learning to Enhance Bot Detection ...
|
|
|
|
17 |
Dialectics of Liberation Congress Digital Archive Audio File Project ...
|
|
|
|
18 |
Dialectics of Liberation Congress Digital Archive Audio File Project ...
|
|
|
|
19 |
Protocol for the development of the international population registry for aphasia after stroke (I-PRAISE)
|
|
|
|
In: Research outputs 2014 to 2021 (2022)
|
|
20 |
Utilising a systematic review-based approach to create a database of individual participant data for meta- and network meta-analyses: The RELEASE database of aphasia after stroke
|
|
|
|
In: Research outputs 2014 to 2021 (2022)
|
|