1 |
Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants
|
|
|
|
In: ISSN: 1662-4548 ; EISSN: 1662-453X ; Frontiers in Neuroscience ; https://hal.archives-ouvertes.fr/hal-03627441 ; Frontiers in Neuroscience, Frontiers, 2022, 16 (779062), ⟨10.3389/fnins.2022.779062⟩ ; https://www.frontiersin.org/articles/10.3389/fnins.2022.779062/full (2022)
|
|
BASE
|
|
Show details
|
|
2 |
A fine-grained recognition of Named Entities in ELTeC collection using cascades
|
|
|
|
In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
|
|
BASE
|
|
Show details
|
|
3 |
The genetic architecture of language functional connectivity
|
|
|
|
In: ISSN: 1053-8119 ; EISSN: 1095-9572 ; NeuroImage ; https://hal.sorbonne-universite.fr/hal-03566120 ; NeuroImage, Elsevier, 2022, 249, pp.118795. ⟨10.1016/j.neuroimage.2021.118795⟩ (2022)
|
|
BASE
|
|
Show details
|
|
4 |
The "Fat Face" illusion: A robust adaptation for processing pairs of faces
|
|
|
|
In: ISSN: 0042-6989 ; EISSN: 0042-6989 ; Vision Research ; https://hal.archives-ouvertes.fr/hal-03579276 ; Vision Research, Elsevier, 2022, 195, pp.108015. ⟨10.1016/j.visres.2022.108015⟩ (2022)
|
|
BASE
|
|
Show details
|
|
5 |
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
|
|
|
|
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Emotional Speech Recognition Using Deep Neural Networks
|
|
|
|
In: ISSN: 1424-8220 ; Sensors ; https://hal.archives-ouvertes.fr/hal-03632853 ; Sensors, MDPI, 2022, 22 (4), pp.1414. ⟨10.3390/s22041414⟩ (2022)
|
|
Abstract:
International audience ; The expression of emotions in human communication plays a very important role in the information that needs to be conveyed to the partner. The forms of expression of human emotions are very rich. It could be body language, facial expressions, eye contact, laughter, and tone of voice. The languages of the world’s peoples are different, but even without understanding a language in communication, people can almost understand part of the message that the other partner wants to convey with emotional expressions as mentioned. Among the forms of human emotional expression, the expression of emotions through voice is perhaps the most studied. This article presents our research on speech emotion recognition using deep neural networks such as CNN, CRNN, and GRU. We used the Interactive Emotional Dyadic Motion Capture (IEMOCAP) corpus for the study with four emotions: anger, happiness, sadness, and neutrality. The feature parameters used for recognition include the Mel spectral coefficients and other parameters related to the spectrum and the intensity of the speech signal. The data augmentation was used by changing the voice and adding white noise. The results show that the GRU model gave the highest average recognition accuracy of 97.47%. This result is superior to existing studies on speech emotion recognition with the IEMOCAP corpus.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; CNN; CRNN; data augmentation; emotion; GRU; IEMOCAP; recognition; speech
|
|
URL: https://hal.archives-ouvertes.fr/hal-03632853/document https://hal.archives-ouvertes.fr/hal-03632853 https://doi.org/10.3390/s22041414 https://hal.archives-ouvertes.fr/hal-03632853/file/sensors-22-01414-v2.pdf
|
|
BASE
|
|
Hide details
|
|
7 |
From Biological Synapses to “Intelligent” Robots
|
|
|
|
In: ISSN: 2079-9292 ; Electronics ; https://hal.archives-ouvertes.fr/hal-03590998 ; Electronics, MDPI, 2022, 11 (5), pp.707. ⟨10.3390/electronics11050707⟩ (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Biological constraints on configural odour mixture perception
|
|
|
|
In: ISSN: 0022-0949 ; EISSN: 1477-9145 ; Journal of Experimental Biology ; https://hal-cnrs.archives-ouvertes.fr/hal-03610253 ; Journal of Experimental Biology, The Company of Biologists, 2022, 225 (6), pp.jeb242274. ⟨10.1242/jeb.242274⟩ ; https://journals.biologists.com/jeb/article-abstract/225/6/jeb242274/274695/Biological-constraints-on-configural-odour-mixture (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03590714 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
10 |
The Impact of Removing Head Movements on Audio-visual Speech Enhancement
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
|
|
BASE
|
|
Show details
|
|
11 |
Face recognition improvements in adults and children with face recognition difficulties
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Face masks versus sunglasses: Limited effects of time and individual differences in the ability to judge facial identity and social traits
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Efficient localization of the cortical language network and its functional neuroanatomy in dyslexia
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Anxious voice and avoidant language in interaction with a woman wearing an Islamic headscarf: field-experimental evidence from the Paris metro
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03140246 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
15 |
Anxious voice and avoidant language in interaction with a woman wearing an Islamic headscarf: field-experimental evidence from the Paris metro
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03140246 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
16 |
A lexical approach for identifying behavioural action sequences
|
|
|
|
In: ISSN: 1553-734X ; EISSN: 1553-7358 ; PLoS Computational Biology ; https://hal.sorbonne-universite.fr/hal-03521462 ; PLoS Computational Biology, Public Library of Science, 2022, 18 (1), pp.e1009672. ⟨10.1371/journal.pcbi.1009672⟩ (2022)
|
|
BASE
|
|
Show details
|
|
17 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
Show details
|
|
18 |
Hippocampal and auditory contributions to speech segmentation
|
|
|
|
In: ISSN: 0010-9452 ; Cortex ; https://hal.archives-ouvertes.fr/hal-03604957 ; Cortex, Elsevier, 2022, ⟨10.1016/j.cortex.2022.01.017⟩ (2022)
|
|
BASE
|
|
Show details
|
|
19 |
Évaluation de la perception des sons de parole chez les populations pédiatriques : réflexion sur les épreuves existantes
|
|
|
|
In: ISSN: 0298-6477 ; EISSN: 2117-7155 ; Glossa ; https://hal.archives-ouvertes.fr/hal-03646757 ; Glossa, UNADREO - Union NAtionale pour le Développement de la Recherche en Orthophonie, 2022, 132, pp.1-27 ; https://www.glossa.fr/index.php/glossa/article/view/1043 (2022)
|
|
BASE
|
|
Show details
|
|
20 |
Metacognitive improvement: Disentangling adaptive training from experimental confounds.
|
|
|
|
In: ISSN: 0096-3445 ; Journal of Experimental Psychology: General ; https://hal.archives-ouvertes.fr/hal-03581013 ; Journal of Experimental Psychology: General, American Psychological Association, In press, ⟨10.1037/xge0001185⟩ (2022)
|
|
BASE
|
|
Show details
|
|
|
|