1 |
Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants
|
|
|
|
In: ISSN: 1662-4548 ; EISSN: 1662-453X ; Frontiers in Neuroscience ; https://hal.archives-ouvertes.fr/hal-03627441 ; Frontiers in Neuroscience, Frontiers, 2022, 16 (779062), ⟨10.3389/fnins.2022.779062⟩ ; https://www.frontiersin.org/articles/10.3389/fnins.2022.779062/full (2022)
|
|
BASE
|
|
|
|
2 |
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
|
|
|
|
In: https://hal.inria.fr/hal-03540069 ; 2022 (2022)
|
|
BASE
|
|
|
|
3 |
A fine-grained recognition of Named Entities in ELTeC collection using cascades
|
|
|
|
In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
|
|
BASE
|
|
|
|
4 |
Multiagent Dynamics of Gradual Argumentation Semantics
|
|
|
|
In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; https://hal.archives-ouvertes.fr/hal-03584238 ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022), May 2022, Auckland (virtual), New Zealand (2022)
|
|
BASE
|
|
|
|
5 |
The Impact of Game Elements on Learner Motivation: Influence of Initial Motivation and Player Profile
|
|
|
|
In: EISSN: 1939-1382 ; IEEE Transactions on Learning Technologies ; https://hal.univ-lyon2.fr/hal-03579428 ; IEEE Transactions on Learning Technologies, Institute of Electrical and Electronics Engineers, In press, ⟨10.1109/TLT.2022.3153239⟩ (2022)
|
|
BASE
|
|
|
|
6 |
Psychiatry on Twitter: Content Analysis of the Use of Psychiatric Terms in French
|
|
|
|
In: ISSN: 2561-326X ; JMIR Formative Research ; https://hal.archives-ouvertes.fr/hal-03614832 ; JMIR Formative Research, JMIR Publications 2022, 6 (2), pp.e18539. ⟨10.2196/18539⟩ ; https://formative.jmir.org/2022/2/e18539 (2022)
|
|
BASE
|
|
|
|
7 |
A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
|
|
BASE
|
|
|
|
8 |
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
|
|
|
|
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
|
|
BASE
|
|
|
|
9 |
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
|
|
BASE
|
|
|
|
10 |
Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition
|
|
|
|
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapore, Singapore (2022)
|
|
BASE
|
|
|
|
11 |
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
|
|
|
|
In: https://hal.inria.fr/hal-03550289 ; 2022 (2022)
|
|
BASE
|
|
|
|
12 |
Emotional Speech Recognition Using Deep Neural Networks
|
|
|
|
In: ISSN: 1424-8220 ; Sensors ; https://hal.archives-ouvertes.fr/hal-03632853 ; Sensors, MDPI, 2022, 22 (4), pp.1414. ⟨10.3390/s22041414⟩ (2022)
|
|
BASE
|
|
|
|
13 |
From Biological Synapses to “Intelligent” Robots
|
|
|
|
In: ISSN: 2079-9292 ; Electronics ; https://hal.archives-ouvertes.fr/hal-03590998 ; Electronics, MDPI, 2022, 11 (5), pp.707. ⟨10.3390/electronics11050707⟩ (2022)
|
|
BASE
|
|
|
|
15 |
Automated construction of a French Entity Linking dataset to geolocate social network posts in the context of natural disasters
|
|
|
|
In: ISCRAM ; https://hal.archives-ouvertes.fr/hal-03631387 ; ISCRAM, May 2022, Tarbes, France (2022)
|
|
BASE
|
|
|
|
16 |
End-to-End Speech Recognition from Federated Acoustic Models
|
|
|
|
In: The International Conference on Acoustics, Speech, & Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-03601224 ; The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), May 2022, Singapore, Singapore (2022)
|
|
BASE
|
|
|
|
17 |
Question-Based Explainability in Abstract Argumentation
|
|
|
|
In: https://hal-univ-tlse3.archives-ouvertes.fr/hal-03647896 ; [Research Report] IRIT/RR--2022--01--FR, IRIT : Institut de Recherche en Informatique de Toulouse, France. 2022, pp.1-64 (2022)
|
|
BASE
|
|
|
|
18 |
Source or target first? Comparison of two post-editing strategies with translation students
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03546151 ; 2022 (2022)
|
|
BASE
|
|
|
|
19 |
Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03590714 ; 2022 (2022)
|
|
BASE
|
|
|
|
20 |
The Impact of Removing Head Movements on Audio-visual Speech Enhancement
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
|
|
Abstract:
This paper investigates the impact of head movements on audiovisual speech enhancement (AVSE). Although being a common conversational feature, head movements have been ignored by past and recent studies: they challenge today's learning-based methods as they often degrade the performance of models that are trained on clean, frontal, and steady face images. To alleviate this problem, we propose to use robust face frontalization (RFF) in combination with an AVSE method based on a variational auto-encoder (VAE) model. We briefly describe the basic ingredients of the proposed pipeline and we perform experiments with a recently released audiovisual dataset. In the light of these experiments, and based on three standard metrics, namely STOI, PESQ and SI-SDR, we conclude that RFF improves the performance of AVSE by a considerable margin.
|
|
Keyword:
[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; Audio-visual speech enhancement; face frontalization; variational auto-encoder
|
|
URL: https://hal.inria.fr/hal-03551610 https://hal.inria.fr/hal-03551610v2/file/Kang-ICASSP22-CR.pdf https://hal.inria.fr/hal-03551610v2/document
|
|
BASE
|
|
|
|
|
|