
Search in the Catalogues and Directories

Hits 1 – 20 of 16,644

1
Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants
In: ISSN: 1662-4548 ; EISSN: 1662-453X ; Frontiers in Neuroscience ; https://hal.archives-ouvertes.fr/hal-03627441 ; Frontiers in Neuroscience, Frontiers, 2022, 16 (779062), ⟨10.3389/fnins.2022.779062⟩ ; https://www.frontiersin.org/articles/10.3389/fnins.2022.779062/full (2022)
BASE
2
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
In: https://hal.inria.fr/hal-03540069 ; 2022 (2022)
BASE
3
A fine-grained recognition of Named Entities in ELTeC collection using cascades
In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
BASE
4
Multiagent Dynamics of Gradual Argumentation Semantics
In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; https://hal.archives-ouvertes.fr/hal-03584238 ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022), May 2022, Auckland (virtual), New Zealand (2022)
BASE
5
The Impact of Game Elements on Learner Motivation: Influence of Initial Motivation and Player Profile
In: EISSN: 1939-1382 ; IEEE Transactions on Learning Technologies ; https://hal.univ-lyon2.fr/hal-03579428 ; IEEE Transactions on Learning Technologies, Institute of Electrical and Electronics Engineers, In press, ⟨10.1109/TLT.2022.3153239⟩ (2022)
BASE
6
Psychiatry on Twitter: Content Analysis of the Use of Psychiatric Terms in French
In: ISSN: 2561-326X ; JMIR Formative Research ; https://hal.archives-ouvertes.fr/hal-03614832 ; JMIR Formative Research, JMIR Publications 2022, 6 (2), pp.e18539. ⟨10.2196/18539⟩ ; https://formative.jmir.org/2022/2/e18539 (2022)
BASE
7
A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
BASE
8
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
BASE
9
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
BASE
10
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
BASE
11
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
In: https://hal.inria.fr/hal-03550289 ; 2022 (2022)
BASE
12
Emotional Speech Recognition Using Deep Neural Networks
In: ISSN: 1424-8220 ; Sensors ; https://hal.archives-ouvertes.fr/hal-03632853 ; Sensors, MDPI, 2022, 22 (4), pp.1414. ⟨10.3390/s22041414⟩ (2022)
BASE
13
From Biological Synapses to “Intelligent” Robots
In: ISSN: 2079-9292 ; Electronics ; https://hal.archives-ouvertes.fr/hal-03590998 ; Electronics, MDPI, 2022, 11 (5), pp.707. ⟨10.3390/electronics11050707⟩ (2022)
BASE
15
Automated construction of a French Entity Linking dataset to geolocate social network posts in the context of natural disasters
In: ISCRAM ; https://hal.archives-ouvertes.fr/hal-03631387 ; ISCRAM, May 2022, Tarbes, France (2022)
BASE
16
END-TO-END SPEECH RECOGNITION FROM FEDERATED ACOUSTIC MODELS
In: The International Conference on Acoustics, Speech, & Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-03601224 ; The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), May 2022, Singapour, Singapore (2022)
BASE
17
Question-Based Explainability in Abstract Argumentation
In: https://hal-univ-tlse3.archives-ouvertes.fr/hal-03647896 ; [Research Report] IRIT/RR--2022--01--FR, IRIT : Institut de Recherche en Informatique de Toulouse, France. 2022, pp.1-64 (2022)
BASE
18
Source or target first? Comparison of two post-editing strategies with translation students
In: https://hal.archives-ouvertes.fr/hal-03546151 ; 2022 (2022)
BASE
19
Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming
In: https://hal.archives-ouvertes.fr/hal-03590714 ; 2022 (2022)
BASE
20
The Impact of Removing Head Movements on Audio-visual Speech Enhancement
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
Abstract: This paper investigates the impact of head movements on audiovisual speech enhancement (AVSE). Although being a common conversational feature, head movements have been ignored by past and recent studies: they challenge today's learning-based methods as they often degrade the performance of models that are trained on clean, frontal, and steady face images. To alleviate this problem, we propose to use robust face frontalization (RFF) in combination with an AVSE method based on a variational auto-encoder (VAE) model. We briefly describe the basic ingredients of the proposed pipeline and we perform experiments with a recently released audiovisual dataset. In the light of these experiments, and based on three standard metrics, namely STOI, PESQ and SI-SDR, we conclude that RFF improves the performance of AVSE by a considerable margin.
Keyword: [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; Audio-visual speech enhancement; face frontalization; variational auto-encoder
URL: https://hal.inria.fr/hal-03551610
https://hal.inria.fr/hal-03551610v2/file/Kang-ICASSP22-CR.pdf
https://hal.inria.fr/hal-03551610v2/document
BASE

© 2013 - 2024 Lin|gu|is|tik