1 |
Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants
|
|
|
|
In: ISSN: 1662-4548 ; EISSN: 1662-453X ; Frontiers in Neuroscience ; https://hal.archives-ouvertes.fr/hal-03627441 ; Frontiers in Neuroscience, Frontiers, 2022, 16 (779062), ⟨10.3389/fnins.2022.779062⟩ ; https://www.frontiersin.org/articles/10.3389/fnins.2022.779062/full (2022)
|
|
BASE
|
|
|
|
2 |
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
|
|
|
|
In: https://hal.inria.fr/hal-03540069 ; 2022 (2022)
|
|
BASE
|
|
|
|
3 |
A fine-grained recognition of Named Entities in ELTeC collection using cascades
|
|
|
|
In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
|
|
BASE
|
|
|
|
4 |
Multiagent Dynamics of Gradual Argumentation Semantics
|
|
|
|
In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; https://hal.archives-ouvertes.fr/hal-03584238 ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022), May 2022, Auckland (virtual), New Zealand (2022)
|
|
BASE
|
|
|
|
5 |
The Impact of Game Elements on Learner Motivation: Influence of Initial Motivation and Player Profile
|
|
|
|
In: EISSN: 1939-1382 ; IEEE Transactions on Learning Technologies ; https://hal.univ-lyon2.fr/hal-03579428 ; IEEE Transactions on Learning Technologies, Institute of Electrical and Electronics Engineers, In press, ⟨10.1109/TLT.2022.3153239⟩ (2022)
|
|
BASE
|
|
|
|
6 |
Psychiatry on Twitter: Content Analysis of the Use of Psychiatric Terms in French
|
|
|
|
In: ISSN: 2561-326X ; JMIR Formative Research ; https://hal.archives-ouvertes.fr/hal-03614832 ; JMIR Formative Research, JMIR Publications 2022, 6 (2), pp.e18539. ⟨10.2196/18539⟩ ; https://formative.jmir.org/2022/2/e18539 (2022)
|
|
BASE
|
|
|
|
7 |
A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
|
|
BASE
|
|
|
|
8 |
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
|
|
|
|
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
|
|
BASE
|
|
|
|
9 |
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
|
|
BASE
|
|
|
|
10 |
Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition
|
|
|
|
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapore, Singapore (2022)
|
|
Abstract:
The widespread availability of powerful personal devices capable of collecting their users' voices has opened the opportunity to build speaker-adapted automatic speech recognition (ASR) systems or to participate in collaborative learning of ASR. In both cases, personalized acoustic models (AMs), i.e. AMs fine-tuned with data from a specific speaker, can be built. A question that naturally arises is whether the dissemination of personalized acoustic models can leak personal information. In this paper, we show that it is possible to retrieve the gender of the speaker, and also their identity, just by exploiting the weight matrix changes of a neural acoustic model locally adapted to this speaker. Incidentally, we observe phenomena that may be useful for the explainability of deep neural networks in the context of speech processing. Gender can be identified almost surely using only the first layers, and speaker verification performs well when using the middle-to-upper layers. Our experimental study on the TED-LIUM 3 dataset with HMM/TDNN models shows an accuracy of 95% for gender detection and an Equal Error Rate of 9.07% for a speaker verification task, exploiting only the weights from personalized models that could be exchanged instead of user data.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; acoustic model; Automatic speech recognition; collaborative learning; personalized acoustic models; speaker information
|
|
URL: https://hal.archives-ouvertes.fr/hal-03539741 https://hal.archives-ouvertes.fr/hal-03539741/document https://hal.archives-ouvertes.fr/hal-03539741/file/ICASSP_2022_SpeakerAnalysisInfoPrivacyVF.pdf
|
|
BASE
|
|
|
|
11 |
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
|
|
|
|
In: https://hal.inria.fr/hal-03550289 ; 2022 (2022)
|
|
BASE
|
|
|
|
12 |
Emotional Speech Recognition Using Deep Neural Networks
|
|
|
|
In: ISSN: 1424-8220 ; Sensors ; https://hal.archives-ouvertes.fr/hal-03632853 ; Sensors, MDPI, 2022, 22 (4), pp.1414. ⟨10.3390/s22041414⟩ (2022)
|
|
BASE
|
|
|
|
13 |
From Biological Synapses to “Intelligent” Robots
|
|
|
|
In: ISSN: 2079-9292 ; Electronics ; https://hal.archives-ouvertes.fr/hal-03590998 ; Electronics, MDPI, 2022, 11 (5), pp.707. ⟨10.3390/electronics11050707⟩ (2022)
|
|
BASE
|
|
|
|
15 |
Automated construction of a French Entity Linking dataset to geolocate social network posts in the context of natural disasters
|
|
|
|
In: ISCRAM ; https://hal.archives-ouvertes.fr/hal-03631387 ; ISCRAM, May 2022, Tarbes, France (2022)
|
|
BASE
|
|
|
|
16 |
End-to-End Speech Recognition from Federated Acoustic Models
|
|
|
|
In: The International Conference on Acoustics, Speech, & Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-03601224 ; The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), May 2022, Singapore, Singapore (2022)
|
|
BASE
|
|
|
|
17 |
Question-Based Explainability in Abstract Argumentation
|
|
|
|
In: https://hal-univ-tlse3.archives-ouvertes.fr/hal-03647896 ; [Research Report] IRIT/RR--2022--01--FR, IRIT : Institut de Recherche en Informatique de Toulouse, France. 2022, pp.1-64 (2022)
|
|
BASE
|
|
|
|
18 |
Source or target first? Comparison of two post-editing strategies with translation students
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03546151 ; 2022 (2022)
|
|
BASE
|
|
|
|
19 |
Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03590714 ; 2022 (2022)
|
|
BASE
|
|
|
|
20 |
The Impact of Removing Head Movements on Audio-visual Speech Enhancement
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
|
|
BASE
|
|
|
|
|
|