1 |
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
|
|
|
|
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
|
|
Abstract:
International audience ; The widespread of powerful personal devices capable of collecting voice of their users has opened the opportunity to build speaker adapted speech recognition system (ASR) or to participate to collaborative learning of ASR. In both cases, personalized acoustic models (AM), i.e. fine-tuned AM with specific speaker data, can be built. A question that naturally arises is whether the dissemination of personalized acoustic models can leak personal information. In this paper, we show that it is possible to retrieve the gender of the speaker, but also his identity, by just exploiting the weight matrix changes of a neural acoustic model locally adapted to this speaker. Incidentally we observe phenomena that may be useful towards explainability of deep neural networks in the context of speech processing. Gender can be identified almost surely using only the first layers and speaker verification performs well when using middle-up layers. Our experimental study on the TED-LIUM 3 dataset with HMM/TDNN models shows an accuracy of 95% for gender detection, and an Equal Error Rate of 9.07% for a speaker verification task by only exploiting the weights from personalized models that could be exchanged instead of user data.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; acoustic model; Automatic speech recognition; collaborative learning; personalized acoustic models; speaker information
|
|
URL: https://hal.archives-ouvertes.fr/hal-03539741 https://hal.archives-ouvertes.fr/hal-03539741/document https://hal.archives-ouvertes.fr/hal-03539741/file/ICASSP_2022_SpeakerAnalysisInfoPrivacyVF.pdf
|
|
BASE
|
|
Hide details
|
|
2 |
From FreEM to D'AlemBERT ; From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French
|
|
|
|
In: Proceedings of the 13th Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-03596653 ; Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, Jun 2022, Marseille, France (2022)
|
|
BASE
|
|
Show details
|
|
3 |
A gentle introduction to Girard's Transcendental Syntax for the linear logician
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02977750 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Learning and controlling the source-filter representation of speech with a variational autoencoder
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03650569 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Hippocampal ensembles represent sequential relationships among an extended sequence of nonspatial events.
|
|
|
|
In: Nature communications, vol 13, iss 1 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Changes in the midst of a construction network: a diachronic construction grammar approach to complex prepositions denoting internal location
|
|
|
|
In: ISSN: 0936-5907 ; EISSN: 1613-3641 ; Cognitive Linguistics ; https://halshs.archives-ouvertes.fr/halshs-03637056 ; Cognitive Linguistics, De Gruyter, 2022, ⟨10.1515/cog-2021-0128⟩ (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Changes in the midst of a construction network: a diachronic construction grammar approach to complex prepositions denoting internal location
|
|
|
|
In: ISSN: 0936-5907 ; EISSN: 1613-3641 ; Cognitive Linguistics ; https://halshs.archives-ouvertes.fr/halshs-03637056 ; Cognitive Linguistics, De Gruyter, In press, ⟨10.1515/cog-2021-0128⟩ (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Le modèle Transformer: un « couteau suisse » pour le traitement automatique des langues
|
|
|
|
In: Techniques de l'Ingenieur ; https://hal.archives-ouvertes.fr/hal-03619077 ; Techniques de l'Ingenieur, Techniques de l'ingénieur, 2022, ⟨10.51257/a-v1-in195⟩ ; https://www.techniques-ingenieur.fr/base-documentaire/innovation-th10/innovations-en-electronique-et-tic-42257210/transformer-des-reseaux-de-neurones-pour-le-traitement-automatique-des-langues-in195/ (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost
|
|
|
|
In: ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03613101 ; ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, May 2022, Dublin, Ireland (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Imputing out-of-vocabulary embeddings with LOVE makes language models robust with little cost
|
|
|
|
In: ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03613101 ; ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, May 2022, Dublin, Ireland (2022)
|
|
BASE
|
|
Show details
|
|
11 |
Structured, flexible, and robust: comparing linguistic plans and explanations generated by humans and large language models ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
From bag-of-words towards natural language: adapting topic models to avoid stop word removal ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
A Collection of Classroom Instruction ... : A Collection of Classroom Instruction ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Biodiversity: how big is our global biodiversity debt and what can we do about it? ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Bayesian data analysis in the phonetic sciences: A tutorial introduction ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
How Cognitive Abilities May Support Children’s Bilingual Literacy Development in a Multilingual Society ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages ...
|
|
Chen, Fuxiang. - : Federated Research Data Repository / dépôt fédéré de données de recherche, 2022
|
|
BASE
|
|
Show details
|
|
|
|