DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...1.878
Hits 1 – 20 of 37.556

1
Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants
In: ISSN: 1662-4548 ; EISSN: 1662-453X ; Frontiers in Neuroscience ; https://hal.archives-ouvertes.fr/hal-03627441 ; Frontiers in Neuroscience, Frontiers, 2022, 16 (779062), ⟨10.3389/fnins.2022.779062⟩ ; https://www.frontiersin.org/articles/10.3389/fnins.2022.779062/full (2022)
BASE
Show details
2
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
In: https://hal.inria.fr/hal-03540069 ; 2022 (2022)
BASE
Show details
3
A fine-grained recognition of Named Entities in ELTeC collection using cascades
In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
BASE
Show details
4
Multiagent Dynamics of Gradual Argumentation Semantics
In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; https://hal.archives-ouvertes.fr/hal-03584238 ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022), May 2022, Auckland (virtual), New Zealand (2022)
BASE
Show details
5
The Impact of Game Elements on Learner Motivation: Influence of Initial Motivation and Player Profile
In: EISSN: 1939-1382 ; IEEE Transactions on Learning Technologies ; https://hal.univ-lyon2.fr/hal-03579428 ; IEEE Transactions on Learning Technologies, Institute of Electrical and Electronics Engineers, In press, ⟨10.1109/TLT.2022.3153239⟩ (2022)
BASE
Show details
6
Psychiatry on Twitter: Content Analysis of the Use of Psychiatric Terms in French
In: ISSN: 2561-326X ; JMIR Formative Research ; https://hal.archives-ouvertes.fr/hal-03614832 ; JMIR Formative Research, JMIR Publications 2022, 6 (2), pp.e18539. ⟨10.2196/18539⟩ ; https://formative.jmir.org/2022/2/e18539 (2022)
BASE
Show details
7
A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
BASE
Show details
8
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
BASE
Show details
9
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
BASE
Show details
10
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
Abstract: International audience ; The widespread of powerful personal devices capable of collecting voice of their users has opened the opportunity to build speaker adapted speech recognition system (ASR) or to participate to collaborative learning of ASR. In both cases, personalized acoustic models (AM), i.e. fine-tuned AM with specific speaker data, can be built. A question that naturally arises is whether the dissemination of personalized acoustic models can leak personal information. In this paper, we show that it is possible to retrieve the gender of the speaker, but also his identity, by just exploiting the weight matrix changes of a neural acoustic model locally adapted to this speaker. Incidentally we observe phenomena that may be useful towards explainability of deep neural networks in the context of speech processing. Gender can be identified almost surely using only the first layers and speaker verification performs well when using middle-up layers. Our experimental study on the TED-LIUM 3 dataset with HMM/TDNN models shows an accuracy of 95% for gender detection, and an Equal Error Rate of 9.07% for a speaker verification task by only exploiting the weights from personalized models that could be exchanged instead of user data.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; acoustic model; Automatic speech recognition; collaborative learning; personalized acoustic models; speaker information
URL: https://hal.archives-ouvertes.fr/hal-03539741
https://hal.archives-ouvertes.fr/hal-03539741/document
https://hal.archives-ouvertes.fr/hal-03539741/file/ICASSP_2022_SpeakerAnalysisInfoPrivacyVF.pdf
BASE
Hide details
11
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
In: https://hal.inria.fr/hal-03550289 ; 2022 (2022)
BASE
Show details
12
Emotional Speech Recognition Using Deep Neural Networks
In: ISSN: 1424-8220 ; Sensors ; https://hal.archives-ouvertes.fr/hal-03632853 ; Sensors, MDPI, 2022, 22 (4), pp.1414. ⟨10.3390/s22041414⟩ (2022)
BASE
Show details
13
From Biological Synapses to “Intelligent” Robots
In: ISSN: 2079-9292 ; Electronics ; https://hal.archives-ouvertes.fr/hal-03590998 ; Electronics, MDPI, 2022, 11 (5), pp.707. ⟨10.3390/electronics11050707⟩ (2022)
BASE
Show details
14
Multiagent Dynamics of Gradual Argumentation Semantics
In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022) ; https://hal.archives-ouvertes.fr/hal-03584238 ; 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022), May 2022, Auckland (virtual), New Zealand (2022)
BASE
Show details
15
Automated construction of a French Entity Linking dataset to geolocate social network posts in the context of natural disasters
In: ISCRAM ; https://hal.archives-ouvertes.fr/hal-03631387 ; ISCRAM, May 2022, Tarbes, France (2022)
BASE
Show details
16
END-TO-END SPEECH RECOGNITION FROM FEDERATED ACOUSTIC MODELS
In: The International Conference on Acoustics, Speech, & Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-03601224 ; The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), May 2022, Singapour, Singapore (2022)
BASE
Show details
17
Question-Based Explainability in Abstract Argumentation
In: https://hal-univ-tlse3.archives-ouvertes.fr/hal-03647896 ; [Research Report] IRIT/RR--2022--01--FR, IRIT : Institut de Recherche en Informatique de Toulouse, France. 2022, pp.1-64 (2022)
BASE
Show details
18
Source or target first? Comparison of two post-editing strategies with translation students
In: https://hal.archives-ouvertes.fr/hal-03546151 ; 2022 (2022)
BASE
Show details
19
Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming
In: https://hal.archives-ouvertes.fr/hal-03590714 ; 2022 (2022)
BASE
Show details
20
The Impact of Removing Head Movements on Audio-visual Speech Enhancement
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
BASE
Show details

Page: 1 2 3 4 5...1.878

Catalogues
466
69
465
0
0
9
26
Bibliographies
1.969
1
0
0
0
0
0
14
280
Linked Open Data catalogues
0
Online resources
105
18
0
0
Open access documents
35.037
5
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern