
Search in the Catalogues and Directories

Hits 1 – 20 of 84

1
Evaluation of Speaker Anonymization on Emotional Speech ; Analyse de l'anonymisation du locuteur sur de la parole émotionnelle
In: JEP2022 - Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-03636737 ; JEP2022 - Journées d'Études sur la Parole, Jun 2022, Île de Noirmoutier, France (2022)
BASE
2
A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender
In: PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02995862 ; PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence, Feb 2021, Virtual, China (2021)
BASE
3
Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
In: Interspeech ; https://hal.archives-ouvertes.fr/hal-03228823 ; Interspeech, Aug 2021, Brno, Czech Republic (2021)
BASE
4
Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
In: https://hal.archives-ouvertes.fr/hal-03228823 ; 2021 (2021)
BASE
5
Adapting Language Models When Training on Privacy-Transformed Data
In: INTERSPEECH 2021 ; https://hal.inria.fr/hal-03189354 ; 2021 (2021)
Abstract (submitted to INTERSPEECH 2021): In recent years, voice-controlled personal assistants have revolutionized the interaction with smart devices and mobile applications. These dialogue tools are then used by system providers to improve and retrain the language models (LMs). Each spoken message reveals personal information, hence, it is necessary to remove the private data from the input utterances. However, this may harm the LM training because privacy-transformed data is unlikely to match the test distribution. This paper aims to fill the gap by focusing on the adaptation of LM initially trained on privacy-transformed utterances. Our data sanitization process relies on named-entity recognition. We propose an LM adaptation strategy over the private data with minimum losses. Class-based modeling is an effective approach to overcome data sparsity in the context of n-gram model training. On the other hand, neural LMs can handle longer contexts which can yield better predictions. Our methodology combines the predictive power of class-based models and the generalization capability of neural models together. With privacy transformation, we have a relative 11% word error rate (WER) increase compared to an LM trained on the clean data. Despite the privacy-preserving, we can still achieve comparable accuracy. Empirical evaluations attain a relative WER improvement of 8% over the initial model.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; class-based language modeling; language model adaptation; privacy-preserving learning; speech recognition
URL: https://hal.inria.fr/hal-03189354/file/Paper_1854.pdf
https://hal.inria.fr/hal-03189354
https://hal.inria.fr/hal-03189354/document
BASE
6
Evaluation of Speaker Anonymization on Emotional Speech
In: 1st ISCA Symposium on Security and Privacy in Speech Communication ; https://hal.inria.fr/hal-03377797 ; 1st ISCA Symposium on Security and Privacy in Speech Communication, Nov 2021, Virtual, Germany (2021)
BASE
7
Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02907929 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
BASE
8
Duration modelling and evaluation for Arabic statistical parametric speech synthesis
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.inria.fr/hal-03007287 ; Multimedia Tools and Applications, Springer Verlag, 2020, ⟨10.1007/s11042-020-09901-7⟩ (2020)
BASE
9
Speaker information modification in the VoicePrivacy 2020 toolchain
In: https://hal.archives-ouvertes.fr/hal-02995855 ; [Research Report] INRIA Nancy, équipe Multispeech; LIUM - Laboratoire d'Informatique de l'Université du Mans. 2020 (2020)
BASE
10
Correlation between prosody and pragmatics: case study of discourse markers in French and English
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02968475 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
BASE
11
Extractive Text-Based Summarization of Arabic videos: Issues, Approaches and Evaluations
In: ICALP: International Conference on Arabic Language Processing ; https://hal.archives-ouvertes.fr/hal-02314238 ; ICALP: International Conference on Arabic Language Processing, Oct 2019, Nancy, France. pp.65-78, ⟨10.1007/978-3-030-32959-4_5⟩ (2019)
BASE
12
F0 modeling using DNN for Arabic parametric speech synthesis
In: INNSBDDL 2019 - INNS Big Data and Deep Learning ; https://hal.inria.fr/hal-02177496 ; INNSBDDL 2019 - INNS Big Data and Deep Learning, Apr 2019, Sestri Levante, Italy (2019)
BASE
13
A Fine-grained Multilingual Analysis Based on the Appraisal Theory: Application to Arabic and English Videos
In: ICALP: International Conference on Arabic Language Processing ; https://hal.archives-ouvertes.fr/hal-02314244 ; ICALP: International Conference on Arabic Language Processing, Oct 2019, Nancy, France. pp.49-61, ⟨10.1007/978-3-030-32959-4_4⟩ (2019)
BASE
14
Machine Translation on a parallel Code-Switched Corpus
In: Canadian AI 2019 - 32nd Conference on Canadian Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02106010 ; Canadian AI 2019 - 32nd Conference on Canadian Artificial Intelligence, May 2019, Ontario, Canada (2019)
BASE
15
Summarizing videos into a target language: Methodology, architectures and evaluation
In: ISSN: 1064-1246 ; EISSN: 1875-8967 ; Journal of Intelligent and Fuzzy Systems ; https://hal.archives-ouvertes.fr/hal-02271287 ; Journal of Intelligent and Fuzzy Systems, IOS Press, 2019, 1, pp.1-12. ⟨10.3233/JIFS-179350⟩ (2019)
BASE
16
Speech Processing and Prosody
In: TSD 2019 - 22nd International Conference of Text, Speech and Dialogue ; https://hal.inria.fr/hal-02177210 ; TSD 2019 - 22nd International Conference of Text, Speech and Dialogue, Sep 2019, Ljubljana, Slovenia (2019)
BASE
17
Can prosody meet pragmatics? Case of discourse particles in French
In: ICPhS 2019 - International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02177202 ; ICPhS 2019 - International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
BASE
18
Adaptation of speech recognition vocabularies for improved transcription of YouTube videos
In: ISSN: 2351-8715 ; Journal of International Science and General Applications ; https://hal.archives-ouvertes.fr/hal-01873801 ; Journal of International Science and General Applications, ISGA, 2018, 1 (1), pp.1-9 ; http://journal-isga.ma/ (2018)
BASE
19
Duration modeling using DNN for Arabic speech synthesis
In: 9th International Conference on Speech Prosody ; https://hal.inria.fr/hal-01889917 ; 9th International Conference on Speech Prosody, Jun 2018, Poznań, Poland (2018)
BASE
20
A Proposed Methodology for Subjective Evaluation of Video and Text Summarization
In: MISSI 2018 - 11th edition of the International Conference on Multimedia and Network Information Systems ; https://hal.archives-ouvertes.fr/hal-01873685 ; MISSI 2018 - 11th edition of the International Conference on Multimedia and Network Information Systems, Sep 2018, Wrocław, Poland. pp.396-404, ⟨10.1007/978-3-319-98678-4_40⟩ (2018)
BASE
