1 |
Evaluation of Speaker Anonymization on Emotional Speech ; Analyse de l'anonymisation du locuteur sur de la parole émotionnelle
|
|
|
|
In: JEP2022 - Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-03636737 ; JEP2022 - Journées d'Études sur la Parole, Jun 2022, Île de Noirmoutier, France (2022)
|
|
BASE
|
|
Show details
|
|
2 |
A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender
|
|
|
|
In: PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02995862 ; PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence, Feb 2021, Virtual, China (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
|
|
|
|
In: Interspeech ; https://hal.archives-ouvertes.fr/hal-03228823 ; Interspeech, Aug 2021, Brno, Czech Republic (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03228823 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Adapting Language Models When Training on Privacy-Transformed Data
|
|
|
|
In: INTERSPEECH 2021 ; https://hal.inria.fr/hal-03189354 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Evaluation of Speaker Anonymization on Emotional Speech
|
|
|
|
In: 1st ISCA Symposium on Security and Privacy in Speech Communication ; https://hal.inria.fr/hal-03377797 ; 1st ISCA Symposium on Security and Privacy in Speech Communication, Nov 2021, Virtual, Germany (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation
|
|
|
|
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02907929 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Duration modelling and evaluation for Arabic statistical parametric speech synthesis
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.inria.fr/hal-03007287 ; Multimedia Tools and Applications, Springer Verlag, 2020, ⟨10.1007/s11042-020-09901-7⟩ (2020)
|
|
BASE
|
|
Show details
|
|
9 |
Speaker information modification in the VoicePrivacy 2020 toolchain
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02995855 ; [Research Report] INRIA Nancy, équipe Multispeech; LIUM - Laboratoire d'Informatique de l'Université du Mans. 2020 (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Correlation between prosody and pragmatics: case study of discourse markers in French and English
|
|
|
|
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02968475 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Extractive Text-Based Summarization of Arabic videos: Issues, Approaches and Evaluations
|
|
|
|
In: ICALP: International Conference on Arabic Language Processing ; https://hal.archives-ouvertes.fr/hal-02314238 ; ICALP: International Conference on Arabic Language Processing, Oct 2019, Nancy, France. pp.65-78, ⟨10.1007/978-3-030-32959-4_5⟩ (2019)
|
|
BASE
|
|
Show details
|
|
12 |
F0 modeling using DNN for Arabic parametric speech synthesis
|
|
|
|
In: INNSBDDL 2019 - INNS Big Data and Deep Learning ; https://hal.inria.fr/hal-02177496 ; INNSBDDL 2019 - INNS Big Data and Deep Learning, Apr 2019, Sestri Levante, Italy (2019)
|
|
BASE
|
|
Show details
|
|
13 |
A Fine-grained Multilingual Analysis Based on the Appraisal Theory: Application to Arabic and English Videos
|
|
|
|
In: ICALP: International Conference on Arabic Language Processing ; https://hal.archives-ouvertes.fr/hal-02314244 ; ICALP: International Conference on Arabic Language Processing, Oct 2019, Nancy, France. pp.49-61, ⟨10.1007/978-3-030-32959-4_4⟩ (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Machine Translation on a parallel Code-Switched Corpus
|
|
|
|
In: Canadian AI 2019 - 32nd Conference on Canadian Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02106010 ; Canadian AI 2019 - 32nd Conference on Canadian Artificial Intelligence, May 2019, Ontario, Canada (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Summarizing videos into a target language: Methodology, architectures and evaluation
|
|
|
|
In: ISSN: 1064-1246 ; EISSN: 1875-8967 ; Journal of Intelligent and Fuzzy Systems ; https://hal.archives-ouvertes.fr/hal-02271287 ; Journal of Intelligent and Fuzzy Systems, IOS Press, 2019, 1, pp.1-12. ⟨10.3233/JIFS-179350⟩ (2019)
|
|
BASE
|
|
Show details
|
|
16 |
Speech Processing and Prosody
|
|
|
|
In: TSD 2019 - 22nd International Conference of Text, Speech and Dialogue ; https://hal.inria.fr/hal-02177210 ; TSD 2019 - 22nd International Conference of Text, Speech and Dialogue, Sep 2019, Ljubljana, Slovenia (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Can prosody meet pragmatics? Case of discourse particles in French
|
|
|
|
In: ICPhS 2019 - International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-02177202 ; ICPhS 2019 - International Congress of Phonetic Sciences, Aug 2019, Melbourne, Australia (2019)
|
|
BASE
|
|
Show details
|
|
18 |
Adaptation of speech recognition vocabularies for improved transcription of YouTube videos
|
|
|
|
In: ISSN: 2351-8715 ; Journal of International Science and General Applications ; https://hal.archives-ouvertes.fr/hal-01873801 ; Journal of International Science and General Applications, ISGA, 2018, 1 (1), pp.1-9 ; http://journal-isga.ma/ (2018)
|
|
Abstract:
International audience ; This paper discusses the adaptation of speech recognition vocabularies for automatic speech transcription. The context is the transcription of YouTube videos in French, English and Arabic. Base-line automatic speech recognition systems have been developed using previously available data. However, the available text data, including the GigaWord corpora from LDC, are getting quite old with respect to recent YouTube videos that are to be transcribed. After a discussion on the performance of the ASR baseline systems, the paper presents the collection of recent textual data from internet for updating the speech recognition vocabularies and for training the language models, as well as the elaboration of development data sets necessary for the vocabulary selection process. The paper also compares the coverage of the training data collected from internet, and of the GigaWord data, with finite size vocabularies made of the most frequent words. Finally, the paper presents and discusses the amount of out-of-vocabulary word occurrences, before and after the update of the speech recognition vocabularies, for the three languages. Moreover, some speech recognition evaluation results are provided and analyzed.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
|
|
URL: https://hal.archives-ouvertes.fr/hal-01873801/document https://hal.archives-ouvertes.fr/hal-01873801/file/DENIS.pdf https://hal.archives-ouvertes.fr/hal-01873801
|
|
BASE
|
|
Hide details
|
|
19 |
Duration modeling using DNN for Arabic speech synthesis
|
|
|
|
In: 9th International Conference on Speech Prosody ; https://hal.inria.fr/hal-01889917 ; 9th International Conference on Speech Prosody, Jun 2018, Poznań, Poland (2018)
|
|
BASE
|
|
Show details
|
|
20 |
A Proposed Methodology for Subjective Evaluation of Video and Text Summarization
|
|
|
|
In: MISSI 2018 - 11th edition of the International Conference on Multimedia and Network Information Systems ; https://hal.archives-ouvertes.fr/hal-01873685 ; MISSI 2018 - 11th edition of the International Conference on Multimedia and Network Information Systems, Sep 2018, Wrocław, Poland. pp.396-404, ⟨10.1007/978-3-319-98678-4_40⟩ (2018)
|
|
BASE
|
|
Show details
|
|
|
|