DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...19
Hits 1 – 20 of 380

1
A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
BASE
Show details
2
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
BASE
Show details
3
Etude de cas de pathologies de la parole dans le cadre de la prise en charge orthophonique
In: https://hal.archives-ouvertes.fr/hal-03568182 ; 2022 (2022)
BASE
Show details
4
Automatic assessment of oral readings of young pupils
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-03585934 ; Speech Communication, Elsevier : North-Holland, 2022, 138, pp.67-79. ⟨10.1016/j.specom.2022.01.008⟩ ; https://www.sciencedirect.com/science/article/pii/S0167639322000164?via%3Dihub (2022)
BASE
Show details
5
Automatic Speech Recognition systems errors for accident-prone sleepiness detection through voice
In: EUSIPCO 2021 ; https://hal.archives-ouvertes.fr/hal-03324033 ; EUSIPCO 2021, Aug 2021, Dublin (en ligne), Ireland. ⟨10.23919/EUSIPCO54536.2021.9616299⟩ (2021)
BASE
Show details
6
Automatic Speech Recognition systems errors for objective sleepiness detection through voice
In: Proceedings Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03328827 ; Interspeech 2021, Aug 2021, Brno (virtual), Czech Republic. pp.2476-2480, ⟨10.21437/Interspeech.2021-291⟩ (2021)
BASE
Show details
7
Introducing an experimental distortion-tolerant speech encryption scheme for secure voice communication
In: https://hal.archives-ouvertes.fr/hal-03445994 ; 2021 (2021)
BASE
Show details
8
Spat~ : a comprehensive toolbox for sound spatialization in Max
In: ISSN: 2317-9694 ; Ideas Sonicas ; https://hal.archives-ouvertes.fr/hal-03356292 ; Ideas Sonicas, João Pedro Oliveira, 2021, Electroacoustic Space - Reflections - Tools for its design, 13 (24), pp.12 - 23 ; http://sonicideas.org (2021)
BASE
Show details
9
Re-synchronization using the Hand Preceding Model for Multi-modal Fusion in Automatic Continuous Cued Speech Recognition
In: ISSN: 1520-9210 ; IEEE Transactions on Multimedia ; https://hal.archives-ouvertes.fr/hal-02433830 ; IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2021, 23, pp.292-305. ⟨10.1109/TMM.2020.2976493⟩ (2021)
BASE
Show details
10
Human Beatbox: from extreme use of voice and speech to its use in speech therapy ; Le Human Beatbox : d’une utilisation extrême de la voix et de la parole à son utilité en orthophonie
In: ISSN: 0034-222X ; Rééducation orthophonique ; https://hal.archives-ouvertes.fr/hal-03377693 ; Rééducation orthophonique, Ortho édition, 2021, Rééducation orthophonique n°286 - Les phonations : sur la voie des voix, 286 ; https://www.orthoedition.com/revues/n-les-phonations-sur-la-voie-des-voix-4341.html (2021)
BASE
Show details
11
Analyse objective de la parole dysarthrique : évaluation d’une sélection d’indices acoustiques
In: https://hal.archives-ouvertes.fr/hal-03139503 ; 2021 (2021)
BASE
Show details
12
Automatic risk detection system by audiovisual signal processing ; Système de détection automatique de risques par traitement de signaux audiovisuels
Bendjoudi, Ilyes. - : HAL CCSD, 2021
In: https://tel.archives-ouvertes.fr/tel-03602318 ; Signal and Image processing. Université Polytechnique Hauts-de-France; Institut national des sciences appliquées Hauts-de-France, 2021. English. ⟨NNT : 2021UPHF0040⟩ (2021)
BASE
Show details
13
Leveraging lyrics from audio for MIR ; Exploiter les paroles de chansons à partir de l'audio pour le MIR
Vaglio, Andrea. - : HAL CCSD, 2021
In: https://tel.archives-ouvertes.fr/tel-03558515 ; Signal and Image processing. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAT027⟩ (2021)
BASE
Show details
14
A bio-inspired geometric model for sound reconstruction
In: ISSN: 2190-8567 ; Journal of Mathematical Neuroscience ; https://hal.archives-ouvertes.fr/hal-02531537 ; Journal of Mathematical Neuroscience, BioMed Central, 2021, 11 (1), pp.2. ⟨10.1186/s13408-020-00099-4⟩ (2021)
BASE
Show details
15
Photogrammétrie appliquée au végétale : automatisation et post traitement
In: 16èmes Journées de la Mesure et de la Métrologie (J2M) ; https://hal.archives-ouvertes.fr/hal-03644832 ; 16èmes Journées de la Mesure et de la Métrologie (J2M), INRAE, Oct 2021, Ardes sur Couze, France ; http://www7.inra.fr/j2m/fichiers/recueils/j2m_2021.pdf (2021)
BASE
Show details
16
La voce umana, dal respiro al canto
In: ISSN: 2611-5689 ; Bollettino del Laboratorio di Fonetica Sperimentale "Arturo Genre" ; https://hal.archives-ouvertes.fr/hal-03508030 ; Bollettino del Laboratorio di Fonetica Sperimentale "Arturo Genre", Universita di Torino, 2021, https://www.lfsag.unito.it/ricerca/phonews/07/7_3.pdf ; https://www.lfsag.unito.it/ricerca/phonews/index.html (2021)
BASE
Show details
17
Optimization of Dental Devices and Tools used on Teeth
In: BioMed Research International ; https://hal.archives-ouvertes.fr/hal-03253408 ; BioMed Research International, In press, pp.9913788. ⟨10.1155/2021/9913788⟩ (2021)
BASE
Show details
18
Learning emotions latent representation with CVAE for Text-Driven Expressive AudioVisual Speech Synthesis
In: ISSN: 0893-6080 ; Neural Networks ; https://hal.inria.fr/hal-03204193 ; Neural Networks, Elsevier, 2021, 141, pp.315-329. ⟨10.1016/j.neunet.2021.04.021⟩ (2021)
Abstract: International audience ; Great improvement has been made in the field of expressive audiovisual Text-to-Speech synthesis (EAVTTS) thanks to deep learning techniques. However, generating realistic speech is still an open issue and researchers in this area have been focusing lately on controlling the speech variability.In this paper, we use different neural architectures to synthesize emotional speech. We study the application of unsupervised learning techniques for emotional speech modeling as well as methods for restructuring emotions representation to make it continuous and more flexible. This manipulation of the emotional representation should allow us to generate new styles of speech by mixing emotions. We first present our expressive audiovisual corpus. We validate the emotional content of this corpus with three perceptual experiments using acoustic only, visual only and audiovisual stimuli.After that, we analyze the performance of a fully connected neural network in learning characteristics specific to different emotions for the phone duration aspect and the acoustic and visual modalities.We also study the contribution of a joint and separate training of the acoustic and visual modalities in the quality of the generated synthetic speech.In the second part of this paper, we use a conditional variational auto-encoder (CVAE) architecture to learn a latent representation of emotions. We applied this method in an unsupervised manner to generate features of expressive speech. We used a probabilistic metric to compute the overlapping degree between emotions latent clusters to choose the best parameters for the CVAE. By manipulating the latent vectors, we were able to generate nuances of a given emotion and to generate new emotions that do not exist in our database. For these new emotions, we obtain a coherent articulation. We conducted four perceptual experiments to evaluate our findings.
Keyword: [MATH.MATH-MG]Mathematics [math]/Metric Geometry [math.MG]; [SCCO.COMP]Cognitive science/Computer science; [SCCO.LING]Cognitive science/Linguistics; [SDV.OT]Life Sciences [q-bio]/Other [q-bio.OT]; [SHS.INFO]Humanities and Social Sciences/Library and information sciences; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]; bidirectional long short-term memory (BLSTM); conditional variationalauto-encoder; deeplearning; emotion; Expressive audiovisual speech synthesis; Expressive talking avatar; facial expression
URL: https://doi.org/10.1016/j.neunet.2021.04.021
https://hal.inria.fr/hal-03204193/document
https://hal.inria.fr/hal-03204193
https://hal.inria.fr/hal-03204193/file/neural_networks_journal-8.pdf
BASE
Hide details
19
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
In: ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages ; https://halshs.archives-ouvertes.fr/halshs-03030529 ; ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages, Mar 2021, Hawai‘i, United States (2021)
BASE
Show details
20
Humming beatboxing : the vocal orchestra within
In: MAVEBA 2021 - 12th International Workshop Models and Analysis of Vocal Emissions for Biomedical Applications ; https://hal.archives-ouvertes.fr/hal-03510719 ; MAVEBA 2021 - 12th International Workshop Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Dec 2021, Florence, Italy ; http://maveba.dinfo.unifi.it (2021)
BASE
Show details

Page: 1 2 3 4 5...19

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
380
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern