1 |
A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
4 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Etude de cas de pathologies de la parole dans le cadre de la prise en charge orthophonique
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03568182 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Differentially private speaker anonymization
|
|
|
|
In: https://hal.inria.fr/hal-03588932 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Automatic assessment of oral readings of young pupils
|
|
|
|
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-03585934 ; Speech Communication, Elsevier : North-Holland, 2022, 138, pp.67-79. ⟨10.1016/j.specom.2022.01.008⟩ ; https://www.sciencedirect.com/science/article/pii/S0167639322000164?via%3Dihub (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Unsupervised quantification of entity consistency between photos and text in real-world news ...
|
|
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
|
|
BASE
|
|
Show details
|
|
10 |
Principles of Learning in Multitask Settings: A Probabilistic Perspective ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Principles of Learning in Multitask Settings: A Probabilistic Perspective ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Cross-view Brain Decoding ...
|
|
|
|
Abstract:
How the brain captures the meaning of linguistic stimuli across multiple views is still a critical open question in neuroscience. Consider three different views of the concept apartment: (1) picture (WP) presented with the target word label, (2) sentence (S) using the target word, and (3) word cloud (WC) containing the target word along with other semantically related words. Unlike previous efforts, which focus only on single view analysis, in this paper, we study the effectiveness of brain decoding in a zero-shot cross-view learning setup. Further, we propose brain decoding in the novel context of cross-view-translation tasks like image captioning (IC), image tagging (IT), keyword extraction (KE), and sentence formation (SF). Using extensive experiments, we demonstrate that cross-view zero-shot brain decoding is practical leading to ~0.68 average pairwise accuracy across view pairs. Also, the decoded representations are sufficiently detailed to enable high accuracy for cross-view-translation tasks with ... : 11 pages, 10 figures ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; Computer Vision and Pattern Recognition cs.CV; FOS Biological sciences; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Image and Video Processing eess.IV; Machine Learning cs.LG; Neurons and Cognition q-bio.NC
|
|
URL: https://dx.doi.org/10.48550/arxiv.2204.09564 https://arxiv.org/abs/2204.09564
|
|
BASE
|
|
Hide details
|
|
15 |
Who has ears, listen: Citizen Listening Program for disease prevention. ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Who has ears, listen: Citizen Listening Program for disease prevention. ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
|
|
|
|
In: Sensors; Volume 22; Issue 5; Pages: 1751 (2022)
|
|
BASE
|
|
Show details
|
|
18 |
Connecting Text Classification with Image Classification: A New Preprocessing Method for Implicit Sentiment Text Classification
|
|
|
|
In: Sensors; Volume 22; Issue 5; Pages: 1899 (2022)
|
|
BASE
|
|
Show details
|
|
19 |
A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender
|
|
|
|
In: PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02995862 ; PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence, Feb 2021, Virtual, China (2021)
|
|
BASE
|
|
Show details
|
|
20 |
Assessment of adult speech disorders: current situation and needs in French-speaking clinical practice
|
|
|
|
In: ISSN: 1401-5439 ; Logopedics Phoniatrics Vocology ; https://hal.archives-ouvertes.fr/hal-03120115 ; Logopedics Phoniatrics Vocology, Taylor & Francis, 2021, pp.1-15. ⟨10.1080/14015439.2020.1870245⟩ (2021)
|
|
BASE
|
|
Show details
|
|
|
|