Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...58

Hits 1 – 20 of 1.153

1	A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
	Bous, Frederik; Roebel, Axel
	In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
	BASE
	Show details

2	Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
	Roebel, Axel; Bous, Frederik
	In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
	BASE
	Show details

3	Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
	Sankar, Sanjana; Beautemps, Denis; Hueber, Thomas
	In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapour, Singapore (2022)
	BASE
	Show details

4	An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
	Dey, Spandan; Sahidullah, Md; Saha, Goutam
	In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
	BASE
	Show details

5	Etude de cas de pathologies de la parole dans le cadre de la prise en charge orthophonique
	Sicard, Etienne; Menin-Sicard, Anne; Michel, Sandrine...
	In: https://hal.archives-ouvertes.fr/hal-03568182 ; 2022 (2022)
	BASE
	Show details

6	Computational models of disfluencies : fillers and discourse markers in spoken language understanding ; Modèles computationnels des disfluences dans le traitement de la parole
	Dinkar, Tanvi. - : HAL CCSD, 2022
	In: https://tel.archives-ouvertes.fr/tel-03653211 ; Computer science. Institut Polytechnique de Paris, 2022. English. ⟨NNT : 2022IPPAT001⟩ (2022)
	BASE
	Show details

7	Differentially private speaker anonymization
	Shamsabadi, Ali Shahin; Srivastava, Brij Mohan Lal; Bellet, Aurélien; Vauquier, Nathalie; Vincent, Emmanuel; Maouche, Mohamed; Tommasi, Marc; Papernot, Nicolas
	In: https://hal.inria.fr/hal-03588932 ; 2022 (2022)
	Abstract: Sharing real-world speech utterances is key to the training and deployment of voice-based services. However, it also raises privacy risks as speech contains a wealth of personal data. Speaker anonymization aims to remove speaker information from a speech utterance while leaving its linguistic and prosodic attributes intact. State-of-the-art techniques operate by disentangling the speaker information (represented via a speaker embedding) from these attributes and re-synthesizing speech based on the speaker embedding of another speaker. Prior research in the privacy community has shown that anonymization often provides brittle privacy protection, even less so any provable guarantee. In this work, we show that disentanglement is indeed not perfect: linguistic and prosodic attributes still contain speaker information. We remove speaker information from these attributes by introducing differentially private feature extractors based on an autoencoder and an automatic speech recognizer, respectively, trained using noise layers. We plug these extractors in the state-of-the-art anonymization pipeline and generate, for the first time, differentially private utterances with a provable upper bound on the speaker information they contain. We evaluate empirically the privacy and utility resulting from our differentially private speaker anonymization approach on the LibriSpeech data set. Experimental results show that the generated utterances retain very high utility for automatic speech recognition training and inference, while being much better protected against strong adversaries who leverage the full knowledge of the anonymization process to try to infer the speaker identity.
	Keyword: [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
	URL: https://hal.inria.fr/hal-03588932
	BASE
	Hide details

8	Automatic assessment of oral readings of young pupils
	Bailly, Gérard; Godde, Erika; Piat-Marchand, Anne-Laure...
	In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-03585934 ; Speech Communication, Elsevier : North-Holland, 2022, 138, pp.67-79. ⟨10.1016/j.specom.2022.01.008⟩ ; https://www.sciencedirect.com/science/article/pii/S0167639322000164?via%3Dihub (2022)
	BASE
	Show details

9	Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition ...
	Lian, Jiachen; Black, Alan W; Goldstein, Louis. - : arXiv, 2022
	BASE
	Show details

10	Decoding Neural Correlation of Language-Specific Imagined Speech using EEG Signals ...
	Lee, Keon-Woo; Lee, Dae-Hyeok; Kim, Sung-Jin. - : arXiv, 2022
	BASE
	Show details

11	Multi Antenna Radar System for American Sign Language (ASL) Recognition Using Deep Learning ...
	MacLaughlin, Gavin; Malcolm, Jack; Hamza, Syed Ali. - : arXiv, 2022
	BASE
	Show details

12	Effect of Kinematics and Fluency in Adversarial Synthetic Data Generation for ASL Recognition with RF Sensors ...
	Rahman, M. M.; Malaia, E.; Gurbuz, A. C.. - : arXiv, 2022
	BASE
	Show details

13	Language vs Speaker Change: A Comparative Study ...
	Mishra, Jagabandhu; Prasanna, S. R. Mahadeva. - : arXiv, 2022
	BASE
	Show details

14	Separate What You Describe: Language-Queried Audio Source Separation ...
	Liu, Xubo; Liu, Haohe; Kong, Qiuqiang. - : arXiv, 2022
	BASE
	Show details

15	CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations ...
	Sachidananda, Vin; Tseng, Shao-Yen; Marchi, Erik. - : arXiv, 2022
	BASE
	Show details

16	Mutual Understanding in Situated Interactions with Conversational User Interfaces : Theory, Studies, and Computation
	Kontogiorgos, Dimosthenis. - : KTH, Tal-kommunikation, 2022. : Stockholm, 2022
	BASE
	Show details

17	Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
	Bartosz Kopczynski; Ewa Niebudek-Bogusz; Wioletta Pietruszewska; Pawel Strumillo
	In: Sensors; Volume 22; Issue 5; Pages: 1751 (2022)
	BASE
	Show details

18	A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender
	Champion, Pierre; Jouvet, Denis; Larcher, Anthony
	In: PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02995862 ; PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence, Feb 2021, Virtual, China (2021)
	BASE
	Show details

19	Assessment of adult speech disorders: current situation and needs in French-speaking clinical practice
	Pommée, Timothy; Balaguer, Mathieu; Mauclair, Julie...
	In: ISSN: 1401-5439 ; Logopedics Phoniatrics Vocology ; https://hal.archives-ouvertes.fr/hal-03120115 ; Logopedics Phoniatrics Vocology, Taylor & Francis, 2021, pp.1-15. ⟨10.1080/14015439.2020.1870245⟩ (2021)
	BASE
	Show details

20	Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
	Sen, Nirmalya; Sahidullah, Md; Patil, Hemant...
	In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.archives-ouvertes.fr/hal-03232723 ; International Journal of Speech Technology, Springer Verlag, In press, ⟨10.1007/s10772-021-09862-8⟩ (2021)
	BASE
	Show details

Page: 1 2 3 4 5...58

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern