Search in the Catalogues and Directories







Hits 1 – 20 of 260

1	Automatic Speech Recognition and Query By Example for Creole Languages Documentation
	MACAIRE, Cécile; Schwab, Didier; Lecouteux, Benjamin...
	In: Findings of the Association for Computational Linguistics: ACL 2022 ; https://hal.archives-ouvertes.fr/hal-03625303 ; Findings of the Association for Computational Linguistics: ACL 2022, May 2022, Dublin, Ireland (2022)
	BASE

2	Cross-Situational Learning Towards Robot Grounding
	OOTA, Subba Reddy; Alexandre, Frédéric; Hinaut, Xavier
	In: https://hal.archives-ouvertes.fr/hal-03628290 ; 2022 (2022)
	BASE

4	Emergent Communication for Understanding Human Language Evolution: What's Missing? ...
	Galke, Lukas; Ram, Yoav; Raviv, Limor. - : arXiv, 2022
	BASE

5	Multimodal neural networks better explain multivoxel patterns in the hippocampus ...
	Choksi, Bhavin; Mozafari, Milad; VanRullen, Rufin. - : arXiv, 2022
	BASE

6	End-to-end speaker segmentation for overlap-aware resegmentation
	Bredin, Hervé; Laurent, Antoine
	In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
	BASE

7	High-resolution speaker counting in reverberant rooms using CRNN with Ambisonics features
	Grumiaux, Pierre-Amaury; Kitic, Srdan; Girin, Laurent...
	In: EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO) ; https://hal.archives-ouvertes.fr/hal-03537323 ; EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands. pp.71-75, ⟨10.23919/Eusipco47968.2020.9287637⟩ (2021)
	BASE

8	Tackling Morphological Analogies Using Deep Learning -- Extended Version
	Alsaidi, Safa; Decker, Amandine; Marquer, Esteban...
	In: https://hal.inria.fr/hal-03425776 ; 2021 (2021)
	BASE

9	Recognizing lexical units in low-resource language contexts with supervised and unsupervised neural networks
	MACAIRE, Cécile
	In: https://hal.archives-ouvertes.fr/hal-03429051 ; [Research Report] LACITO (UMR 7107). 2021 (2021)
	BASE

10	What does the Canary Say? Low-Dimensional GAN Applied to Birdsong
	Pagliarini, Silvia; Trouvain, Nathan; Leblois, Arthur...
	In: https://hal.inria.fr/hal-03244723 ; 2021 (2021)
	BASE

12	Artificial Text Detection via Examining the Topology of Attention Maps
	Kushnareva, Laida; Cherniavskii, Daniil; Mikhailov, Vladislav...
	In: ACL Anthology ; Empirical Methods in Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-03456191 ; Empirical Methods in Natural Language Processing, ACL (Association for Computational Linguistics), Nov 2021, Punta Cana, Dominican Republic (2021)
	BASE

13	Modeling the neural network responsible for song learning ; Modélisation du réseau neuronal responsable de l'apprentissage du chant chez l'oiseau chanteur
	Pagliarini, Silvia. - : HAL CCSD, 2021
	In: https://tel.archives-ouvertes.fr/tel-03217834 ; Modeling and Simulation. Université de Bordeaux, 2021. English. ⟨NNT : 2021BORD0107⟩ (2021)
	BASE

14	Multimodal Coarticulation Modeling : Towards the animation of an intelligible talking head ; Modélisation de la coarticulation multimodale : vers l'animation d'une tête parlante intelligible
	Biasutto-Lervat, Théo. - : HAL CCSD, 2021
	In: https://hal.univ-lorraine.fr/tel-03203815 ; Intelligence artificielle [cs.AI]. Université de Lorraine, 2021. Français. ⟨NNT : 2021LORR0019⟩ (2021)
	BASE

15	Impact of Segmentation and Annotation in French end-to-end Synthesis
	Lenglet, Martin; Perrotin, Olivier; Bailly, Gérard
	In: Proc. 11th ISCA Speech Synthesis Workshop (SSW 11) ; SSW 11th ISCA Speech Synthesis Workshop ; https://hal.archives-ouvertes.fr/hal-03362000 ; SSW 11th ISCA Speech Synthesis Workshop, Aug 2021, Budapest, Hungary. pp.13-18, ⟨10.21437/SSW.2021-3⟩ ; https://ssw11.hte.hu/ (2021)
	BASE

16	Which Hype for my New Task? Hints and Random Search for Reservoir Computing Hyperparameters
	Hinaut, Xavier; Trouvain, Nathan
	In: ICANN 2021 - 30th International Conference on Artificial Neural Networks ; https://hal.inria.fr/hal-03203318 ; ICANN 2021 - 30th International Conference on Artificial Neural Networks, Sep 2021, Bratislava, Slovakia (2021)
	BASE

17	Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs
	Trouvain, Nathan; Hinaut, Xavier
	In: https://hal.inria.fr/hal-03203374 ; 2021 (2021)
	BASE

18	Which Hype for my New Task? Hints and Random Search for Reservoir Computing Hyperparameters
	Hinaut, Xavier; Trouvain, Nathan
	In: https://hal.inria.fr/hal-03203318 ; 2021 (2021)
	BASE

19	Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs
	Trouvain, Nathan; Hinaut, Xavier
	In: ICANN 2021 - 30th International Conference on Artificial Neural Networks ; https://hal.inria.fr/hal-03203374 ; ICANN 2021 - 30th International Conference on Artificial Neural Networks, Sep 2021, Bratislava, Slovakia. pp.71-82, ⟨10.1007/978-3-030-86383-8_6⟩ ; https://link.springer.com/chapter/10.1007/978-3-030-86383-8_6 (2021)
	BASE

20	On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition
	Macary, Manon; Tahon, Marie; Estève, Yannick; Rousseau, Anthony
	In: IEEE Spoken Language Technology Workshop ; https://hal.archives-ouvertes.fr/hal-03003469 ; IEEE Spoken Language Technology Workshop, Jan 2021, Virtual, China (2021)
	Abstract: Accepted in IEEE SLT 2021 ; International audience ; Pre-training for feature extraction is an increasingly studied approach to obtaining better continuous representations of audio and text content. In the present work, we use wav2vec and CamemBERT as self-supervised pre-trained models to represent our data in order to perform continuous speech emotion recognition (SER) on AlloSat, a large French emotional database describing the satisfaction dimension, and on the state-of-the-art corpus SEWA, which focuses on the valence, arousal, and liking dimensions. To the authors' knowledge, this paper presents the first study showing that the joint use of wav2vec and BERT-like pre-trained features is highly relevant to the continuous SER task, which is usually characterized by a small amount of labeled training data. Evaluated with the well-known concordance correlation coefficient (CCC), our experiments show that we can reach a CCC value of 0.825, instead of 0.592 when using MFCC in conjunction with word2vec word embeddings, on the AlloSat dataset.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; [INFO]Computer Science [cs]; CamemBERT; Continuous Speech Emotion Recognition; Pre-trained feature extraction; Wav2vec
	URL: https://hal.archives-ouvertes.fr/hal-03003469/document https://hal.archives-ouvertes.fr/hal-03003469/file/2011.09212.pdf https://hal.archives-ouvertes.fr/hal-03003469
	BASE
