DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...13
Hits 1 – 20 of 260

1
Automatic Speech Recognition and Query By Example for Creole Languages Documentation
In: Findings of the Association for Computational Linguistics: ACL 2022 ; https://hal.archives-ouvertes.fr/hal-03625303 ; Findings of the Association for Computational Linguistics: ACL 2022, May 2022, Dublin, Ireland (2022)
BASE
Show details
2
Cross-Situational Learning Towards Robot Grounding
In: https://hal.archives-ouvertes.fr/hal-03628290 ; 2022 (2022)
BASE
Show details
3
Cross-Situational Learning Towards Robot Grounding
In: https://hal.archives-ouvertes.fr/hal-03628290 ; 2022 (2022)
BASE
Show details
4
Emergent Communication for Understanding Human Language Evolution: What's Missing? ...
Galke, Lukas; Ram, Yoav; Raviv, Limor. - : arXiv, 2022
BASE
Show details
5
Multimodal neural networks better explain multivoxel patterns in the hippocampus ...
BASE
Show details
6
End-to-end speaker segmentation for overlap-aware resegmentation
In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
Abstract: International audience ; Speaker segmentation consists in partitioning a conversation between one or more speakers into speaker turns. Usually addressed as the late combination of three sub-tasks (voice activity detection, speaker change detection, and overlapped speech detection), we propose to train an end-to-end segmentation model that does it directly. Inspired by the original end-to-end neural speaker diarization approach (EEND), the task is modeled as a multi-label classification problem using permutation-invariant training. The main difference is that our model operates on short audio chunks (5 seconds) but at a much higher temporal resolution (every 16ms). Experiments on multiple speaker diarization datasets conclude that our model can be used with great success on both voice activity detection and overlapped speech detection. Our proposed model can also be used as a post-processing step, to detect and correctly assign overlapped speech regions. Relative diarization error rate improvement over the best considered baseline (VBx) reaches 17% on AMI, 13% on DIHARD 3, and 13% on VoxConverse.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; overlapped speech detection; resegmentation; speaker diarization; speaker segmentation; voice activity detection
URL: https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524/document
https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524/file/2104.04045.pdf
https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524
BASE
Hide details
7
High-resolution speaker counting in reverberant rooms using CRNN with Ambisonics features
In: EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO) ; https://hal.archives-ouvertes.fr/hal-03537323 ; EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands. pp.71-75, ⟨10.23919/Eusipco47968.2020.9287637⟩ (2021)
BASE
Show details
8
Tackling Morphological Analogies Using Deep Learning -- Extended Version
In: https://hal.inria.fr/hal-03425776 ; 2021 (2021)
BASE
Show details
9
Recognizing lexical units in low-resource language contexts with supervised and unsupervised neural networks
In: https://hal.archives-ouvertes.fr/hal-03429051 ; [Research Report] LACITO (UMR 7107). 2021 (2021)
BASE
Show details
10
What does the Canary Say? Low-Dimensional GAN Applied to Birdsong
In: https://hal.inria.fr/hal-03244723 ; 2021 (2021)
BASE
Show details
11
What does the Canary Say? Low-Dimensional GAN Applied to Birdsong
In: https://hal.inria.fr/hal-03244723 ; 2021 (2021)
BASE
Show details
12
Artificial Text Detection via Examining the Topology of Attention Maps
In: ACL Anthology ; Empirical Methods in Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-03456191 ; Empirical Methods in Natural Language Processing, ACL (Association for Computational Linguistics), Nov 2021, Punta Cana, Dominican Republic (2021)
BASE
Show details
13
Modeling the neural network responsible for song learning ; Modélisation du réseau neuronal responsable de l'apprentissage du chant chez l'oiseau chanteur
Pagliarini, Silvia. - : HAL CCSD, 2021
In: https://tel.archives-ouvertes.fr/tel-03217834 ; Modeling and Simulation. Université de Bordeaux, 2021. English. ⟨NNT : 2021BORD0107⟩ (2021)
BASE
Show details
14
Multimodal Coarticulation Modeling : Towards the animation of an intelligible talking head ; Modélisation de la coarticulation multimodale : vers l'animation d'une tête parlante intelligible
Biasutto-Lervat, Théo. - : HAL CCSD, 2021
In: https://hal.univ-lorraine.fr/tel-03203815 ; Intelligence artificielle [cs.AI]. Université de Lorraine, 2021. Français. ⟨NNT : 2021LORR0019⟩ (2021)
BASE
Show details
15
Impact of Segmentation and Annotation in French end-to-end Synthesis
In: Proc. 11th ISCA Speech Synthesis Workshop (SSW 11) ; SSW 11th ISCA Speech Synthesis Workshop ; https://hal.archives-ouvertes.fr/hal-03362000 ; SSW 11th ISCA Speech Synthesis Workshop, Aug 2021, Budapest, Hungary. pp.13-18, ⟨10.21437/SSW.2021-3⟩ ; https://ssw11.hte.hu/ (2021)
BASE
Show details
16
Which Hype for my New Task? Hints and Random Search for Reservoir Computing Hyperparameters
In: ICANN 2021 - 30th International Conference on Artificial Neural Networks ; https://hal.inria.fr/hal-03203318 ; ICANN 2021 - 30th International Conference on Artificial Neural Networks, Sep 2021, Bratislava, Slovakia (2021)
BASE
Show details
17
Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs
In: https://hal.inria.fr/hal-03203374 ; 2021 (2021)
BASE
Show details
18
Which Hype for my New Task? Hints and Random Search for Reservoir Computing Hyperparameters
In: https://hal.inria.fr/hal-03203318 ; 2021 (2021)
BASE
Show details
19
Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs
In: ICANN 2021 - 30th International Conference on Artificial Neural Networks ; https://hal.inria.fr/hal-03203374 ; ICANN 2021 - 30th International Conference on Artificial Neural Networks, Sep 2021, Bratislava, Slovakia. pp.71--82, ⟨10.1007/978-3-030-86383-8_6⟩ ; https://link.springer.com/chapter/10.1007/978-3-030-86383-8_6 (2021)
BASE
Show details
20
On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition
In: IEEE Spoken Language Technology Workshop ; https://hal.archives-ouvertes.fr/hal-03003469 ; IEEE Spoken Language Technology Workshop, Jan 2021, Virtual, China (2021)
BASE
Show details

Page: 1 2 3 4 5...13

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
260
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern