DE eng

Search in the Catalogues and Directories

Hits 1 – 15 of 15

1
MAGIC DUST FOR CROSS-LINGUAL ADAPTATION OF MONOLINGUAL WAV2VEC-2.0
In: ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03544515 ; ICASSP 2022, May 2022, Singapour, Singapore (2022)
BASE
Show details
2
End-to-end speaker segmentation for overlap-aware resegmentation
In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
Abstract: International audience ; Speaker segmentation consists in partitioning a conversation between one or more speakers into speaker turns. Usually addressed as the late combination of three sub-tasks (voice activity detection, speaker change detection, and overlapped speech detection), we propose to train an end-to-end segmentation model that does it directly. Inspired by the original end-to-end neural speaker diarization approach (EEND), the task is modeled as a multi-label classification problem using permutation-invariant training. The main difference is that our model operates on short audio chunks (5 seconds) but at a much higher temporal resolution (every 16ms). Experiments on multiple speaker diarization datasets conclude that our model can be used with great success on both voice activity detection and overlapped speech detection. Our proposed model can also be used as a post-processing step, to detect and correctly assign overlapped speech regions. Relative diarization error rate improvement over the best considered baseline (VBx) reaches 17% on AMI, 13% on DIHARD 3, and 13% on VoxConverse.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; overlapped speech detection; resegmentation; speaker diarization; speaker segmentation; voice activity detection
URL: https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524/document
https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524/file/2104.04045.pdf
https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524
BASE
Hide details
3
Transdisciplinary Analysis of a Corpus of French Newsreels: The ANTRACT Project
In: ISSN: 1938-4122 ; Digital Humanities Quarterly ; https://hal.archives-ouvertes.fr/hal-03166755 ; Digital Humanities Quarterly, Alliance of Digital Humanities, 2021, Special Issue on AudioVisual Data in DH, 15 (1) ; http://digitalhumanities.org/dhq/ (2021)
BASE
Show details
4
Where are we in Named Entity Recognition from Speech?
In: 12th International Conference on Language Resources and Evaluation (LREC) ; https://hal.archives-ouvertes.fr/hal-02475026 ; 12th International Conference on Language Resources and Evaluation (LREC), May 2020, Marseille, France ; https://aclanthology.org/2020.lrec-1.556/ (2020)
BASE
Show details
5
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02912029 ; Interspeech 2020, Oct 2020, Shanghai, China (2020)
BASE
Show details
6
Collective memory shapes the organization of individual memories in the medial prefrontal cortex
In: EISSN: 2397-3374 ; Nature Human Behaviour ; https://halshs.archives-ouvertes.fr/halshs-02416130 ; Nature Human Behaviour, Nature Research 2019, ⟨10.1038/s41562-019-0779-z⟩ (2019)
BASE
Show details
7
Effective keyword search for low-resourced conversational speech
In: icassp 2017 ; https://hal.archives-ouvertes.fr/hal-01744176 ; icassp 2017, IEEE, Mar 2017, La Nouvelle Orléans, United States (2017)
BASE
Show details
8
Language Recognition for Dialects and Closely Related Languages
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
BASE
Show details
9
Boosting bonsai trees for efficient features combination : application to speaker role identification
In: Interspeech ; https://hal.inria.fr/hal-01025171 ; Interspeech, Sep 2014, Singapour, Singapore (2014)
BASE
Show details
10
Improving recognition of proper nouns (in ASR) through generation and filtering of phonetic transcriptions
In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01433238 ; Computer Speech and Language, Elsevier, 2014, 28 (4), pp.979-996. ⟨10.1016/j.csl.2014.02.006⟩ (2014)
BASE
Show details
11
Acoustics-Based Phonetic Transcription Method for Proper Nouns
In: International Conference on Spoken Language Processing (ISCA, Interspeech 2010) ; https://hal.archives-ouvertes.fr/hal-01433899 ; International Conference on Spoken Language Processing (ISCA, Interspeech 2010), 2010, Japon (Makuhari), Unknown Region (2010)
BASE
Show details
12
Iterative filtering of phonetic transcriptions of proper nouns
In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009) ; https://hal.archives-ouvertes.fr/hal-01433945 ; IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009), 2009, Taipei, Taiwan. pp.4265--4268 (2009)
BASE
Show details
13
Grapheme to phoneme conversion using an SMT system
In: INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION. ANNUAL CONFERENCE. 10TH 2009. (INTERSPEECH 2009) ; 10th Annual Conference of the International Speech Communication Association 2009 (INTERSPEECH 2009) ; https://hal.archives-ouvertes.fr/hal-01451534 ; 10th Annual Conference of the International Speech Communication Association 2009 (INTERSPEECH 2009) , Sep 2009, Brighton, United Kingdom. pp.716-719 (2009)
BASE
Show details
14
Combinaison de systèmes pour la phonétisation automatique de noms propres
In: XXVIIe Journées d'étude sur la parole (JEP 2008) ; https://hal.archives-ouvertes.fr/hal-01450912 ; XXVIIe Journées d'étude sur la parole (JEP 2008), Jun 2008, Avignon, France. pp.4 (2008)
BASE
Show details
15
Combined systems for automatic phonetic transcription of proper nouns
In: LREC 2008 Proceedings ; 6th Language Evaluation and Resources Conference (LREC 2008) ; https://hal.archives-ouvertes.fr/hal-01433960 ; 6th Language Evaluation and Resources Conference (LREC 2008), May 2008, Marrakech, Morocco. pp.1791-1795 ; http://www.lrec-conf.org/lrec2008/ (2008)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
15
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern