1 |
End-to-end speaker segmentation for overlap-aware resegmentation
|
|
|
|
In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
|
|
BASE
|
|
Show details
|
|
2 |
The Domain Mismatch Problem in the Broadcast Speaker Attribution Task
|
|
|
|
In: ISSN: 2076-3417 ; Applied Sciences ; https://hal.archives-ouvertes.fr/hal-03448852 ; Applied Sciences, MDPI, 2021, 11 (18), pp.8521. ⟨10.3390/app11188521⟩ (2021)
|
|
BASE
|
|
Show details
|
|
5 |
The Domain Mismatch Problem in the Broadcast Speaker Attribution Task
|
|
|
|
In: Applied Sciences ; Volume 11 ; Issue 18 (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Application of Fusion of Various Spontaneous Speech Analytics Methods for Improving Far-Field Neural-Based Diarization
|
|
|
|
In: Mathematics ; Volume 9 ; Issue 23 (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Speaker Diarization Using Improved SincNet Models to Extract Speaker Embeddings
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Evaluating the Performance of Using Speaker Diarization for Speech Separation of In-Person Role-Play Dialogues
|
|
|
|
In: Browse all Theses and Dissertations (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Does bilingual input hurt? A simulation of language discrimination and clustering using i-vectors
|
|
|
|
In: CogSci 2020 - 42nd Annual Virtual Meeting of the Cognitive Science Society ; https://hal.archives-ouvertes.fr/hal-02959451 ; CogSci 2020 - 42nd Annual Virtual Meeting of the Cognitive Science Society, Jul 2020, Toronto / Virtual, Canada (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Speaker detection in the wild: Lessons learned from JSALT 2019
|
|
|
|
In: Odyssey 2020 The Speaker and Language Recognition Workshop ; https://hal.archives-ouvertes.fr/hal-02417632 ; Odyssey 2020 The Speaker and Language Recognition Workshop, Nov 2020, Tokyo, Japan (2020)
|
|
BASE
|
|
Show details
|
|
11 |
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
|
|
|
|
In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
|
|
BASE
|
|
Show details
|
|
12 |
Multimodal Speaker Diarization Using a Pre-Trained Audio-Visual Synchronization Model
|
|
|
|
In: Sensors ; Volume 19 ; Issue 23 (2019)
|
|
BASE
|
|
Show details
|
|
13 |
Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media
|
|
|
|
In: Applied Sciences ; Volume 9 ; Issue 24 (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Pyannote.metrics: a toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01836450 ; Annual Conference of the International Speech Communication Association , Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
Show details
|
|
15 |
Improving Speech and Speaker Recognition For Multi-Speaker Conversations
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Iterative PLDA Adaptation for Speaker Diarization
|
|
|
|
In: Interspeech 2016 ; https://hal.archives-ouvertes.fr/hal-01433172 ; Interspeech 2016, Sep 2016, San Francisco, United States. pp.2175 - 2179, ⟨10.21437/Interspeech.2016-572⟩ (2016)
|
|
BASE
|
|
Show details
|
|
17 |
Speech segmentation and speaker diarisation for transcription and translation
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Unsupervised Speaker Identification in TV Broadcast Based on Written Names
|
|
|
|
In: ISSN: 1558-7916 ; IEEE Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01060827 ; IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2015, 23 (1), pp.57-68. ⟨10.1109/TASLP.2014.2367822⟩ ; https://dl.acm.org/authorize?N46627 (2015)
|
|
BASE
|
|
Show details
|
|
19 |
Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment
|
|
|
|
In: DTIC (2015)
|
|
BASE
|
|
Show details
|
|
20 |
An audiovisual attention model for natural conversation scenes
|
|
|
|
In: Proceedings of the IEEE ICIP 2014 ; ICIP 2014 - 21st IEEE International Conference on Image Processing ; https://hal.archives-ouvertes.fr/hal-01009467 ; ICIP 2014 - 21st IEEE International Conference on Image Processing, Oct 2014, Paris, France. pp.1-5 (2014)
|
|
BASE
|
|
Show details
|
|
|
|