1 |
A thorough evaluation of the Language Environment Analysis (LENA) system
|
|
|
|
In: Behav Res Methods (2021)
|
|
BASE
|
|
Show details
|
|
3 |
An open-source voice type classifier for child-centered daylong recordings
|
|
|
|
In: Interspeech 2020 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02989487 ; Interspeech 2020 - Conference of the International Speech Communication Association, Oct 2020, Shanghai / Virtual, China (2020)
|
|
BASE
|
|
Show details
|
|
4 |
A thorough evaluation of the Language Environment Analysis (LENATM) system
|
|
|
|
In: ISSN: 1554-351X ; EISSN: 1554-3528 ; Behavior Research Methods ; https://hal.archives-ouvertes.fr/hal-02989519 ; Behavior Research Methods, Psychonomic Society, Inc, 2020, ⟨10.31219/osf.io/mxr8s⟩ (2020)
|
|
BASE
|
|
Show details
|
|
5 |
A thorough evaluation of the Language Environment Analysis (LENA) system
|
|
|
|
In: ISSN: 1554-351X ; EISSN: 1554-3528 ; Behavior Research Methods ; https://hal.archives-ouvertes.fr/hal-03095997 ; Behavior Research Methods, Psychonomic Society, Inc, 2020, 53 (2), pp.467-486. ⟨10.3758/s13428-020-01393-5⟩ (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Speaker detection in the wild: Lessons learned from JSALT 2019
|
|
García, Paola; Villalba, Jesus; Bredin, Hervé; Du, Jun; Castan, Diego; Cristia, Alejandrina; Bullock, Latane; Guo, Ling; Okabe, Koji; Nidadavolu, Phani Sankar; Kataria, Saurabh; Chen, Sizhu; Galmant, Léo; Lavechin, Marvin; Sun, Lei; Gill, Marie-Philippe; Ben-Yair, Bar; Abdoli, Sajjad; Wang, Xin; Bouaziz, Wassim; Titeux, Hadrien; Dupoux, Emmanuel; Lee, Kong Aik; Dehak, Najim
|
|
In: Odyssey 2020 The Speaker and Language Recognition Workshop ; https://hal.archives-ouvertes.fr/hal-02417632 ; Odyssey 2020 The Speaker and Language Recognition Workshop, Nov 2020, Tokyo, Japan (2020)
|
|
Abstract:
International audience ; This paper presents the problems and solutions addressed at the JSALT workshop when using a single microphone for speaker detection in adverse scenarios. The main focus was to tackle a wide range of conditions that go from meetings to wild speech. We describe the research threads we explored and a set of modules that was successful for these scenarios. The ultimate goal was to explore speaker detection; but our first finding was that an effective diarization improves detection, and not having a diarization stage impoverishes the performance. All the different configurations of our research agree on this fact and follow a main backbone that includes diarization as a previous stage. With this backbone, we analyzed the following problems: voice activity detection, how to deal with noisy signals, domain mismatch, how to improve the clustering; and the overall impact of previous stages in the final speaker detection. In this paper, we show partial results for speaker diarizarion to have a better understanding of the problem and we present the final results for speaker detection.
|
|
Keyword:
[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; Index Terms-speaker detection; Resegmentation; Speaker detection; Speaker diarization; Speech enhancement; voice ac- tivity detection; Voice activity detection
|
|
URL: https://hal.archives-ouvertes.fr/hal-02417632/document https://hal.archives-ouvertes.fr/hal-02417632/file/1912.00938.pdf https://hal.archives-ouvertes.fr/hal-02417632
|
|
BASE
|
|
Hide details
|
|
7 |
Longform recordings : Opportunities and challenges ; Enregistrements de longue durée: Opportunités et défis
|
|
|
|
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT). ; LIFT 2020 - 2èmes journées scientifiques du Groupement de Recherche "Linguistique informatique, formelle et de terrain" ; https://hal.archives-ouvertes.fr/hal-03047153 ; LIFT 2020 - 2èmes journées scientifiques du Groupement de Recherche "Linguistique informatique, formelle et de terrain", Dec 2020, Montrouge / Virtual, France. pp.64-71 (2020)
|
|
BASE
|
|
Show details
|
|
8 |
End-to-end Domain-Adversarial Voice Activity Detection
|
|
|
|
In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02989502 ; Interspeech 2020, Nov 2020, Shanghai, France (2020)
|
|
BASE
|
|
Show details
|
|
9 |
ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings
|
|
|
|
In: Behav Res Methods (2020)
|
|
BASE
|
|
Show details
|
|
|
|