1 |
End-to-end speaker segmentation for overlap-aware resegmentation
|
|
|
|
In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
|
|
BASE
|
|
Show details
|
|
2 |
An open-source voice type classifier for child-centered daylong recordings
|
|
|
|
In: Interspeech 2020 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02989487 ; Interspeech 2020 - Conference of the International Speech Communication Association, Oct 2020, Shanghai / Virtual, China (2020)
|
|
BASE
|
|
Show details
|
|
3 |
Speaker detection in the wild: Lessons learned from JSALT 2019
|
|
|
|
In: Odyssey 2020 The Speaker and Language Recognition Workshop ; https://hal.archives-ouvertes.fr/hal-02417632 ; Odyssey 2020 The Speaker and Language Recognition Workshop, Nov 2020, Tokyo, Japan (2020)
|
|
BASE
|
|
Show details
|
|
4 |
End-to-end Domain-Adversarial Voice Activity Detection
|
|
|
|
In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02989502 ; Interspeech 2020, Nov 2020, Shanghai, France (2020)
|
|
BASE
|
|
Show details
|
|
5 |
A Metric Learning Approach to Misogyny Categorization
|
|
|
|
In: Proceedings of the 5th Workshop on Representation Learning for NLP ; Workshop on Representation Learning for NLP ; https://hal.archives-ouvertes.fr/hal-02989293 ; Workshop on Representation Learning for NLP, Jul 2020, Online, France. pp.89-94, ⟨10.18653/v1/2020.repl4nlp-1.12⟩ (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Low-latency speaker spotting with online diarization and detection
|
|
|
|
In: The Speaker and Language Recognition Workshop ; https://hal.archives-ouvertes.fr/hal-01836490 ; The Speaker and Language Recognition Workshop, ISCA, Jun 2018, Les Sables d'Olonne, France (2018)
|
|
BASE
|
|
Show details
|
|
7 |
Pyannote.metrics: a toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01836450 ; Annual Conference of the International Speech Communication Association , Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
Show details
|
|
8 |
Combining Speaker Turn Embedding and Incremental Structure Prediction for Low-Latency Speaker Diarization
|
|
|
|
In: Interspeech 2017, 18th Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01690162 ; Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Aug 2017, Stockholm, Sweden. ⟨10.21437/Interspeech.2017-1067⟩ (2017)
|
|
BASE
|
|
Show details
|
|
9 |
Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690581 ; Multimedia Tools and Applications, Springer Verlag, 2017, 76 (21), pp.22547 - 22567. ⟨10.1007/s11042-017-4730-x⟩ (2017)
|
|
BASE
|
|
Show details
|
|
10 |
Benchmarking Multimedia Technologies with the CAMOMILE Platform: the Case of Multimodal Person Discovery at MediaEval 2015
|
|
|
|
In: LREC 2016 ; https://hal.archives-ouvertes.fr/hal-01690277 ; LREC 2016, May 2016, Portorož, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
11 |
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
|
|
|
|
In: Proceedings of LREC 2016 ; LREC 2016 Conference ; https://hal.archives-ouvertes.fr/hal-01350096 ; LREC 2016 Conference, May 2016, Portoroz, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
12 |
IRIM at TRECVID 2016: Instance Search
|
|
|
|
In: TRECVid workshop 2016 ; http://hal.univ-smb.fr/hal-01416953 ; TRECVid workshop 2016, National Institute of Standards and Technology (NIST), Nov 2016, Gaithersburg, Maryland, United States ; http://www-nlpir.nist.gov/projects/trecvid/ (2016)
|
|
BASE
|
|
Show details
|
|
13 |
What Makes a Speaker Recognizable in TV Broadcast? Going Beyond Speaker Identification Error Rate
|
|
|
|
In: Interspeech 2015 ; ERRARE Workshop, a satellite event of Interspeech 2015. ; https://hal.archives-ouvertes.fr/hal-01433205 ; ERRARE Workshop, a satellite event of Interspeech 2015., 2015, Sinaia, Romania (2015)
|
|
BASE
|
|
Show details
|
|
14 |
Lexical speaker identification in TV shows
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
|
|
BASE
|
|
Show details
|
|
15 |
Collaborative Annotation for Person Identification in TV Shows
|
|
|
|
In: Interspeech 2015 (short demo paper) ; https://hal.archives-ouvertes.fr/hal-01170513 ; Interspeech 2015 (short demo paper), Sep 2015, Dresden, Germany (2015)
|
|
BASE
|
|
Show details
|
|
16 |
TVD: a reproducible and multiply aligned TV series dataset
|
|
|
|
In: LREC 2014 ; https://hal.archives-ouvertes.fr/hal-01690279 ; LREC 2014, May 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
Show details
|
|
17 |
"Sheldon speaking, bonjour!": Leveraging Multilingual Tracks for (Weakly) Supervised Speaker Identification
|
|
|
|
In: ACM MM 2014, 22nd ACM International Conference on Multimedia ; https://hal.archives-ouvertes.fr/hal-01987812 ; ACM MM 2014, 22nd ACM International Conference on Multimedia, 2014, Orlando, United States (2014)
|
|
BASE
|
|
Show details
|
|
18 |
Integer Linear Programming for Speaker Diarization and Cross-Modal Identification in TV Broadcast
|
|
|
|
In: the 14rd Annual Conference of the International Speech Communication Association, INTERSPEECH ; https://hal.inria.fr/hal-00953095 ; the 14rd Annual Conference of the International Speech Communication Association, INTERSPEECH, 2013, Lyon, France (2013)
|
|
BASE
|
|
Show details
|
|
19 |
Towards a better integration of written names for unsupervised speakers identification in videos
|
|
|
|
In: First Workshop on Speech, Language and Audio in Multimedia, SLAM ; https://hal.inria.fr/hal-00953089 ; First Workshop on Speech, Language and Audio in Multimedia, SLAM, 2013, Marseille, France (2013)
|
|
BASE
|
|
Show details
|
|
20 |
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
|
|
|
|
In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
|
|
BASE
|
|
Show details
|
|
|
|