Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 9 of 9

1	Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015
	Poignant, Johann; Bredin, Hervé; Barras, Claude
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690581 ; Multimedia Tools and Applications, Springer Verlag, 2017, 76 (21), pp.22547 - 22567. ⟨10.1007/s11042-017-4730-x⟩ (2017)
	Abstract: This is a post-peer-review, pre-copyedit version of an article published in Multimedia Tools and Applications. The final authenticated version is available online at: http://dx.doi.org/10.1007/s11042-017-4730-x ; International audience ; We describe the " Multimodal Person Discovery in Broadcast TV " task of MediaEval 2015 benchmarking initiative. Participants were asked to return the names of people who can be both seen as well as heard in every shot of a collection of videos. The list of people was not known a priori and their names had to be discovered in an unsupervised way from media content using text overlay or speech transcripts. The task was evaluated using information retrieval metrics, based on a posteriori collaborative annotation of the test corpus. The first edition of the task gathered 9 teams which submitted 34 runs. This paper provides quantitative and qualitative comparisons of participants submissions. We also investigate why all systems failed for particular shots, paving the way for future promising research directions.
	Keyword: [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; [INFO]Computer Science [cs]; benchmark; error analysis; information retrieval; multimodal fusion; unsupervised person recognition
	URL: https://hal.archives-ouvertes.fr/hal-01690581 https://doi.org/10.1007/s11042-017-4730-x https://hal.archives-ouvertes.fr/hal-01690581/file/Poignant2017.pdf https://hal.archives-ouvertes.fr/hal-01690581/document
	BASE
	Hide details

2	Benchmarking Multimedia Technologies with the CAMOMILE Platform: the Case of Multimodal Person Discovery at MediaEval 2015
	Poignant, Johann; Bredin, Hervé; Barras, Claude...
	In: LREC 2016 ; https://hal.archives-ouvertes.fr/hal-01690277 ; LREC 2016, May 2016, Portorož, Slovenia (2016)
	BASE
	Show details

3	The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
	Poignant, Johann; Budnik, Mateusz; Bredin, Hervé...
	In: Proceedings of LREC 2016 ; LREC 2016 Conference ; https://hal.archives-ouvertes.fr/hal-01350096 ; LREC 2016 Conference, May 2016, Portoroz, Slovenia (2016)
	BASE
	Show details

4	What Makes a Speaker Recognizable in TV Broadcast? Going Beyond Speaker Identification Error Rate
	Charlet, Delphine; Poignant, Johann; Bredin, Hervé...
	In: Interspeech 2015 ; ERRARE Workshop, a satellite event of Interspeech 2015. ; https://hal.archives-ouvertes.fr/hal-01433205 ; ERRARE Workshop, a satellite event of Interspeech 2015., 2015, Sinaia, Romania (2015)
	BASE
	Show details

5	Unsupervised Speaker Identification in TV Broadcast Based on Written Names
	Poignant, Johann; Besacier, Laurent; Quénot, Georges
	In: ISSN: 1558-7916 ; IEEE Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01060827 ; IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2015, 23 (1), pp.57-68. ⟨10.1109/TASLP.2014.2367822⟩ ; https://dl.acm.org/authorize?N46627 (2015)
	BASE
	Show details

6	Collaborative Annotation for Person Identification in TV Shows
	Budnik, Matheuz; Besacier, Laurent; Poignant, Johann...
	In: Interspeech 2015 (short demo paper) ; https://hal.archives-ouvertes.fr/hal-01170513 ; Interspeech 2015 (short demo paper), Sep 2015, Dresden, Germany (2015)
	BASE
	Show details

7	Integer Linear Programming for Speaker Diarization and Cross-Modal Identification in TV Broadcast
	Bredin, Hervé; Poignant, Johann
	In: the 14rd Annual Conference of the International Speech Communication Association, INTERSPEECH ; https://hal.inria.fr/hal-00953095 ; the 14rd Annual Conference of the International Speech Communication Association, INTERSPEECH, 2013, Lyon, France (2013)
	BASE
	Show details

8	Towards a better integration of written names for unsupervised speakers identification in videos
	Poignant, Johann; Bredin, Hervé; Besacier, Laurent...
	In: First Workshop on Speech, Language and Audio in Multimedia, SLAM ; https://hal.inria.fr/hal-00953089 ; First Workshop on Speech, Language and Audio in Multimedia, SLAM, 2013, Marseille, France (2013)
	BASE
	Show details

9	Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
	Poignant, Johann; Bredin, Hervé; Le, Viet-Bac...
	In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern