Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 23

1	Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
	Barras, Claude; Le, Viet-Bac; Gauvain, Jean-Luc
	In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
	BASE
	Show details

2	Challenges in Audio Processing of Terrorist-Related Data
	Gauvain, Jodie; Lamel, Lori; Le, Viet Bac...
	In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
	BASE
	Show details

3	Challenges in Audio Processing of Terrorist-Related Data
	Gauvain, Jodie; Lamel, Lori; Le, Viet Bac...
	In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
	BASE
	Show details

4	Language Recognition for Dialects and Closely Related Languages
	Gelly, Grégory; Gauvain, Jean-Luc; Lamel, Lori...
	In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
	BASE
	Show details

5	Improving Data Selection for Low Resource STT and KWS
	Fraga-Silva, Thiago; Laurent,Antoine; Gauvain,Jean-Luc. - 2016
	BASE
	Show details

6	Lexical speaker identification in TV shows
	Roy, Anindya; Bredin, Hervé; Hartmann, William...
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
	BASE
	Show details

7	Traduction de la parole dans le projet RAPMAT
	Maynard, Hélène; Segal, Natalia; Bilinski, Eric...
	In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
	BASE
	Show details

8	Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
	Hartmann, William; Le, Viet Bac; Messaoudi, Abdelkhalek...
	In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association , ISCA, Sep 2014, Singapore, Singapore (2014)
	BASE
	Show details

9	Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
	Sarkar, Achintya; Do, Cong-Thanh; Le, Viet-Bac...
	In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
	BASE
	Show details

10	Lattice MLLR based m-vector system for speaker verification
	Sarkar, Achintya Kumar; Barras, Claude; Le, Viet Bac
	In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
	BASE
	Show details

11	Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
	Poignant, Johann; Bredin, Hervé; Le, Viet-Bac; Besacier, Laurent; Barras, Claude; Quénot, Georges
	In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
	Abstract: Poster Session: Speaker Recognition III ; International audience ; We propose an approach for unsupervised speaker identification in TV broadcast videos, by combining acoustic speaker diarization with person names obtained via video OCR from overlaid texts. Three methods for the propagation of the overlaid names to the speech turns are compared, taking into account the co-occurence duration between the speaker clusters and the names provided by the video OCR and using a task-adapted variant of the TF-IDF information retrieval coefficient. These methods were tested on the REPERE dry-run evaluation corpus, containing 3 hours of annotated videos. Our best unsupervised system reaches a F-measure of 70.2% when considering all the speakers, and 81.7% if anchor speakers are left out. By comparison, a mono-modal, supervised speaker identification system with 535 speaker models trained on matching development data and additional TV and radio data only provided a 57.5% F-measure when considering all the speakers and 45.7% without anchor.
	Keyword: [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; multimodal fusion; optical character recognition; reproducible results; speaker diarization; unsupervised speaker identification
	URL: https://hal.archives-ouvertes.fr/hal-00767427 https://hal.archives-ouvertes.fr/hal-00767427/document https://hal.archives-ouvertes.fr/hal-00767427/file/Poignant-al_Interspeech2012.pdf
	BASE
	Hide details

12	Recherche par le contenu dans des documents audiovisuels multilingues
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: ISSN: 1279-5127 ; EISSN: 1963-1014 ; Document Numérique ; https://hal.inria.fr/hal-00953796 ; Document Numérique, Lavoisier, 2010, 13 (1), pp.229-246 (2010)
	BASE
	Show details

13	Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-00953696 ; Multimedia Tools and Applications, Springer Verlag, 2010, 48 (1), pp.123-140. ⟨10.1007/s11042-009-0377-6⟩ (2010)
	BASE
	Show details

14	Automatic speech recognition for under-resourced languages: application to Vietnamese language
	Besacier, Laurent; Le, Viet-Bac
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 8, 1471-1482
	BLLDB
	OLC Linguistik
	Show details

15	Exploitation d'un corpus bilingue comparable pour la création d'un système de traduction probabiliste Vietnamien - Français
	Do, Thi Ngoc Diep; Le, Viet-Bac; Bigi, Brigitte...
	In: TALN ; TALN 2009, Senlis, 24-26 juin 2009 ; https://hal.archives-ouvertes.fr/hal-00959202 ; TALN 2009, Senlis, 24-26 juin 2009, 2009, Unknown, pp.x-x (2009)
	BASE
	Show details

16	Mining a comparable text corpus for a Vietnamese - French statistical machine translation system
	Do, Thi-Ngoc-Diep; Le, Viet-Bac; Bigi, Brigitte...
	In: Fourth Workshop on Statistical Machine Translation ; https://hal.archives-ouvertes.fr/hal-01393602 ; Fourth Workshop on Statistical Machine Translation, 2009, Athens, Greece. pp.165 - 172, ⟨10.3115/1626431.1626466⟩ ; http://www.statmt.org/wmt09/ (2009)
	BASE
	Show details

17	Recherche par le contenu dans des documents audiovisuels multilingues
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: Actes de la conférence CORIA ; https://hal.inria.fr/hal-00954025 ; Actes de la conférence CORIA, 2009, Giens, France. pp.67-82 (2009)
	BASE
	Show details

18	Content-Based Search in Multilingual Audiovisual Documents using the International Phonetic Alphabet
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009) ; https://hal.inria.fr/hal-00953855 ; 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 2009, Chania, Crete. 3-5 June 2009 (2009)
	BASE
	Show details

19	Normalisation et alignement de corpus français et vietnamiens : Format et Logiciels
	Bigi, Brigitte; Le, Viet-Bac
	In: Actes JATD 2008 ; journées internationales d'analyse statistique des données textuelles ; https://hal.archives-ouvertes.fr/hal-01705630 ; journées internationales d'analyse statistique des données textuelles, Jun 2008, Lyon, France (2008)
	BASE
	Show details

20	Acoustic-Phonetic Unit Similarities for Context Dependent Acoustic Model Portability
	Le, Viet-Bac; Besacier, Laurent; Schultz, Tanja. - 2008
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern