1 |
Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
|
|
|
|
In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
2 |
Challenges in Audio Processing of Terrorist-Related Data
|
|
|
|
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
|
|
BASE
|
|
Show details
|
|
3 |
Challenges in Audio Processing of Terrorist-Related Data
|
|
|
|
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
|
|
Abstract:
International audience ; Much information in multimedia data related to terrorist activity can be extracted from the audio content. Our work in ongoing projects aims to provide a complete description of the audio portion of multimedia documents. The information that can be extracted can be derived from diarization, classification of acoustic events, language and speaker segmentation and clustering, as well as automatic transcription of the speech portions. An important consideration is ensuring that the audio processing technologies are well suited to the types of data of interest to the law enforcement agencies. While language identification and speech recognition may be considered as ’mature technologies’, our experience is that even state-of-the-art systems require customisation and enhancements to address the challenges of terrorist-related audio documents.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Automatic speech recognition Acoustic event detection Language identification Code switching
|
|
URL: https://hal.archives-ouvertes.fr/hal-02387373
|
|
BASE
|
|
Hide details
|
|
4 |
Language Recognition for Dialects and Closely Related Languages
|
|
|
|
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
|
|
BASE
|
|
Show details
|
|
6 |
Lexical speaker identification in TV shows
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
|
|
BASE
|
|
Show details
|
|
7 |
Traduction de la parole dans le projet RAPMAT
|
|
|
|
In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
|
|
BASE
|
|
Show details
|
|
8 |
Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association , ISCA, Sep 2014, Singapore, Singapore (2014)
|
|
BASE
|
|
Show details
|
|
9 |
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
|
|
|
|
In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
|
|
BASE
|
|
Show details
|
|
10 |
Lattice MLLR based m-vector system for speaker verification
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
|
|
BASE
|
|
Show details
|
|
11 |
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
|
|
|
|
In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
|
|
BASE
|
|
Show details
|
|
12 |
Recherche par le contenu dans des documents audiovisuels multilingues
|
|
|
|
In: ISSN: 1279-5127 ; EISSN: 1963-1014 ; Document Numérique ; https://hal.inria.fr/hal-00953796 ; Document Numérique, Lavoisier, 2010, 13 (1), pp.229-246 (2010)
|
|
BASE
|
|
Show details
|
|
13 |
Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-00953696 ; Multimedia Tools and Applications, Springer Verlag, 2010, 48 (1), pp.123-140. ⟨10.1007/s11042-009-0377-6⟩ (2010)
|
|
BASE
|
|
Show details
|
|
14 |
Exploitation d'un corpus bilingue comparable pour la création d'un système de traduction probabiliste Vietnamien - Français
|
|
|
|
In: TALN ; TALN 2009, Senlis, 24-26 juin 2009 ; https://hal.archives-ouvertes.fr/hal-00959202 ; TALN 2009, Senlis, 24-26 juin 2009, 2009, Unknown, pp.x-x (2009)
|
|
BASE
|
|
Show details
|
|
15 |
Mining a comparable text corpus for a Vietnamese - French statistical machine translation system
|
|
|
|
In: Fourth Workshop on Statistical Machine Translation ; https://hal.archives-ouvertes.fr/hal-01393602 ; Fourth Workshop on Statistical Machine Translation, 2009, Athens, Greece. pp.165 - 172, ⟨10.3115/1626431.1626466⟩ ; http://www.statmt.org/wmt09/ (2009)
|
|
BASE
|
|
Show details
|
|
16 |
Recherche par le contenu dans des documents audiovisuels multilingues
|
|
|
|
In: Actes de la conférence CORIA ; https://hal.inria.fr/hal-00954025 ; Actes de la conférence CORIA, 2009, Giens, France. pp.67-82 (2009)
|
|
BASE
|
|
Show details
|
|
17 |
Content-Based Search in Multilingual Audiovisual Documents using the International Phonetic Alphabet
|
|
|
|
In: 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009) ; https://hal.inria.fr/hal-00953855 ; 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 2009, Chania, Crete. 3-5 June 2009 (2009)
|
|
BASE
|
|
Show details
|
|
18 |
Normalisation et alignement de corpus français et vietnamiens : Format et Logiciels
|
|
|
|
In: Actes JATD 2008 ; journées internationales d'analyse statistique des données textuelles ; https://hal.archives-ouvertes.fr/hal-01705630 ; journées internationales d'analyse statistique des données textuelles, Jun 2008, Lyon, France (2008)
|
|
BASE
|
|
Show details
|
|
19 |
Acoustic-Phonetic Unit Similarities for Context Dependent Acoustic Model Portability
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Automatic Speech Recognition for Under-Ressourced Languages ; Reconnaissance automatique de la parole pour des langues peu dotées
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-00081061 ; Interface homme-machine [cs.HC]. Université Joseph-Fourier - Grenoble I, 2006. Français (2006)
|
|
BASE
|
|
Show details
|
|
|
|