Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2021 (1)
  - 2020 (1)
  - 2019 (3)
  - 2018 (1)
  - 2017 (2)
  - 2016 (6)
  - 2015 (1)
  - 2014 (6)
  - 2013 (3)
  - 2010 (1)
  - more
- Medium
- Type
- BLLDB-Access:
  - free (48)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3

Hits 1 – 20 of 48

1	Modeling the effect of military oxygen masks on speech characteristics
	Elie, Benjamin; Gauvain, Jodie; Gauvain, Jean-Luc...
	In: Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03325087 ; Interspeech 2021, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

2	Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
	Barras, Claude; Le, Viet-Bac; Gauvain, Jean-Luc
	In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
	BASE
	Show details

3	Challenges in Audio Processing of Terrorist-Related Data
	Gauvain, Jodie; Lamel, Lori; Le, Viet Bac...
	In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
	BASE
	Show details

4	Challenges in Audio Processing of Terrorist-Related Data
	Gauvain, Jodie; Lamel, Lori; Le, Viet Bac...
	In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
	BASE
	Show details

5	Collective memory shapes the organization of individual memories in the medial prefrontal cortex
	Gagnepain, Pierre; Vallée, Thomas; Heiden, Serge...
	In: EISSN: 2397-3374 ; Nature Human Behaviour ; https://halshs.archives-ouvertes.fr/halshs-02416130 ; Nature Human Behaviour, Nature Research 2019, ⟨10.1038/s41562-019-0779-z⟩ (2019)
	BASE
	Show details

6	Conversational telephone speech recognition for Lithuanian
	Lileikyté, Rasa; Lamel, Lori; Gauvain, Jean-Luc...
	In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01837147 ; Computer Speech and Language, Elsevier, 2018, 49, pp.71-82 (2018)
	BASE
	Show details

7	Effective keyword search for low-resourced conversational speech
	Lileikyte, Rasa; Fraga-Silva, Thiago; Lamel, Lori...
	In: icassp 2017 ; https://hal.archives-ouvertes.fr/hal-01744176 ; icassp 2017, IEEE, Mar 2017, La Nouvelle Orléans, United States (2017)
	BASE
	Show details

8	An investigation into language model data augmentation for low-resourced STT and KWS
	Huang, Guangpu; Fraga Da Silva, Thiago; Lamel, Lori...
	In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01837171 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Mar 2017, New Orleans, United States (2017)
	BASE
	Show details

9	Language Recognition for Dialects and Closely Related Languages
	Gelly, Grégory; Gauvain, Jean-Luc; Lamel, Lori...
	In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
	BASE
	Show details

10	Language Model Data Augmentation for Keyword Spotting
	Gorin, Arseniy; Lileikyté, Rasa; Huang, Guangpu...
	In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837186 ; Annual Conference of the International Speech Communication Association , Jan 2016, San Francisco, United States (2016)
	BASE
	Show details

11	Investigating techniques for low resource conversational speech recognition
	Laurent, Antoine; Fraga-Silva, Thiago; Lamel, Lori...
	In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016) ; https://hal-univ-lemans.archives-ouvertes.fr/hal-01515254 ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Mar 2016, Shangai, China. pp.5975-5979, ⟨10.1109/ICASSP.2016.7472824⟩ ; www.icassp2016.org (2016)
	BASE
	Show details

12	Improving Data Selection for Low Resource STT and KWS
	Fraga-Silva, Thiago; Laurent,Antoine; Gauvain,Jean-Luc. - 2016
	BASE
	Show details

13	Machine Translation Based Data Augmentation for Cantonese Keyword Spotting (Author's Manuscript)
	Huang, Guangpu; Gorin,Arseniy; Gauvain,Jean-Luc. - 2016
	BASE
	Show details

14	Investigating Techniques for Low Resource Conversational Speech Recognition
	Laurent, Antoine; Fraga-Silva,Thiago; Lamel,Lori. - 2016
	BASE
	Show details

15	Lexical speaker identification in TV shows
	Roy, Anindya; Bredin, Hervé; Hartmann, William; Le, Viet Bac; Barras, Claude; Gauvain, Jean-Luc
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
	Abstract: The final publication is available at https://link.springer.com/article/10.1007/s11042-014-1940-3 ; International audience ; It is possible to use lexical information extracted from speech transcripts for speaker identification (SID), either on its own or to improve the performance of standard cepstral-based SID systems upon fusion. This was established before typically using isolated speech from single speakers (NIST SRE corpora, parliamentary speeches). On the contrary, this work applies lexical approaches for SID on a different type of data. It uses the REPERE corpus consisting of unsegmented multiparty conversations, mostly debates, discussions and Q&A sessions from TV shows. It is hypothesized that people give out clues to their identity when speaking in such settings which this work aims to exploit. The impact on SID performance of the diarization front-end required to pre-process the unsegmented data is also measured. Four lexical SID approaches are studied in this work, including TFIDF, BM25 and LDA-based topic modeling. Results are analysed in terms of TV shows and speaker roles. Lexical approaches achieve low error rates for certain speaker roles such as anchors and journalists, sometimes lower than a standard cepstral-based Gaussian Supervector-Support Vector Machine (GSV-SVM) system. Also, in certain cases, the lexical system shows modest improvement over the cepstral-based system performance using score-level sum fusion. To highlight the potential of using lexical information not just to improve upon cepstral-based SID systems but as an independent approach in its own right, initial studies on crossmedia SID is briefly reported. Instead of using 2 Anindya Roy et al. speech data as all cepstral systems require, this approach uses Wikipedia texts to train lexical speaker models which are then tested on speech transcripts to identify speakers.
	Keyword: [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; [INFO]Computer Science [cs]
	URL: https://hal.archives-ouvertes.fr/hal-01690342/file/paper_v0.pdf https://hal.archives-ouvertes.fr/hal-01690342/document https://doi.org/10.1007/s11042-014-1940-3 https://hal.archives-ouvertes.fr/hal-01690342
	BASE
	Hide details

16	Traduction de la parole dans le projet RAPMAT
	Maynard, Hélène; Segal, Natalia; Bilinski, Eric...
	In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
	BASE
	Show details

17	Speech-to-Text Development for Slovak, a Low-Resourced Language
	Do, Cong-Thanh; Lamel, Lori; Gauvain, Jean-Luc
	In: International Workshop on Spoken Languages Technologies for Under-resourced languages ; https://hal.archives-ouvertes.fr/hal-01843417 ; International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St. Petersburg, Russia (2014)
	BASE
	Show details

18	Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
	Hartmann, William; Le, Viet Bac; Messaoudi, Abdelkhalek...
	In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association , ISCA, Sep 2014, Singapore, Singapore (2014)
	BASE
	Show details

19	Efficient Rule Scoring for Improved Grapheme-Based Lexicons
	Hartmann, William; Lamel, Lori; Gauvain, Jean-Luc
	In: European Signal Processing Conference ; https://hal.archives-ouvertes.fr/hal-01843411 ; European Signal Processing Conference, Jan 2014, Lisbon, Portugal (2014)
	BASE
	Show details

20	Cross-Word Sub-Word Units for Low-Resource Keyword Spotting
	Hartmann, William; Lamel, Lori; Gauvain, Jean-Luc
	In: International Workshop on Spoken Languages Technologies for Under-resourced languages ; https://hal.archives-ouvertes.fr/hal-01843415 ; International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St. Petersburg, Russia (2014)
	BASE
	Show details

Page: 1 2 3

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern