Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2020 (1)
  - 2018 (1)
  - 2017 (1)
  - 2016 (2)
  - 2014 (2)
  - 2013 (1)
- Medium:
  - Online (8)
- Type
- BLLDB-Access:
  - free (8)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 8 of 8

1	"This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II) ...
	Gorin, Arseniy; Kulko, Daniil; Grima, Steven. - : arXiv, 2020
	BASE
	Show details

2	Conversational telephone speech recognition for Lithuanian
	Lileikyté, Rasa; Lamel, Lori; Gauvain, Jean-Luc...
	In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01837147 ; Computer Speech and Language, Elsevier, 2018, 49, pp.71-82 (2018)
	BASE
	Show details

3	An investigation into language model data augmentation for low-resourced STT and KWS
	Huang, Guangpu; Fraga Da Silva, Thiago; Lamel, Lori...
	In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01837171 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Mar 2017, New Orleans, United States (2017)
	BASE
	Show details

4	Language Model Data Augmentation for Keyword Spotting
	Gorin, Arseniy; Lileikyté, Rasa; Huang, Guangpu...
	In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837186 ; Annual Conference of the International Speech Communication Association , Jan 2016, San Francisco, United States (2016)
	BASE
	Show details

5	Machine Translation Based Data Augmentation for Cantonese Keyword Spotting (Author's Manuscript)
	Huang, Guangpu; Gorin,Arseniy; Gauvain,Jean-Luc. - 2016
	BASE
	Show details

6	Explicit trajectories and speaker class modeling for child and adult speech recognition ; Modélisation de trajectoires et de classes de locuteurs pour la reconnaissance de voix d'enfants et d'adultes
	Gorin, Arseniy; Jouvet, Denis
	In: XXXème édition des Journées d'Etudes sur la Parole ; https://hal.inria.fr/hal-01080343 ; XXXème édition des Journées d'Etudes sur la Parole, Jun 2014, Le Mans, France (2014)
	BASE
	Show details

7	Component Structuring and Trajectory Modeling for Speech Recognition
	Gorin, Arseniy; Jouvet, Denis
	In: Interspeech ; https://hal.inria.fr/hal-01063653 ; Interspeech, Sep 2014, Singapoore, Singapore (2014)
	BASE
	Show details

8	Efficient constrained parametrization of GMM with class-based mixture weights for Automatic Speech Recognition
	Gorin, Arseniy; Jouvet, Denis
	In: LTC'13 - 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics ; https://hal.inria.fr/hal-00923202 ; LTC'13 - 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Dec 2013, Poznań, Poland (2013)
	Abstract: International audience ; Acoustic modeling techniques, based on clustering of the training data, have become essential in large vocabulary continuous speech recognition (LVCSR) systems. Clustered data (supervised or unsupervised) is typically used to estimate the sets of parameters by adapting the speaker-independent model on each subset. For Hidden Markov Models with Gaussian mixture observation densities (HMM-GMM) most of the adaptation techniques are focusing on re-estimation of the mean vectors, whereas the mixture weights are typically distributed almost uniformly. In this work we propose a way of specifying the subspaces of the GMM by associating the sets of Gaussian mixture weights with the speaker classes and sharing the Gaussian parameters across speaker classes. The method allows us to better parametrize GMM without increasing significantly the number of model parameters. Our experiments on French radio broadcast data demonstrate the improvement of the accuracy with such parametrization compared to the models with similar, or even larger number of parameters.
	Keyword: [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
	URL: https://hal.inria.fr/hal-00923202
	BASE
	Hide details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern