Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 8 of 8

1	Deep generative factorization for speech signal ...
	Sun, Haoran; Li, Lantian; Cai, Yunqi. - : arXiv, 2020
	BASE
	Show details

2	On Investigation of Unsupervised Speech Factorization Based on Normalization Flow ...
	Sun, Haoran; Cai, Yunqi; Li, Lantian. - : arXiv, 2019
	BASE
	Show details

3	Phonetic-attention scoring for deep speaker features in speaker verification ...
	Li, Lantian; Tang, Zhiyuan; Shi, Ying. - : arXiv, 2018
	BASE
	Show details

4	Phonetic Temporal Neural Model for Language Identification ...
	Tang, Zhiyuan; Wang, Dong; Chen, Yixiang. - : arXiv, 2017
	BASE
	Show details

5	Phone-aware Neural Language Identification ...
	Tang, Zhiyuan; Wang, Dong; Chen, Yixiang. - : arXiv, 2017
	BASE
	Show details

6	AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline ...
	Wang, Dong; Li, Lantian; Tang, Difei. - : arXiv, 2016
	BASE
	Show details

7	Multi-task Recurrent Model for True Multilingual Speech Recognition ...
	Tang, Zhiyuan; Li, Lantian; Wang, Dong. - : arXiv, 2016
	Abstract: Research on multilingual speech recognition remains attractive yet challenging. Recent studies focus on learning shared structures under the multi-task paradigm, in particular a feature sharing structure. This approach has been found effective to improve performance on each individual language. However, this approach is only useful when the deployed system supports just one language. In a true multilingual scenario where multiple languages are allowed, performance will be significantly reduced due to the competition among languages in the decoding space. This paper presents a multi-task recurrent model that involves a multilingual speech recognition (ASR) component and a language recognition (LR) component, and the ASR component is informed of the language information by the LR component, leading to a language-aware recognition. We tested the approach on an English-Chinese bilingual recognition task. The results show that the proposed multi-task recurrent model can improve performance of multilingual ... : APSIPA 2016. arXiv admin note: text overlap with arXiv:1603.09643 ...
	Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG; Neural and Evolutionary Computing cs.NE
	URL: https://arxiv.org/abs/1609.08337 https://dx.doi.org/10.48550/arxiv.1609.08337
	BASE
	Hide details

8	System Combination for Short Utterance Speaker Recognition ...
	Li, Lantian; Wang, Dong; Zhang, Xiaodong. - : arXiv, 2016
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern