2 |
On Investigation of Unsupervised Speech Factorization Based on Normalization Flow ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Phonetic-attention scoring for deep speaker features in speaker verification ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Phonetic Temporal Neural Model for Language Identification ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Multi-task Recurrent Model for True Multilingual Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
System Combination for Short Utterance Speaker Recognition ...
|
|
|
|
Abstract:
For text-independent short-utterance speaker recognition (SUSR), the performance often degrades dramatically. This paper presents a combination approach to the SUSR tasks with two phonetic-aware systems: one is the DNN-based i-vector system and the other is our recently proposed subregion-based GMM-UBM system. The former employs phone posteriors to construct an i-vector model in which the shared statistics offers stronger robustness against limited test data, while the latter establishes a phone-dependent GMM-UBM system which represents speaker characteristics with more details. A score-level fusion is implemented to integrate the respective advantages from the two systems. Experimental results show that for the text-independent SUSR task, both the DNN-based i-vector system and the subregion-based GMM-UBM system outperform their respective baselines, and the score-level system combination delivers performance improvement. ... : APSIPA ASC 2016 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Neural and Evolutionary Computing cs.NE
|
|
URL: https://arxiv.org/abs/1603.09460 https://dx.doi.org/10.48550/arxiv.1603.09460
|
|
BASE
|
|
Hide details
|
|
|
|