1 |
Language-Conditioned Imitation Learning for Robot Manipulation Tasks ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
The Android and our Cyborg Selves: What Androids Will Teach us about Being (Post)Human
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Characterizing Deletion Transformations across Dialects using a Sophisticated Tying Mechanism
|
|
|
|
In: DTIC (2011)
|
|
BASE
|
|
Show details
|
|
9 |
USSS-MITLL 2010 Human Assisted Speaker Recognition
|
|
|
|
In: DTIC (2010)
|
|
BASE
|
|
Show details
|
|
11 |
Construction of a Phonotactic Dialect Corpus using Semiautomatic Annotation
|
|
|
|
In: DTIC (2007)
|
|
BASE
|
|
Show details
|
|
14 |
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
15 |
Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004
|
|
|
|
In: DTIC (2004)
|
|
BASE
|
|
Show details
|
|
16 |
The MMSR Bilingual and Crosschannel Corpora for Speaker Recognition Research and Evaluation
|
|
|
|
In: DTIC (2004)
|
|
BASE
|
|
Show details
|
|
17 |
The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data
|
|
|
|
In: DTIC (2004)
|
|
BASE
|
|
Show details
|
|
18 |
Combining Cross-Stream And Time Dimensions In Phonetic Speaker Recognition
|
|
|
|
In: DTIC (2003)
|
|
Abstract:
Recent studies show that phonetic sequences from multiple languages can provide effective features for speaker recognition. So far, only pronunciation dynamics in the time dimension, i.e., n-gram modeling on each of the phone sequences, have been examined. In the JHU 2002 Summer Workshop, we explored modeling the statistical pronunciation dynamics across streams in multiple languages (cross-stream dimension) as an additional component to the time dimension. We found that bigram modeling in the cross-stream dimension achieves improved performance over that in the time dimension on the NIST 2001 Speaker Recognition Evaluation Extended Data Task. Moreover, a linear combination of information from both dimensions at the score level further improves the performance, showing that the two dimensions contain complementary information. ; Presented at IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2003), held in Hong Kong, China in April 2003. The original document contains color images.
|
|
Keyword:
*PHONETIC SPEAKER RECOGNITION; *PHONETICS; *SPEECH RECOGNITION; ACOUSTICS; CHINA; Linguistics; PHONETIC SEQUENCES; PRONUNCIATION DYNAMICS; SPEAKER DETECTION; SPM(SPEAKER PHONETIC MODEL); SYMPOSIA; UBPM(UNIVERSAL-BACKGROUND PHONETIC MODEL); Voice Communications; WORDS(LANGUAGE)
|
|
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA523753 http://www.dtic.mil/docs/citations/ADA523753
|
|
BASE
|
|
Hide details
|
|
|
|