Page: 1 2 3 4 5 6 7 8 9... 43
81 |
Adaptation techniques for speech synthesis in under-resoured languages
|
|
|
|
In: http://www.cs.cmu.edu/~gopalakr/publications/sltu2010_anumanchipalli.pdf (2010)
|
|
BASE
|
|
Show details
|
|
82 |
ENGLISH SPOKEN TERM DETECTION IN MULTILINGUAL RECORDINGS
|
|
|
|
In: http://publications.idiap.ch/downloads/reports/2010/Motlicek_Idiap-RR-21-2010.pdf (2010)
|
|
BASE
|
|
Show details
|
|
83 |
VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop Speed Up Strategies for the Creation of Multimodal and Multilingual Dialogue Systems
|
|
|
|
In: http://lorien.die.upm.es/%7Elfdharo/Papers/PhDThesis_Fala2010.pdf (2010)
|
|
BASE
|
|
Show details
|
|
84 |
Using n-best recognition output for extractive summarization and keyword extraction in meeting speech
|
|
|
|
In: http://www.hlt.utdallas.edu/~shasha/papers/icassp2010_liu.pdf (2010)
|
|
BASE
|
|
Show details
|
|
85 |
Context preserving dynamic word cloud visualization
|
|
|
|
In: http://research.microsoft.com/en-us/um/people/shliu/cui_pacificvis10.pdf (2010)
|
|
BASE
|
|
Show details
|
|
86 |
United Kingdom (2009)" Cued Speech Recognition for Augmentative Communication in Normal-hearing and Hearing-impaired Subjects
|
|
|
|
In: http://hal.archives-ouvertes.fr/docs/00/45/11/30/PDF/interspeech2009.pdf (2010)
|
|
BASE
|
|
Show details
|
|
87 |
The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments
|
|
|
|
In: http://www.dcs.shef.ac.uk/%7Ening/pubs/christensen2010-chime.pdf (2010)
|
|
BASE
|
|
Show details
|
|
88 |
Speaker identification with distant microphone speech
|
|
|
|
In: http://www.cs.cmu.edu/~kornel/pubs/0004518.pdf (2010)
|
|
Abstract:
The field of speaker identification has recently seen significant advancement, but improvements have tended to be benchmarked on near-field speech, ignoring the more realistic setting of far-field-instrumented speakers. In this work we present several findings on far-field speech from the MIXER5 Corpus, in the areas of feature extraction, speaker modeling, and multichannel score combination. First, we observe that minimum-variance distortionless response (MVDR) features outperform Mel-frequency cepstral coefficient (MFCC) features, and that fundamental frequency variation (FFV) features offer complimentary information to both MFCC and MVDR features. Second, we present evidence that factor analysis significantly improves system performance, compared to the more traditional GMM/UBM strategy. Third, we find that frame-based score competition significantly improves performance under mismatched conditions with multiple channels available.
|
|
Keyword:
Distant Speech; Factor Analysis Finally; Far-field Speech; Front-end Features; Index Terms — Speaker Identification
|
|
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.210.1231 http://www.cs.cmu.edu/~kornel/pubs/0004518.pdf
|
|
BASE
|
|
Hide details
|
|
89 |
Content Modeling Paradigm: An Interplay of Relationship between Author, Document, Topic, and Words
|
|
|
|
In: http://www.ijcaonline.org/casct/number2/SPE40T.pdf (2010)
|
|
BASE
|
|
Show details
|
|
90 |
Development of a Computer-Aided Language Learning System for
|
|
|
|
In: http://speechprosody2010.illinois.edu/papers/100983.pdf (2010)
|
|
BASE
|
|
Show details
|
|
91 |
The CHiME corpus: a resource and a challenge for computational hearing in multisource environments
|
|
|
|
In: http://staffwww.dcs.shef.ac.uk/people/J.Barker/pubs/Christensen2010is.pdf (2010)
|
|
BASE
|
|
Show details
|
|
92 |
When intonation plays the main character: information-vs. confirmation- seeking questions in Majorcan Catalan
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01678574 ; France. 2010, Speech Prosody 2010 (2010)
|
|
BASE
|
|
Show details
|
|
93 |
Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization
|
|
|
|
In: http://www.mmk.ei.tum.de/publ/pdf/10/10sch5.pdf (2010)
|
|
BASE
|
|
Show details
|
|
94 |
Creating a linguistic plausibility dataset with non-expert annotators
|
|
|
|
In: http://mlsp.cs.cmu.edu/publications/pdfs/p17190.pdf (2010)
|
|
BASE
|
|
Show details
|
|
95 |
Prosodic Focus in Hong Kong Cantonese without Post-focus Compression
|
|
|
|
In: http://www.phon.ucl.ac.uk/home/yi/yispapers/Wu_Xu_SP2010.pdf (2010)
|
|
BASE
|
|
Show details
|
|
96 |
Automatic Ontology Matching Via Upper Ontologies: A Systematic Evaluation
|
|
|
|
In: http://www.disi.unige.it/person/MascardiV/Download/TKDE-maggio-2009.pdf (2009)
|
|
BASE
|
|
Show details
|
|
97 |
Studying L2 suprasegmental features in Asian Englishes: a position paper
|
|
|
|
In: http://www.ling.sinica.edu.tw/eip/files/publish/2009.9.17.7807558.69687253.pdf (2009)
|
|
BASE
|
|
Show details
|
|
98 |
The semi-supervised switchboard transcription project
|
|
|
|
In: http://melodi.ee.washington.edu/~bilmes/mypubs/subramanya2009-s3tp.pdf (2009)
|
|
BASE
|
|
Show details
|
|
99 |
A comparison of query-by-example methods for spoken term detection
|
|
|
|
In: http://www.ll.mit.edu/mission/communications/ist/publications/2009_09_01_Shen_Interspeech_MS-38023.pdf (2009)
|
|
BASE
|
|
Show details
|
|
100 |
Complex Cepstrum-based Decomposition of Speech for Glottal Source Estimation
|
|
|
|
In: http://tcts.fpms.ac.be/~drugman/files/IS09-ComplexCepstrum.pdf (2009)
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9... 43
|
|