1 |
An Acoustic-Phonetic Approach To Vocal Melody Extraction. ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
An Acoustic-Phonetic Approach To Vocal Melody Extraction. ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
BIC-based speaker segmentation using divide-and-conquer strategies with application to speaker diarization
|
|
|
|
In: http://www-gth.die.upm.es/research/documentation/referencias/Cheng_BIC-based.pdf (2010)
|
|
BASE
|
|
Show details
|
|
4 |
A probabilistic generative framework for extractive broadcast news speech summarization
|
|
|
|
In: http://berlin.csie.ntnu.edu.tw/Berlin_Research/Manuscripts/IS071119-Summarization.pdf (2009)
|
|
BASE
|
|
Show details
|
|
5 |
Spoken document summarization using relevant information
|
|
|
|
In: http://www.iis.sinica.edu.tw/papers/whm/3891-F.pdf (2007)
|
|
BASE
|
|
Show details
|
|
6 |
Automatic Speaker Clustering Using A Voice Characteristic Reference Space And Maximum Purity Estimation
|
|
|
|
In: http://www.iis.sinica.edu.tw/papers/whm/2641-F.pdf (2007)
|
|
BASE
|
|
Show details
|
|
7 |
Minimum boundary error training for automatic phonetic segmentation
|
|
|
|
In: http://www.iis.sinica.edu.tw/~whm/publish/papers/interspeech2006-mbe.pdf (2006)
|
|
BASE
|
|
Show details
|
|
8 |
Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models
|
|
|
|
In: http://berlin.csie.ntnu.edu.tw/Berlin_Research/Manuscripts/2006_Summarization_HMM-RM.pdf (2006)
|
|
BASE
|
|
Show details
|
|
9 |
Fluent speech prosody: Framework and modeling
|
|
|
|
In: http://www.iis.sinica.edu.tw/papers/whm/1908-F.pdf (2005)
|
|
BASE
|
|
Show details
|
|
10 |
A discriminative HMM/n-gram-based retrieval approach for Mandarin spoken documents
|
|
|
|
In: http://www.iis.sinica.edu.tw/papers/whm/1372-F.pdf (2004)
|
|
BASE
|
|
Show details
|
|
11 |
A discriminative HMM/N-gram-based retrieval approach for Mandarin spoken documents
|
|
|
|
In: http://berlin.csie.ntnu.edu.tw/Berlin_Research/Manuscripts/2004/2004-ACM TALIP Vol. 3, No. 2 -A Discriminative HMMN-Gram-Based Retrieval Approach for Mandarin Spoken Documents.pdf (2004)
|
|
BASE
|
|
Show details
|
|
12 |
Speaker clustering of speech utterances using a voice characteristic reference space
|
|
|
|
In: http://www.iis.sinica.edu.tw/~whm/publish/papers/icslp2004-speakerclustering.pdf (2004)
|
|
BASE
|
|
Show details
|
|
13 |
Towards retrieval of video archives based on the speech content
|
|
|
|
In: http://www.iis.sinica.edu.tw/~whm/publish/papers/iscslp2002-053.pdf (2002)
|
|
BASE
|
|
Show details
|
|
14 |
Discriminating Capabilities of Syllable-based Features and Approaches of Utilizing Them for Voice
|
|
|
|
In: http://speech.ee.ntu.edu.tw/~RA/lab/html/thesis/IEEEXplore_18.pdf (2002)
|
|
BASE
|
|
Show details
|
|
15 |
Multi-scale audio indexing for translingual spoken document retrieval
|
|
|
|
In: http://www.se.cuhk.edu.hk/~wklo/docpub/WangICASSP2001.pdf (2001)
|
|
Abstract:
MEI (Mandarin-English Information) is an English-Chinese crosslingual spoken document retrieval (CL-SDR) system developed during the Johns Hopkins University Summer Workshop 2000. We integrate speech recognition, machine translation, and information retrieval technologies to perform CL-SDR. MEI advocates a multi-scale paradigm, where both Chinese words and subwords (characters and syllables) are used in retrieval. The use of subword units can complement the word unit in handling the problems of Chinese word tokenization ambiguity, Chinese homophone ambiguity, and out-ofvocabulary words in audio indexing. This paper focuses on multi-scale audio indexing in MEI. Experiments are based on the Topic Detection and Tracking Corpora (TDT-2 and TDT-3), where we indexed Voice of America Mandarin news broadcasts by speech recognition on both the word and subword scales. In this paper, we discuss the development of the MEI syllable recognizer, the representations of spoken documents using overlapping subword n-grams and lattice structures. Results show that augmenting words with subwords is beneficial to CL-SDR performance. 1.
|
|
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.148.7253 http://www.se.cuhk.edu.hk/~wklo/docpub/WangICASSP2001.pdf
|
|
BASE
|
|
Hide details
|
|
16 |
Multi-Scale Audio Indexing For Translingual Spoken Document Retrieval
|
|
|
|
In: http://www.se.cuhk.edu.hk/PEOPLE/hmmeng/Meng_AudioIndex_ICASSP2001.pdf (2001)
|
|
BASE
|
|
Show details
|
|
17 |
Retrieval of Mandarin Broadcast News Using Spoken Queries
|
|
|
|
In: http://www.iis.sinica.edu.tw/~whm/publish/papers/icslp2000-01298.pdf (2000)
|
|
BASE
|
|
Show details
|
|
18 |
A Spoken-Access Approach for Chinese Text and Speech Information Retrieval
|
|
|
|
In: http://wkd.iis.sinica.edu.tw/~lfchien/publication/JASIS-REVISED.pdf (2000)
|
|
BASE
|
|
Show details
|
|
19 |
Retrieval of mandarin broadcast news using spoken queries
|
|
|
|
In: http://berlin.csie.ntnu.edu.tw/Berlin_Research/Manuscripts/2000_ICSLP_SDR.pdf (2000)
|
|
BASE
|
|
Show details
|
|
20 |
Retrieval of broadcast news speech in Mandarin Chinese collected in taiwan using syllable-level statistical characteristics
|
|
|
|
In: http://www.iis.sinica.edu.tw/~whm/publish/papers/icassp2000.pdf (2000)
|
|
BASE
|
|
Show details
|
|
|
|