1. Phonetic knowledge, phonotactics and perceptual validation for automatic language identification
In: http://www1.cs.columbia.edu/~fadi/candidacy/LID/Decker-2003.pdf (2003)
2. Phonetic Knowledge, Phonotactics and Perceptual Validation for Automatic Language Identification
In: http://www.limsi.fr/RS2003FF/CHM2003/TLP2003/TLP5/Phonotactic.pdf
Abstract:
This study explores a multilingual phonotactic approach to automatic language identification using Broadcast News data. The definition of a multilingual phoneset is discussed, and an upper limit on the performance of the phonotactic approach is estimated by eliminating any degradation due to recognition errors. This upper bound is compared to automatic language identification based on a phonotactic approach. The eight languages of interest are Arabic, Mandarin, English, French, German, Italian, Portuguese and Spanish. A perceptual test was carried out to compare human and machine performance in similar configurations. Several phoneset classes were tested, ranging from a binary C/V distinction to a shared phone set of 70 phones. Experiments show that phonotactic constraints are, in theory, able to identify a language (among
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.195.1264 http://www.limsi.fr/RS2003FF/CHM2003/TLP2003/TLP5/Phonotactic.pdf