1 |
Developing an Embosi (Bantu C25) Speech Variant Dictionary to Model Vowel Elision and Morpheme Deletion
|
|
|
|
In: Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017) ; https://hal.archives-ouvertes.fr/hal-01837178
|
|
Abstract:
This paper investigates vowel elision and morpheme deletion in Embosi (Bantu C25), an under-resourced language spoken in the Republic of Congo. We propose that the observed morpheme deletion is morphological, and that vowel elision is phonological. The study focuses on vowel elision that occurs across word boundaries between the contact of long/short vowels (i.e. CV[long] # V[short].CV), and between the contact of short/short vowels (CV[short] # V[short].CV). Several different categories of morphemes are explored: (i) prepositions (ya, mo), (ii) class-noun nominal prefixes (ba, etc.), (iii) singular subject pronouns (ngá, nO, wa). For example, the preposition, ya, regularly deletes allowing for vowel elision if vowel contact occurs between the head of the noun phrase and the previous word. Phonetically motivated speech variants are proposed in the lexicon used for forced alignment (segmentation) enabling these phenomena to be quantified in the corpus so as to develop a dictionary containing relevant phonetic variants.
|
|
Keywords:
[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL]; [INFO] Computer Science [cs]; language modeling; phonetics; phonology; under-resourced languages
|
|
URL: https://hal.archives-ouvertes.fr/hal-01837178
|
|
BASE
|
|
|
|
2 |
Corpus based linguistic exploration via forced alignments with a ‘light-weight’ ASR tool
|
|
|
|
In: Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics, Nov 2017, Poznań, Poland (2017) ; https://hal.archives-ouvertes.fr/hal-01837174
|
|
BASE
|
|
|
|
|
|