21 |
Challenges in Audio Processing of Terrorist-Related Data
|
|
|
|
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
|
|
BASE
|
|
Show details
|
|
22 |
Impact of post-lexical context and speech style on word-final /ʁ/ realization in French using large corpora and automatic speech processing
|
|
|
|
In: R-atics 6 ; https://hal.archives-ouvertes.fr/hal-03041044 ; R-atics 6, Nov 2019, Paris, France (2019)
|
|
BASE
|
|
Show details
|
|
23 |
Synchronic variation and sound change in Romance languages: a corpus-based study of lenition phenomena in Romanian and Spanish
|
|
|
|
In: Linguistic Symposium on Romance Languages ; https://hal.archives-ouvertes.fr/hal-02336116 ; Linguistic Symposium on Romance Languages, May 2019, Athens, United States (2019)
|
|
BASE
|
|
Show details
|
|
24 |
Final devoicing in the 'pool of variation': A large-scale corpora approach with automatic alignment
|
|
|
|
In: Phonetics and Phonology in Europe Conference ; https://hal.archives-ouvertes.fr/hal-02336112 ; Phonetics and Phonology in Europe Conference, Jun 2019, Lecce, Italy (2019)
|
|
BASE
|
|
Show details
|
|
25 |
Final devoicing of fricatives in French: Studying variation in large-scale corpora with automatic alignment
|
|
|
|
In: Proceedings of the 19th International Congress of Phonetic Sciences ; 19th International Congress of Phonetic Sciences ; https://hal.archives-ouvertes.fr/hal-02270089 ; 19th International Congress of Phonetic Sciences, 2019, Melbourne, Australia. pp.295-299 ; https://assta.org/proceedings/ICPhS2019/ (2019)
|
|
BASE
|
|
Show details
|
|
26 |
Variation in Pluricentric Mandarin Using Large Corpus: a forced alignment-based duration and tone frequency study
|
|
|
|
In: Pluricentric Languages in Speech Technology - Satellite Workshop at Interspeech 2019 ; https://hal.archives-ouvertes.fr/hal-03041066 ; Pluricentric Languages in Speech Technology - Satellite Workshop at Interspeech 2019, Sep 2019, Graz, Austria (2019)
|
|
BASE
|
|
Show details
|
|
27 |
A Speaking Atlas of Minority Languages of France: Collection and Analyses of Dialectical Data
|
|
|
|
In: International Congress of Phonetic Sciences ; https://hal.archives-ouvertes.fr/hal-02387368 ; International Congress of Phonetic Sciences, Sasha Calhoun, Paola Escudero, Marija Tabain and Paul Warren (Eds.), Aug 2019, Melbourne, Australia (2019)
|
|
BASE
|
|
Show details
|
|
28 |
"Gra[f]e!" Word-final devoicing of obstruents in Standard French: An acoustic study based on large corpora
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02336119 ; Annual Conference of the International Speech Communication Association, ISCA, Sep 2019, Graz, Austria. DOI:10.21437/Interspeech.2019-2329 (2019)
|
|
BASE
|
|
Show details
|
|
30 |
Speech technologies as an aide for large-scale linguistic exploration
|
|
|
|
In: International Conference on the Computational Processing of Portuguese ; https://hal.archives-ouvertes.fr/hal-02387383 ; International Conference on the Computational Processing of Portuguese, Sep 2018, Canela, Brazil (2018)
|
|
BASE
|
|
Show details
|
|
31 |
An automatic study of lenition of intra-lexical intervocalic /bdg/ and coda -s in Peninsular vs America spanish
|
|
|
|
In: Laboratory Phonology Conference ; https://hal.archives-ouvertes.fr/hal-01837162 ; Laboratory Phonology Conference, Jun 2018, Lisbonne, Portugal (2018)
|
|
BASE
|
|
Show details
|
|
32 |
Connected speech in Romanian: Exploring sound change through an ASR system
|
|
|
|
In: Production and perception mechanisms of sound change ; https://hal.archives-ouvertes.fr/hal-03127939 ; D. Recasens and F. Sánchez Miret (Eds.). Production and perception mechanisms of sound change, Lincom Europa, pp.129-143, 2018 (2018)
|
|
BASE
|
|
Show details
|
|
33 |
Exploring Temporal Reduction in Dialectal Spanish: A Large-scale Study of Lenition of Voiced Stops and Coda-s
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02387395 ; Annual Conference of the International Speech Communication Association, ISCA, Sep 2018, Hyderabad, India (2018)
|
|
BASE
|
|
Show details
|
|
34 |
Studying Vowel Variation in French-Algerian Arabic Code-switched Speech
|
|
|
|
In: Interspeech 2018 ; https://halshs.archives-ouvertes.fr/halshs-01969143 ; Interspeech 2018, Sep 2018, Hyderabad,, India. ⟨10.21437/interspeech.2018-2381⟩ (2018)
|
|
BASE
|
|
Show details
|
|
35 |
Studying Vowel Variation in French-Algerian Arabic Code-switched Speech
|
|
|
|
In: Interspeech 2018 ; https://halshs.archives-ouvertes.fr/halshs-02130906 ; Interspeech 2018, Sep 2018, Hyderabad, India. pp.2753-2757, ⟨10.21437/Interspeech.2018-2381⟩ (2018)
|
|
BASE
|
|
Show details
|
|
36 |
The French-Algerian Code-Switching Triggered audio corpus (FACST)
|
|
|
|
In: LREC 2018, Eleventh International Conference on Language Resources and Evaluation ; LREC 2018 11th edition of the Language Resources and Evaluation Conference, ; https://halshs.archives-ouvertes.fr/halshs-01969152 ; LREC 2018 11th edition of the Language Resources and Evaluation Conference,, May 2018, Miyazaki, Japan (2018)
|
|
BASE
|
|
Show details
|
|
37 |
Conversational telephone speech recognition for Lithuanian
|
|
|
|
In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01837147 ; Computer Speech and Language, Elsevier, 2018, 49, pp.71-82 (2018)
|
|
Abstract:
International audience ; he research presented in the paper addresses conversational telephone speechrecognition and keyword spotting for the Lithuanian language. Lithuanian can beconsidered a low e-resourced language as little transcribed audio data, and more generally,only limited linguistic resources are available electronically. Part of this research exploresthe impact of reducing the amount of linguistic knowledge and manual supervision whendeveloping the transcription system. Since designing a pronunciation dictionary requireslanguage-specific expertise, the need for manual supervision was assessed by comparingphonemic and graphemic units for acoustic modeling. Although the Lithuanian language isgenerally described in the linguistic literature with 56 phonemes, under low-resourcedconditions some phonemes may not be sufficiently observed to be modeled. Therefore different phoneme inventories were explored to assess the effects of explicitly modeling diphthongs, affricates and soft consonants. The impact of using Web data for language modeling and additional untranscribed audio data for semi-supervised training was also measured. Out-of-vocabulary (OOV) keywords are a well-known challenge for keyword search. While word-based keyword search is quite effective for in-vocabulary words, OOV keywords are largely undetected. Morpheme-based subword units are compared with character n-gram-based units for their capacity to detect OOV keywords. Experimental results are reported for two training conditions defined in the IARPA Babel program: the full language pack and the very limited language pack, for which, respectively, 40 h and 3 h of transcribed training data are available. For both conditions, grapheme-based and phoneme-based models are shown to obtain comparable transcription and keyword spotting results. The use of Web texts for language modeling is shown to significantly improve both speech recognition and keyword spotting performance. Combining full-word and subword units leads to the best keyword spotting results.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Conversational telephone speech; Keyword spotting; Lithuanian; Speech-to-text
|
|
URL: https://hal.archives-ouvertes.fr/hal-01837147
|
|
BASE
|
|
Hide details
|
|
38 |
Studying variation in Romanian: deletion of the definite article -l in continuous speech
|
|
|
|
In: Linguistic Vanguard ; https://hal.archives-ouvertes.fr/hal-01837197 ; Linguistic Vanguard, 2018, 5 (1), 17p (2018)
|
|
BASE
|
|
Show details
|
|
39 |
Parallel Corpora in Mboshi (Bantu C25, Congo-Brazzaville)
|
|
|
|
In: 11th edition of the Language Resources and Evaluation Conference (LREC 2018) ; https://hal.archives-ouvertes.fr/hal-01710043 ; 11th edition of the Language Resources and Evaluation Conference (LREC 2018), ELRA, May 2018, Miyazaki, Japan (2018)
|
|
BASE
|
|
Show details
|
|
40 |
A corpus based study of morpheme deletion in a low resourced language: A case study for Embosi
|
|
|
|
In: Annual Meeting of the Linguistic Society of America ; https://hal.archives-ouvertes.fr/hal-01837164 ; Annual Meeting of the Linguistic Society of America, Jan 2018, Salt Lake City, United States (2018)
|
|
BASE
|
|
Show details
|
|
|
|