41 |
The French-Algerian Code-Switching Triggered audio corpus (FACST)
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01837163 ; International Conference on Language Resources and Evaluation, ELRA, May 2018, Miyazaki, Japan (2018)
|
|
Abstract:
The French-Algerian Code-Switching Triggered corpus (FACST) was created to support a variety of studies in phonetics, prosody and natural language processing. The first aim of the FACST corpus is to collect a spontaneous code-switching (CS) speech corpus. To obtain a large quantity of spontaneous CS utterances in natural conversations, experiments were carried out on how to elicit CS. Applying a triggering protocol by means of code-switched questions was found to be effective in eliciting CS in the responses. To ensure good audio quality, all recordings were made in a soundproof room or in a very quiet room. This paper describes the FACST corpus, along with the principal steps for building a CS speech corpus in French-Algerian languages and the data collection steps. We also explain the selection criteria for the CS speakers and the recording protocols used. We present the methods used for data segmentation and annotation, and propose a conventional transcription of this type of speech in each language, with the aim of being well suited for both computational linguistic and acoustic-phonetic studies. We provide a quantitative description of the FACST corpus along with results of linguistic studies, and discuss some of the challenges we faced in collecting CS data.
|
|
Keywords:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Arabic; bilingual speakers; Code-switching; French; oral speech data
|
|
URL: https://hal.archives-ouvertes.fr/hal-01837163
|
|
BASE
|
|
|
|
42 |
Studying Vowel Variation in French-Algerian Arabic Code-switched Speech
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02387386 ; Annual Conference of the International Speech Communication Association, ISCA, Sep 2018, Hyderabad, India (2018)
|
|
BASE
|
|
|
|
44 |
Effective keyword search for low-resourced conversational speech
|
|
|
|
In: ICASSP 2017 ; https://hal.archives-ouvertes.fr/hal-01744176 ; ICASSP 2017, IEEE, Mar 2017, New Orleans, United States (2017)
|
|
BASE
|
|
|
|
45 |
Developing an Embosi (Bantu C25) Speech Variant Dictionary to Model Vowel Elision and Morpheme Deletion
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837178 ; Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
|
|
46 |
Schwa Realization in French: Using Automatic Speech Processing to Study Phonological and Socio-linguistic Factors in Large Corpora
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837179 ; Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
|
|
47 |
Addressing Code-Switching in French/Algerian Arabic Speech
|
|
|
|
In: Interspeech 2017 ; https://halshs.archives-ouvertes.fr/halshs-01969148 ; Interspeech 2017, Aug 2017, Stockholm, Sweden. pp.62-66, ⟨10.21437/interspeech.2017-1373⟩ (2017)
|
|
BASE
|
|
|
|
48 |
An investigation into language model data augmentation for low-resourced STT and KWS
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01837171 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Mar 2017, New Orleans, United States (2017)
|
|
BASE
|
|
|
|
49 |
Addressing Code-Switching in French/Algerian Arabic Speech
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837206 ; Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
|
|
50 |
Corpus based linguistic exploration via forced alignments with a ‘light-weight’ ASR tool
|
|
|
|
In: Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics ; https://hal.archives-ouvertes.fr/hal-01837174 ; Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics, Nov 2017, Poznań, Poland (2017)
|
|
BASE
|
|
|
|
51 |
Discovering speech reductions across speaking styles and languages
|
|
|
|
In: Rethinking reduction: Interdisciplinary perspectives on conditions, mechanisms, and domains for phonetic variation ; https://halshs.archives-ouvertes.fr/halshs-01507312 ; Cangemi, F., Clayards M., Niebuhr O., Schuppler B., & Zellers M. Rethinking reduction: Interdisciplinary perspectives on conditions, mechanisms, and domains for phonetic variation, De Gruyter Mouton 2017 (2017)
|
|
BASE
|
|
|
|
52 |
Phonetic variation and contrast neutralization patterns in Romanian fricatives across different speaking styles
|
|
|
|
In: Diversity and Speech Dynamics ; https://hal.archives-ouvertes.fr/hal-01837181 ; Diversity and Speech Dynamics, May 2017, Herrsching am Ammersee, Germany (2017)
|
|
BASE
|
|
|
|
54 |
Language Recognition for Dialects and Closely Related Languages
|
|
|
|
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
|
|
BASE
|
|
|
|
55 |
Language Model Data Augmentation for Keyword Spotting
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837186 ; Annual Conference of the International Speech Communication Association, Jan 2016, San Francisco, United States (2016)
|
|
BASE
|
|
|
|
56 |
Investigating techniques for low resource conversational speech recognition
|
|
|
|
In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016) ; https://hal-univ-lemans.archives-ouvertes.fr/hal-01515254 ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Mar 2016, Shanghai, China. pp.5975-5979, ⟨10.1109/ICASSP.2016.7472824⟩ ; www.icassp2016.org (2016)
|
|
BASE
|
|
|
|
57 |
Multimodal Emotion Recognition for AVEC 2016 Challenge
|
|
|
|
In: Audio/Visual Emotion Challenge ; https://hal.archives-ouvertes.fr/hal-01837203 ; Audio/Visual Emotion Challenge, ACM, Oct 2016, Amsterdam, Netherlands (2016)
|
|
BASE
|
|
|
|
58 |
Marginal Contrast Among Romanian Vowels: Evidence from ASR and Functional Load
|
|
|
|
In: Interspeech 2016 ; https://hal.archives-ouvertes.fr/hal-01453014 ; Interspeech 2016, ISCA, Sep 2016, San Francisco, United States. pp.2433 - 2437, ⟨10.21437/Interspeech.2016-762⟩ ; http://www.interspeech2016.org/ (2016)
|
|
BASE
|
|
|
|
59 |
Réalisation phonétique et contraste phonologique marginal : une étude automatique des voyelles du roumain
|
|
|
|
In: JEP 2016 ; https://hal.archives-ouvertes.fr/hal-01452974 ; JEP 2016, Aug 2016, Paris, France (2016)
|
|
BASE
|
|
|
|
60 |
BULB: Breaking the Unwritten Language Barrier
|
|
|
|
In: Procedia Computer Science ; Computational Methods for Endangered Language Documentation and Description ; https://hal.archives-ouvertes.fr/hal-01836496 ; Computational Methods for Endangered Language Documentation and Description, May 2016, Yogyakarta, Indonesia. pp.8-14, ⟨10.1016/j.procs.2016.04.023⟩ (2016)
|
|
BASE
|
|
|
|
|
|