1 |
Simulating reading mistakes for child speech Transformer-based phone recognition
In: Annual Conference of the International Speech Communication Association (INTERSPEECH) ; https://hal.archives-ouvertes.fr/hal-03257870 ; Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2021, Brno, Czech Republic (2021)
2 |
Vocal drum sounds in human beatboxing: An acoustic and articulatory exploration using electromagnetic articulography
In: ISSN: 0001-4966 ; EISSN: 1520-8524 ; Journal of the Acoustical Society of America ; https://hal.univ-grenoble-alpes.fr/hal-03107358 ; Journal of the Acoustical Society of America, Acoustical Society of America, 2021, 149 (1), pp.191-206. ⟨10.1121/10.0002921⟩ ; https://asa.scitation.org/doi/full/10.1121/10.0002921 (2021)
3 |
End-to-end acoustic modelling for phone recognition of young readers
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-03373156 ; Speech Communication, Elsevier : North-Holland, 2021, 134, pp.71-84. ⟨10.1016/j.specom.2021.08.003⟩ ; https://www.sciencedirect.com/science/article/pii/S0167639321000959?via%3Dihub (2021)
4 |
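L'apport du geste dans l'acquisition de la prononciation en L2 via un outil d'apprentissage en ligne : une étude pilote [The contribution of gesture to L2 pronunciation acquisition via an online learning tool: a pilot study]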
In: Journées d'études du GIS Réseau d'acquisition des langues secondes (REAL2 2021) ; https://hal.archives-ouvertes.fr/hal-03428242 ; Journées d'études du GIS Réseau d'acquisition des langues secondes (REAL2 2021), Nov 2021, Paris, France ; http://www.inalco.fr/evenement/journees-etudes-gis-reseau-acquisition-langues-secondes-real2-acquisition-didactique-vice (2021)
5 |
Weakly supervised discourse segmentation for multiparty oral conversations
In: 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021) ; https://hal.archives-ouvertes.fr/hal-03466161 ; 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), ACL: Association for Computational Linguistics, Nov 2021, Punta Cana, Dominican Republic. pp.1381-1392 ; https://aclanthology.org/2021.emnlp-main.104/ (2021)
6 |
End-to-end acoustic modelling for phone recognition of young readers ...
7 |
Weakly supervised discourse segmentation for multiparty oral conversations ...
8 |
The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419437 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.2993-2997 (2019)
9 |
Char+CV-CTC: Combining Graphemes and Consonant/Vowel Units for CTC-Based ASR Using Multitask Learning
In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419431 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.1611-1615 (2019)
10 |
Comparaison de systèmes automatiques de reconnaissance grand vocabulaire appliqué à de la parole pathologique [Comparison of automatic large-vocabulary recognition systems applied to pathological speech]
In: Actes des 8e Journees de Phonetique Clinique ; 8e Journees de Phonetique Clinique (JPC 2019) ; https://hal.archives-ouvertes.fr/hal-02421557 ; 8e Journees de Phonetique Clinique (JPC 2019), May 2019, Mons, Belgium. pp.53-54 (2019)
11 |
Towards phonetic interpretability in deep learning applied to voice comparison
In: ICPhS ; https://halshs.archives-ouvertes.fr/halshs-02412948 ; ICPhS, Aug 2019, Melbourne, Australia. ISBN 978-0-646-80069-1 (2019)
12 |
Deep learning and voice comparison: phonetically-motivated vs. automatically-learned features
In: ICPhS ; https://halshs.archives-ouvertes.fr/halshs-02412947 ; ICPhS, Aug 2019, Melbourne, Australia (2019)
Abstract:
Broadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ extracted from radio broadcast corpora were used to recognize 45 speakers with a deep convolutional neural network (CNN). The same network was also trained with 62 phonetic parameters to i) see if the resulting confusions were identical to those made by the CNN trained with spectrograms, and ii) understand which acoustic parameters were used by the network. The two networks had identical discrimination results 68% of the time. In 22% of the data, the network trained with spectrograms achieved successful discrimination while the network trained with phonetic parameters failed, and the reverse was found in 10% of the data. We display the relevant phonetic parameters with raw values and values relative to the speakers' means and show cases favouring bad discrimination results. When the network trained with spectrograms failed to discriminate between some tokens, parameters related to f0 proved significant.
Keyword:
[SHS.LANGUE]Humanities and Social Sciences/Linguistics; deep learning; forensic phonetics; phonetic parameters; voice comparison; vowels
URL: https://halshs.archives-ouvertes.fr/halshs-02412947/document https://halshs.archives-ouvertes.fr/halshs-02412947 https://halshs.archives-ouvertes.fr/halshs-02412947/file/Gendrot_Ferragne_Pellegrini.pdf
13 |
Lexical Emphasis Detection in Spoken French using F-BANKs and neural networks
In: SLSP 2017: Statistical Language and Speech Processing ; International Conference on Statistical Language and Speech Processing (SLSP 2017) ; https://hal.archives-ouvertes.fr/hal-02559768 ; International Conference on Statistical Language and Speech Processing (SLSP 2017), Oct 2017, Le Mans, France. pp.241-249 (2017)
14 |
Identification non-supervisée de pseudo-phones à l'aide de k-means et de réseaux convolutifs [Unsupervised identification of pseudo-phones using k-means and convolutional networks]
In: Actes de GRETSI 2017 ; 26e Colloque GRETSI sur le Traitement du Signal et des Images (GRETSI 2017) ; https://hal.archives-ouvertes.fr/hal-02559763 ; 26e Colloque GRETSI sur le Traitement du Signal et des Images (GRETSI 2017), Sep 2017, Juan-les-Pins, France. pp.1-4 (2017)
15 |
Unsupervised Speech Unit Discovery Using K-means and Neural Networks
In: SLSP 2017: Statistical Language and Speech Processing ; 5th International Conference on Statistical Language and Speech Processing (SLSP 2017) ; https://hal.archives-ouvertes.fr/hal-02559766 ; 5th International Conference on Statistical Language and Speech Processing (SLSP 2017), Oct 2017, Le Mans, France. pp.169-180 (2017)
16 |
CNN-based phone segmentation experiments in a less-represented language
In: Proceedings of INTERSPEECH 2016 Volume 2 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01500519 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp. 3549-3553 (2016)
17 |
Pronunciation assessment of Japanese learners of French with GOP scores and phonetic information
In: Proceedings of INTERSPEECH 2016 ; Annual conference Interspeech (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01474896 ; Annual conference Interspeech (INTERSPEECH 2016), Sep 2016, San Francisco, CA, United States. pp.2686-2690, ⟨10.21437/Interspeech.2016-513⟩ (2016)
18 |
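Traitement de la prononciation en langue étrangère : approches didactiques, méthodes automatiques et enjeux pour l'apprentissage [Processing foreign-language pronunciation: didactic approaches, automatic methods, and challenges for learning]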
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01919021 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2016, 57 (3), pp.15-39 (2016)
19 |
Inferring phonemic classes from CNN activation maps using clustering techniques
In: Proceedings of INTERSPEECH 2016 ; Annual conference Interspeech (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01474886 ; Annual conference Interspeech (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp. 1290-1294 (2016)
20 |
Automatic Assessment of Speech Capability Loss in Disordered Speech
In: ISSN: 1936-7228 ; EISSN: 1936-7236 ; ACM Transactions on Accessible Computing ; https://hal.archives-ouvertes.fr/hal-01371812 ; ACM Transactions on Accessible Computing, ACM New York, NY, USA, 2015, 6 (3), pp.1-14. ⟨10.1145/2739051⟩ (2015)