
Search in the Catalogues and Directories

Hits 1 – 20 of 28

1
Simulating reading mistakes for child speech Transformer-based phone recognition
In: Annual Conference of the International Speech Communication Association (INTERSPEECH) ; https://hal.archives-ouvertes.fr/hal-03257870 ; Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2021, Brno, Czech Republic (2021)
BASE
2
Vocal drum sounds in human beatboxing: An acoustic and articulatory exploration using electromagnetic articulography
In: ISSN: 0001-4966 ; EISSN: 1520-8524 ; Journal of the Acoustical Society of America ; https://hal.univ-grenoble-alpes.fr/hal-03107358 ; Journal of the Acoustical Society of America, Acoustical Society of America, 2021, 149 (1), pp.191-206. ⟨10.1121/10.0002921⟩ ; https://asa.scitation.org/doi/full/10.1121/10.0002921 (2021)
BASE
3
End-to-end acoustic modelling for phone recognition of young readers
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-03373156 ; Speech Communication, Elsevier : North-Holland, 2021, 134, pp.71-84. ⟨10.1016/j.specom.2021.08.003⟩ ; https://www.sciencedirect.com/science/article/pii/S0167639321000959?via%3Dihub (2021)
BASE
4
L'apport du geste dans l'acquisition de la prononciation en L2 via un outil d'apprentissage en ligne : une étude pilote
In: Journées d'études du GIS Réseau d'acquisition des langues secondes (REAL2 2021) ; https://hal.archives-ouvertes.fr/hal-03428242 ; Journées d'études du GIS Réseau d'acquisition des langues secondes (REAL2 2021), Nov 2021, Paris, France ; http://www.inalco.fr/evenement/journees-etudes-gis-reseau-acquisition-langues-secondes-real2-acquisition-didactique-vice (2021)
BASE
5
Weakly supervised discourse segmentation for multiparty oral conversations
In: 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021) ; https://hal.archives-ouvertes.fr/hal-03466161 ; 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), ACL: Association for Computational Linguistics, Nov 2021, Punta Cana, Dominican Republic. pp.1381-1392 ; https://aclanthology.org/2021.emnlp-main.104/ (2021)
BASE
6
End-to-end acoustic modelling for phone recognition of young readers ...
Abstract: Automatic recognition systems for child speech are lagging behind those dedicated to adult speech in the race of performance. This phenomenon is due to the high acoustic and linguistic variability present in child speech caused by their body development, as well as the lack of available child speech data. Young readers' speech additionally displays peculiarities, such as a slow reading rate and the presence of reading mistakes, that make the task harder. This work attempts to tackle the main challenges in phone acoustic modelling for young child speech with limited data, and improve understanding of strengths and weaknesses of a wide selection of model architectures in this domain. We find that transfer learning techniques are highly efficient on end-to-end architectures for adult-to-child adaptation with a small amount of child speech data. Through transfer learning, a Transformer model complemented with a Connectionist Temporal Classification (CTC) objective function reaches a phone error rate of 28.1%, ... : 16 pages, 8 figures ...
Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://arxiv.org/abs/2103.02899
https://dx.doi.org/10.48550/arxiv.2103.02899
BASE
7
Weakly supervised discourse segmentation for multiparty oral conversations ...
BASE
8
The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419437 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.2993-2997 (2019)
BASE
9
Char+CV-CTC: Combining Graphemes and Consonant/Vowel Units for CTC-Based ASR Using Multitask Learning
In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419431 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.1611-1615 (2019)
BASE
10
Comparaison de systèmes automatiques de reconnaissance grand vocabulaire appliqué à de la parole pathologique
In: Actes des 8e Journees de Phonetique Clinique ; 8e Journees de Phonetique Clinique (JPC 2019) ; https://hal.archives-ouvertes.fr/hal-02421557 ; 8e Journees de Phonetique Clinique (JPC 2019), May 2019, Mons, Belgique. pp.53-54 (2019)
BASE
11
Towards phonetic interpretability in deep learning applied to voice comparison
In: ICPhS ; https://halshs.archives-ouvertes.fr/halshs-02412948 ; ICPhS, Aug 2019, Melbourne, Australia. ISBN 978-0-646-80069-1 (2019)
BASE
12
Deep learning and voice comparison: phonetically-motivated vs. automatically-learned features
In: ICPhS ; https://halshs.archives-ouvertes.fr/halshs-02412947 ; ICPhS, Aug 2019, Melbourne, Australia (2019)
BASE
13
Lexical Emphasis Detection in Spoken French using F-BANKs and neural networks
In: SLSP 2017: Statistical Language and Speech Processing ; International Conference on Statistical Language and Speech Processing (SLSP 2017) ; https://hal.archives-ouvertes.fr/hal-02559768 ; International Conference on Statistical Language and Speech Processing (SLSP 2017), Oct 2017, Le Mans, France. pp.241-249 (2017)
BASE
14
Identification non-supervisée de pseudo-phones à l'aide de k-means et de réseaux convolutifs
In: Actes de GRETSI 2017 ; 26e Colloque GRETSI sur le Traitement du Signal et des Images (GRETSI 2017) ; https://hal.archives-ouvertes.fr/hal-02559763 ; 26e Colloque GRETSI sur le Traitement du Signal et des Images (GRETSI 2017), Sep 2017, Juan-les-Pins, France. pp.1-4 (2017)
BASE
15
Unsupervised Speech Unit Discovery Using K-means and Neural Networks
In: SLSP 2017: Statistical Language and Speech Processing ; 5th International Conference on Statistical Language and Speech Processing (SLSP 2017) ; https://hal.archives-ouvertes.fr/hal-02559766 ; 5th International Conference on Statistical Language and Speech Processing (SLSP 2017), Oct 2017, Le Mans, France. pp.169-180 (2017)
BASE
16
CNN-based phone segmentation experiments in a less-represented language
In: Proceedings of INTERSPEECH 2016 Volume 2 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01500519 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp. 3549-3553 (2016)
BASE
17
Pronunciation assessment of Japanese learners of French with GOP scores and phonetic information
In: Proceedings of INTERSPEECH 2016 ; Annual conference Interspeech (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01474896 ; Annual conference Interspeech (INTERSPEECH 2016), Sep 2016, San Francisco, CA, United States. pp.2686-2690, ⟨10.21437/Interspeech.2016-513⟩ (2016)
BASE
18
Traitement de la prononciation en langue étrangère : approches didactiques, méthodes automatiques et enjeux pour l'apprentissage
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01919021 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2016, 57 (3), pp.15-39 (2016)
BASE
19
Inferring phonemic classes from CNN activation maps using clustering techniques
In: Proceedings of INTERSPEECH 2016 ; Annual conference Interspeech (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01474886 ; Annual conference Interspeech (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp. 1290-1294 (2016)
BASE
20
Automatic Assessment of Speech Capability Loss in Disordered Speech
In: ISSN: 1936-7228 ; EISSN: 1936-7236 ; ACM Transactions on Accessible Computing ; https://hal.archives-ouvertes.fr/hal-01371812 ; ACM Transactions on Accessible Computing, ACM New York, NY, USA 2015, 6 (3), pp.1-14. ⟨10.1145/2739051⟩ (2015)
BASE

