Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 28

1	Simulating reading mistakes for child speech Transformer-based phone recognition
	Gelin, Lucile; Pellegrini, Thomas; Pinquier, Julien...
	In: Annual Conference of the International Speech Communication Association (INTERSPEECH) ; https://hal.archives-ouvertes.fr/hal-03257870 ; Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

2	Vocal drum sounds in human beatboxing: An acoustic and articulatory exploration using electromagnetic articulography
	Paroni, Annalisa; Henrich Bernardoni, Nathalie; Savariaux, Christophe...
	In: ISSN: 0001-4966 ; EISSN: 1520-8524 ; Journal of the Acoustical Society of America ; https://hal.univ-grenoble-alpes.fr/hal-03107358 ; Journal of the Acoustical Society of America, Acoustical Society of America, 2021, 149 (1), pp.191-206. ⟨10.1121/10.0002921⟩ ; https://asa.scitation.org/doi/full/10.1121/10.0002921 (2021)
	BASE
	Show details

3	End-to-end acoustic modelling for phone recognition of young readers
	Gelin, Lucile; Daniel, Morgane; Pinquier, Julien...
	In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-03373156 ; Speech Communication, Elsevier : North-Holland, 2021, 134, pp.71-84. ⟨10.1016/j.specom.2021.08.003⟩ ; https://www.sciencedirect.com/science/article/pii/S0167639321000959?via%3Dihub (2021)
	BASE
	Show details

4	L'apport du geste dans l'acquisition de la prononciation en L2 via un outil d'apprentissage en ligne : une étude pilote
	Alazard-Guiu, Charlotte; Contreras Roa, Leonardo; Ferrané, Isabelle...
	In: Journées d'études du GIS Réseau d'acquisition des langues secondes (REAL2 2021) ; https://hal.archives-ouvertes.fr/hal-03428242 ; Journées d'études du GIS Réseau d'acquisition des langues secondes (REAL2 2021), Nov 2021, Paris, France ; http://www.inalco.fr/evenement/journees-etudes-gis-reseau-acquisition-langues-secondes-real2-acquisition-didactique-vice (2021)
	BASE
	Show details

5	Weakly supervised discourse segmentation for multiparty oral conversations
	Gravellier, Lila; Hunter, Julie; Muller, Philippe...
	In: 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021) ; https://hal.archives-ouvertes.fr/hal-03466161 ; 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), ACL: Association for Computational Linguistics, Nov 2021, Punta Cana, Dominican Republic. pp.1381-1392 ; https://aclanthology.org/2021.emnlp-main.104/ (2021)
	BASE
	Show details

6	End-to-end acoustic modelling for phone recognition of young readers ...
	Gelin, Lucile; Daniel, Morgane; Pinquier, Julien; Pellegrini, Thomas. - : arXiv, 2021
	Abstract: Automatic recognition systems for child speech are lagging behind those dedicated to adult speech in the race of performance. This phenomenon is due to the high acoustic and linguistic variability present in child speech caused by their body development, as well as the lack of available child speech data. Young readers speech additionally displays peculiarities, such as slow reading rate and presence of reading mistakes, that hardens the task. This work attempts to tackle the main challenges in phone acoustic modelling for young child speech with limited data, and improve understanding of strengths and weaknesses of a wide selection of model architectures in this domain. We find that transfer learning techniques are highly efficient on end-to-end architectures for adult-to-child adaptation with a small amount of child speech data. Through transfer learning, a Transformer model complemented with a Connectionist Temporal Classification (CTC) objective function, reaches a phone error rate of 28.1%, ... : 16 pages, 8 figures ...
	Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://arxiv.org/abs/2103.02899 https://dx.doi.org/10.48550/arxiv.2103.02899
	BASE
	Hide details

7	Weakly supervised discourse segmentation for multiparty oral conversations ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Ferrané, Isabelle; Gravellier, Lila. - : Underline Science Inc., 2021
	BASE
	Show details

8	The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
	Pellegrini, Thomas; Farinas, Jérome; Delpech, Estelle...
	In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419437 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.2993-2997 (2019)
	BASE
	Show details

9	Char+CV-CTC: Combining Graphemes and Consonant/Vowel Units for CTC-Based ASR Using Multitask Learning
	Heba, Abdelwahab; Pellegrini, Thomas; Lorré, Jean-Pierre...
	In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419431 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.1611-1615 (2019)
	BASE
	Show details

10	Comparaison de systèmes automatiques de reconnaissance grand vocabulaire appliqué à de la parole pathologique
	Farinas, Jérome; Pellegrini, Thomas; Pinquier, Julien
	In: Actes des 8e Journees de Phonetique Clinique ; 8e Journees de Phonetique Clinique (JPC 2019) ; https://hal.archives-ouvertes.fr/hal-02421557 ; 8e Journees de Phonetique Clinique (JPC 2019), May 2019, Mons, Belgique. pp.53-54 (2019)
	BASE
	Show details

11	Towards phonetic interpretability in deep learning applied to voice comparison
	Ferragne, Emmanuel; Gendrot, Cédric; Pellegrini, Thomas
	In: ICPhS ; https://halshs.archives-ouvertes.fr/halshs-02412948 ; ICPhS, Aug 2019, Melbourne, Australia. pp.ISBN 978-0-646-80069-1 (2019)
	BASE
	Show details

12	Deep learning and voice comparison: phonetically-motivated vs. automatically-learned features
	Gendrot, Cédric; Ferragne, Emmanuel; Pellegrini, Thomas
	In: ICPhS ; https://halshs.archives-ouvertes.fr/halshs-02412947 ; ICPhS, Aug 2019, Melbourne, Australia (2019)
	BASE
	Show details

13	Lexical Emphasis Detection in Spoken French using F-BANKs and neural networks
	Heba, Abdelwahab; Pellegrini, Thomas; Jorquera, Tom...
	In: SLSP 2017: Statistical Language and Speech Processing ; International Conference on Statistical Language and Speech Processing (SLSP 2017) ; https://hal.archives-ouvertes.fr/hal-02559768 ; International Conference on Statistical Language and Speech Processing (SLSP 2017), Oct 2017, Le Mans, France. pp.241-249 (2017)
	BASE
	Show details

14	Identification non-supervisée de pseudo-phones à l'aide de k-means et de réseaux convolutifs
	Manenti, Céline; Pellegrini, Thomas; Pinquier, Julien
	In: Actes de GRETSI 2017 ; 26e Colloque GRETSI sur le Traitement du Signal et des Images (GRETSI 2017) ; https://hal.archives-ouvertes.fr/hal-02559763 ; 26e Colloque GRETSI sur le Traitement du Signal et des Images (GRETSI 2017), Sep 2017, Juan-les-Pins, France. pp.1-4 (2017)
	BASE
	Show details

15	Unsupervised Speech Unit Discovery Using K-means and Neural Networks
	Manenti, Céline; Pellegrini, Thomas; Pinquier, Julien
	In: SLSP 2017: Statistical Language and Speech Processing ; 5th International Conference on Statistical Language and Speech Processing (SLSP 2017) ; https://hal.archives-ouvertes.fr/hal-02559766 ; 5th International Conference on Statistical Language and Speech Processing (SLSP 2017), Oct 2017, Le Mans, France. pp.169-180 (2017)
	BASE
	Show details

16	CNN-based phone segmentation experiments in a less-represented language
	Manenti, Céline; Pellegrini, Thomas; Pinquier, Julien
	In: Proceedings of INTERSPEECH 2016 Volume 2 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01500519 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp. 3549-3553 (2016)
	BASE
	Show details

17	Pronunciation assessment of Japanese learners of French with GOP scores and phonetic information
	Laborde, Vincent; Pellegrini, Thomas; Fontan, Lionel...
	In: Proceedings of INTERSPEECH 2016 ; Annual conference Interspeech (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01474896 ; Annual conference Interspeech (INTERSPEECH 2016), Sep 2016, San Francisco, CA, United States. pp.2686-2690, ⟨10.21437/Interspeech.2016-513⟩ (2016)
	BASE
	Show details

18	Traitement de la prononciation en langue étrangère : approches didactiques, méthodes automatiques et enjeux pour l'apprentissage
	Detey, Sylvain; Fontan, Lionel; Pellegrini, Thomas
	In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01919021 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2016, 57 (3), pp.15-39 (2016)
	BASE
	Show details

19	Inferring phonemic classes from CNN activation maps using clustering techniques
	Pellegrini, Thomas; Mouysset, Sandrine
	In: Proceedings of INTERSPEECH 2016 ; Annual conference Interspeech (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01474886 ; Annual conference Interspeech (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp. 1290-1294 (2016)
	BASE
	Show details

20	Automatic Assessment of Speech Capability Loss in Disordered Speech
	Pellegrini, Thomas; Fontan, Lionel; Mauclair, Julie...
	In: ISSN: 1936-7228 ; EISSN: 1936-7236 ; ACM Transactions on Accessible Computing ; https://hal.archives-ouvertes.fr/hal-01371812 ; ACM Transactions on Accessible Computing , ACM New York, NY, USA 2015, 6 (3), pp.1-14. ⟨10.1145/2739051⟩ (2015)
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern