1 | Processing rhythm in speech and music: Shared mechanisms and implications for developmental speech and language disorders.
In: ISSN: 0894-4105 ; Neuropsychology ; https://hal.archives-ouvertes.fr/hal-03384346 ; Neuropsychology, American Psychological Association, 2021, ⟨10.1037/neu0000766⟩ (2021)
BASE
2 | Automatic extraction of speech rhythm descriptors for speech intelligibility assessment in the context of Head and Neck Cancers
In: to appear ; INTERSPEECH 2021 ; https://hal-univ-tlse3.archives-ouvertes.fr/hal-03269227 ; INTERSPEECH 2021, ISCA : International Speech and Communication Association, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org (2021)
Abstract:
The temporal dimension of speech acoustics is rarely taken into account in automatic models for speech intelligibility evaluation, although the rhythmic recurrence of phonemes, syllables and prosodic groups is reportedly a good predictor of speech intelligibility. The present study aims to identify the automatic parameters that best account for the different levels of the speech signal's rhythmic structure, and to evaluate their correlation with a perceptual intelligibility measure. The parameters are extracted from the Fourier transform of the amplitude modulation of the signal (Envelope Modulation Spectrum, EMS) [1, 2]. A Lasso linear model is first used for feature selection to pick the most relevant parameters, and a support vector regression (SVR) analysis is then run to find the best combination of parameters. Our analyses of EMS, using data from the French cancer speech corpus C2SI [3], show strong performance of the automatic prediction, with a correlation of 0.70 between our model and an intelligibility score given by speech pathologists. In particular, the highest correlation with speech intelligibility is found for the ratio between the energy in the low frequency band (0.5-4 Hz, which represents slow rhythmic modulations indicative of prosodic groups) and the energy in the higher band (4-10 Hz, which represents fast rhythmic modulations such as phonemes).
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; Automatic Speech Processing; pathological speech; perceptual speech intelligibility; speech rhythm modeling
URL: https://hal-univ-tlse3.archives-ouvertes.fr/hal-03269227/file/Interspeech2021_1736_Paper.pdf https://hal-univ-tlse3.archives-ouvertes.fr/hal-03269227 https://hal-univ-tlse3.archives-ouvertes.fr/hal-03269227/document
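The abstract of entry 2 names one feature in particular: the ratio of envelope modulation energy in the 0.5-4 Hz band to that in the 4-10 Hz band. The following is a minimal sketch of how such a ratio could be computed, not the authors' implementation. The envelope extraction (frame-wise mean absolute amplitude), the 100 Hz envelope rate, the function names and the synthetic test tone are all assumptions made for illustration; only the two band limits come from the abstract.

```python
import math

ENV_RATE = 100  # envelope sampling rate in Hz (assumed, not from the paper)


def amplitude_envelope(signal, sample_rate, env_rate=ENV_RATE):
    """Coarse envelope: mean absolute amplitude over non-overlapping frames."""
    hop = sample_rate // env_rate
    n_frames = len(signal) // hop
    return [
        sum(abs(x) for x in signal[i * hop:(i + 1) * hop]) / hop
        for i in range(n_frames)
    ]


def band_energy(env, env_rate, f_lo, f_hi):
    """Sum of squared DFT magnitudes of the mean-removed envelope in [f_lo, f_hi)."""
    n = len(env)
    mean = sum(env) / n
    centered = [e - mean for e in env]
    energy = 0.0
    for k in range(1, n // 2 + 1):
        freq = k * env_rate / n  # frequency of DFT bin k in Hz
        if f_lo <= freq < f_hi:
            re = sum(c * math.cos(2 * math.pi * k * t / n)
                     for t, c in enumerate(centered))
            im = sum(c * math.sin(2 * math.pi * k * t / n)
                     for t, c in enumerate(centered))
            energy += re * re + im * im
    return energy


def low_high_ratio(signal, sample_rate):
    """Energy in 0.5-4 Hz divided by energy in 4-10 Hz (bands from the abstract)."""
    env = amplitude_envelope(signal, sample_rate)
    low = band_energy(env, ENV_RATE, 0.5, 4.0)
    high = band_energy(env, ENV_RATE, 4.0, 10.0)
    return low / high if high > 0 else float("inf")


def synthetic_signal(slow_depth, fast_depth, sample_rate=8000, seconds=2):
    """Toy data: 200 Hz tone amplitude-modulated at 2 Hz (slow) and 6 Hz (fast)."""
    out = []
    for i in range(sample_rate * seconds):
        t = i / sample_rate
        am = (1.0
              + slow_depth * math.sin(2 * math.pi * 2 * t)
              + fast_depth * math.sin(2 * math.pi * 6 * t))
        out.append(am * math.sin(2 * math.pi * 200 * t))
    return out


if __name__ == "__main__":
    print(low_high_ratio(synthetic_signal(0.8, 0.2), 8000))  # slow modulation dominates
    print(low_high_ratio(synthetic_signal(0.2, 0.8), 8000))  # fast modulation dominates
```

A signal dominated by slow modulation should give a ratio above 1, and one dominated by fast modulation a ratio below 1. The Lasso feature selection and SVR steps mentioned in the abstract are not covered by this sketch.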
BASE
3 | What you hear first, is what you get: Initial metrical cue presentation modulates syllable detection in sentence processing
In: ISSN: 1943-3921 ; EISSN: 1943-393X ; Attention, Perception, and Psychophysics ; https://hal.archives-ouvertes.fr/hal-03384366 ; Attention, Perception, and Psychophysics, Springer Verlag, 2021, 83, pp.1861 - 1877. ⟨10.3758/s13414-021-02251-y⟩ (2021)
BASE
4 | Tapping into linguistic rhythm
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 12, No 1 (2021); 11 ; 1868-6354 (2021)
BASE
5 | Age and gender effects in European Portuguese spontaneous speech ; Efectos de la edad y del sexo en el habla espontánea en portugués europeo
In: Loquens; Vol. 8 No. 1-2 (2021): Online First; e077 ; Loquens; Vol. 8 Núm. 1-2 (2021): Número en curso; e077 ; 2386-2637 ; 10.3989/loquens.2021.v8.i1-2 (2021)
BASE
8 | L’enjeu du traduire est de transformer toute la théorie du langage ...
BASE
9 | The iambic trochaic law in actual words: The case of English ...
BASE
10 | Production of nonce words to establish the cues for prominence and grouping in English ...
BASE
12 | In Time with the Beat: Entrainment in Patients with Phonological Impairment, Apraxia of Speech, and Parkinson’s Disease
In: Brain Sciences ; Volume 11 ; Issue 11 (2021)
BASE
13 | Towards a Comprehensive Account of Rhythm Processing Issues in Developmental Dyslexia
In: Brain Sciences ; Volume 11 ; Issue 10 (2021)
BASE
14 | Regressive cross-linguistic influence in multilingual speech rhythm ...
BASE
15 | Towards a Comprehensive Account of Rhythm Processing Issues in Developmental Dyslexia
In: Brain Sciences ; 11 (2021), 10. - 1303. - MDPI. - eISSN 2076-3425 (2021)
BASE
16 | Tapping into linguistic rhythm
In: Laboratory Phonology ; 12 (2021), 1. - 11. - Ubiquity Press. - eISSN 1868-6354 (2021)
BASE
17 | A cross‐linguistic study of multisensory perceptual narrowing in German and Swedish infants during the first year of life
BASE
18 | Production of French final stressed syllables in Accentual Phrase by Chinese learners: A pilot study
In: Proc. 10th International Conference on Speech Prosody 2020 ; 10th International Conference on Speech Prosody 2020 ; https://hal.archives-ouvertes.fr/hal-02620130 ; 10th International Conference on Speech Prosody 2020, May 2020, Tokyo, Japan. pp.895-899, ⟨10.21437/SpeechProsody.2020-183⟩ (2020)
BASE
19 | МЕЛОДИКА СОЧИНЕНИЯ ДЛЯ ТЕРМЕНВОКСА Л. КАВИНОЙ «МОНОЛОГ» ... : MELODY OF L. KAVINA'S COMPOSITION FOR THEREMIN “MONOLOGUE” ...
BASE
20 | The role of isochrony in speech perception in noise - Dataset ...
BASE