1 |
Hippocampal and auditory contributions to speech segmentation
In: ISSN: 0010-9452 ; Cortex ; https://hal.archives-ouvertes.fr/hal-03604957 ; Cortex, Elsevier, 2022, ⟨10.1016/j.cortex.2022.01.017⟩ (2022)
BASE
2 |
Speaking clearly improves speech segmentation by statistical learning under optimal listening conditions ...
BASE
3 |
The effect of lengthening aspiration on speech segmentation ...
BASE
4 |
Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation
In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03372487 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
BASE
5 |
Speaking clearly improves speech segmentation by statistical learning under optimal listening conditions
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 12, No 1 (2021); 14 ; 1868-6354 (2021)
BASE
8 |
Production of nonce words to establish the cues for prominence and grouping in English ...
BASE
10 |
The Iambic Trochaic Law in speech: The case of Japanese ...
BASE
12 |
Production of French final stressed syllables in Accentual Phrase by Chinese learners: A pilot study
In: Proc. 10th International Conference on Speech Prosody 2020, May 2020, Tokyo, Japan. pp.895-899, ⟨10.21437/SpeechProsody.2020-183⟩ ; https://hal.archives-ouvertes.fr/hal-02620130 (2020)
BASE
13 |
Visual Speech Mitigates the Influence of Speech Rate on Speech Perception
BASE
14 |
Subsegments and the emergence of segments
In: Proceedings of the Linguistic Society of America; Vol 4 (2019): Proceedings of the Linguistic Society of America; 37:1–8 ; 2473-8689 (2019)
BASE
15 |
Using Baby Sign As A Facilitating Technique To Segment Speech With 4-18-Month-Old Monolingual And Bilingual Hearing Infants-An Extension Study
In: Open Access Theses & Dissertations (2018)
BASE
16 |
Acoustic Models for the Automatic Identification of Prosodic Boundaries in Spontaneous Speech / Modelos acústicos para a identificação automática de fronteiras prosódicas na fala espontânea
In: Revista de Estudos da Linguagem, Vol 26, Iss 4, Pp 1455-1488 (2018)
Abstract:
This work presents an analysis of multiple acoustic parameters used to build a model for the automatic segmentation of speech into tone units. Based on a literature review, we defined sets of acoustic parameters related to the signaling of terminal and non-terminal boundaries. For each parameter, we extracted a series of measurements: 6 for speech rate and rhythm, 34 for duration, 65 for fundamental frequency, 4 for intensity, and 2 related to pauses. These parameters were extracted from spontaneous speech fragments that had previously been segmented into tone units manually by 14 human annotators. We used two statistical classification methods, Random Forest (RF) and Linear Discriminant Analysis (LDA), to generate models for identifying prosodic boundaries. After several phases of training and testing, both methods were relatively successful in identifying terminal and non-terminal boundaries. LDA predicted terminal and non-terminal boundaries more accurately than RF, so the model obtained with LDA was refined further. The resulting terminal boundary model is based on 20 acoustic measurements and shows a convergence of 80% with the boundaries identified by annotators in the speech sample. For non-terminal boundaries, we arrived at three models that, combined, showed a convergence of 98% with the boundaries identified by annotators in the sample.
Keyword:
Language and Literature; P; P1-1091; Philology. Linguistics; prosodic boundaries; speech segmentation; spontaneous speech
URL: https://doi.org/10.17851/2237-2083.26.4.1455-1488 https://doaj.org/article/9040a43b4c4d4f7989fbab382dc78a22
BASE
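The abstract of entry 16 reports that a Linear Discriminant Analysis model was used to separate prosodic boundaries from non-boundaries. The sketch below is only a rough illustration of that kind of classifier, not the authors' model: it fits a two-class LDA with equal priors on two hypothetical features (pause duration in seconds and F0 drop in semitones). The toy values, the feature choice, and the names `fit_lda` and `predict` are all assumptions made for the example; the paper itself uses many more measurements.

```python
from statistics import mean


def fit_lda(X, y):
    """Fit a two-class LDA on 2-D points, assuming equal class priors."""
    groups = {c: [x for x, t in zip(X, y) if t == c] for c in (0, 1)}
    mu = {c: (mean(p[0] for p in pts), mean(p[1] for p in pts))
          for c, pts in groups.items()}

    # Pooled within-class scatter matrix [[a, b], [b, d]].
    # Using scatter instead of covariance only rescales w and the
    # threshold together, so the decision rule is unchanged.
    a = b = d = 0.0
    for c, pts in groups.items():
        for p in pts:
            dx, dy = p[0] - mu[c][0], p[1] - mu[c][1]
            a += dx * dx
            b += dx * dy
            d += dy * dy
    det = a * d - b * b
    if det == 0:
        raise ValueError("scatter matrix is singular")

    # Discriminant direction w = S^-1 (mu1 - mu0), via the 2x2 inverse.
    mx, my = mu[1][0] - mu[0][0], mu[1][1] - mu[0][1]
    w = ((d * mx - b * my) / det, (a * my - b * mx) / det)

    # With equal priors the boundary passes through the midpoint of the means.
    mid = ((mu[0][0] + mu[1][0]) / 2, (mu[0][1] + mu[1][1]) / 2)
    threshold = w[0] * mid[0] + w[1] * mid[1]
    return w, threshold


def predict(model, x):
    """Return 1 (boundary) or 0 (no boundary) for one feature pair."""
    w, threshold = model
    return int(w[0] * x[0] + w[1] * x[1] > threshold)


# Invented toy data: (pause duration in s, F0 drop in semitones).
X = [(0.02, 0.5), (0.05, 1.0), (0.03, 0.2), (0.08, 1.5),   # no boundary
     (0.30, 3.5), (0.45, 4.0), (0.25, 2.5), (0.50, 5.0)]   # boundary
y = [0, 0, 0, 0, 1, 1, 1, 1]

model = fit_lda(X, y)
print(predict(model, (0.40, 4.0)), predict(model, (0.04, 0.6)))
```

On real data, a library implementation with held-out evaluation would be the sensible choice; the hand-rolled version here only makes the projection-and-threshold idea visible.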
17 |
Acoustic Correlates of Prosodic Boundaries in French: A Review of Corpus Data / Correlatos acústicos de fronteiras prosódicas em francês: uma revisão de dados de corpora
In: Revista de Estudos da Linguagem, Vol 26, Iss 4, Pp 1531-1549 (2018)
BASE
18 |
Learning novel phonotactics from exposure to continuous speech
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 8, No 1 (2017); 12 ; 1868-6354 (2017)
BASE
19 |
French Speech Segmentation in Liaison Contexts by L1 and L2 Listeners
In: Laboratory Phonology: Journal of the Association for Laboratory Phonology; Vol 7, No 1 (2016); 17 ; 1868-6354 (2016)
BASE
20 |
Speech segmentation and speaker diarisation for transcription and translation
BASE