1 |
Automatic audiovisual synchronisation for ultrasound tongue imaging ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Automatic audiovisual synchronisation for ultrasound tongue imaging
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Using ultrasound tongue imaging to support the phonetic transcription of childhood speech sound disorders
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Automatic audiovisual synchronisation for ultrasound tongue imaging
|
|
|
|
Abstract:
Ultrasound tongue imaging is used to visualise the intra-oral articulators during speech production. It is utilised in a range of applications, including speech and language therapy and phonetics research. Ultrasound and speech audio are recorded simultaneously, and in order to correctly use this data, the two modalities should be correctly synchronised. Synchronisation is achieved using specialised hardware at recording time, but this approach can fail in practice resulting in data of limited usability. In this paper, we address the problem of automatically synchronising ultrasound and audio after data collection. We first investigate the tolerance of expert ultrasound users to synchronisation errors in order to find the thresholds for error detection. We use these thresholds to define accuracy scoring boundaries for evaluating our system. We then describe our approach for automatic synchronisation, which is driven by a self-supervised neural network, exploiting the correlation between the two signals to synchronise them. We train our model on data from multiple domains with different speaker characteristics, different equipment, and different recording environments, and achieve an accuracy >92.4% on held-out in-domain data. Finally, we introduce a novel resource, the Cleft dataset, which we gathered with a new clinical subgroup and for which hardware synchronisation proved unreliable. We apply our model to this out-of-domain data, and evaluate its performance subjectively with expert users. Results show that users prefer our model's output over the original hardware output 79.3% of the time. Our results demonstrate the strength of our approach and its ability to generalise to data from new domains.
|
|
Keyword:
461104 - Neural networks
|
|
URL: https://hdl.handle.net/1959.7/uws:62528 https://doi.org/10.1016/j.specom.2021.05.008
|
|
BASE
|
|
Hide details
|
|
5 |
[In Press] Using ultrasound tongue imaging to support the phonetic transcription of childhood speech sound disorders
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Evaluation of Parent- and Speech-Language Pathologist–Delivered Multiple Oppositions Intervention for Children With Phonological Impairment: A Multiple-Baseline Design Study
|
|
|
|
In: ETSU Faculty Works (2020)
|
|
BASE
|
|
Show details
|
|
7 |
Evaluation of parent and speech-language pathologist delivered multiple oppositions intervention for children with phonological impairment : a multiple-baseline design study
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Evaluation of parent- and speech-language pathologist-delivered multiple oppositions intervention for children with phonological impairment : a multiple-baseline design study
|
|
|
|
BASE
|
|
Show details
|
|
9 |
The impact of real-time articulatory information on phonetic transcription : ultrasound-aided transcription in cleft lip and palate speech
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Parent- and SLP-delivered multiple oppositions (Sugden et al., 2019) ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Parent- and SLP-delivered multiple oppositions (Sugden et al., 2019) ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Parents’ Experiences of Completing Home Practice for Speech Sound Disorders
|
|
|
|
In: ETSU Faculty Works (2019)
|
|
BASE
|
|
Show details
|
|
13 |
The impact of real-time articulatory information on phonetic transcription: Ultrasound-aided transcription in cleft lip and palate speech
|
|
|
|
BASE
|
|
Show details
|
|
14 |
The Impact of Real-Time Articulatory Information on Phonetic Transcription: Ultrasound-Aided Transcription in Cleft Lip and Palate Speech.
|
|
|
|
In: eissn: 1421-9972 (2019)
|
|
BASE
|
|
Show details
|
|
15 |
The Impact of Real-Time Articulatory Information on Phonetic Transcription: Ultrasound-Aided Transcription in Cleft Lip and Palate Speech.
|
|
|
|
In: eissn: 1421-9972 (2019)
|
|
BASE
|
|
Show details
|
|
16 |
The impact of real-time articulatory information on phonetic transcription : ultrasound-aided transcription in cleft lip and palate speech
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Improving the reliability of phonetic transcription in cleft lip and palate using ultrasound tongue imaging
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Parents' experiences of completing home practice for speech sound disorders
|
|
|
|
BASE
|
|
Show details
|
|
19 |
ULTRAX2020 : Ultrasound Technology for Optimising the Treatment of Speech Disorders : Clinicians' Resource Manual ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Service Delivery and Intervention Intensity for Phonology‐Based Speech Sound Disorders
|
|
|
|
In: ETSU Faculty Works (2018)
|
|
BASE
|
|
Show details
|
|
|
|