25 |
Новые данные о речевом дыхании по результатам онлайновой магнитно-резонансной томографии легких ... : New data on speech breathing upon results from online Magnetic Resonance Imaging ...
|
|
|
|
BASE
|
|
Show details
|
|
26 |
Appendix: Training Multilingual Writing Strategies in Higher Education ... : Anhang zu Training Multilingual Writing Strategies in Higher Education ...
|
|
|
|
BASE
|
|
Show details
|
|
27 |
Interference in processing of Czech intraclausal garden-path structures - yes/no questions ...
|
|
|
|
BASE
|
|
Show details
|
|
29 |
Agreement attraction in English and Czech: A direct experimental comparison ...
|
|
|
|
BASE
|
|
Show details
|
|
31 |
Interference in processing of Czech intraclausal garden-path structures - open-ended questions ...
|
|
|
|
BASE
|
|
Show details
|
|
32 |
Current & Future Research Directions in Singapore Mandarin ...
|
|
|
|
BASE
|
|
Show details
|
|
33 |
Use of Parsing Heuristics in the Comprehension of Passive Sentences: Evidence from Dyslexia and Individual Differences
|
|
|
|
In: Brain Sciences; Volume 12; Issue 2; Pages: 209 (2022)
|
|
BASE
|
|
Show details
|
|
34 |
What Is Going on with Visual Attention in Reading and Dyslexia? A Critical Review of Recent Studies
|
|
|
|
In: Brain Sciences; Volume 12; Issue 1; Pages: 87 (2022)
|
|
BASE
|
|
Show details
|
|
35 |
Can Heritage Speakers Predict Lexical and Morphosyntactic Information in Reading?
|
|
|
|
In: Languages; Volume 7; Issue 1; Pages: 60 (2022)
|
|
BASE
|
|
Show details
|
|
36 |
Multimodal Lip-Reading for Tracheostomy Patients in the Greek Language
|
|
|
|
In: Computers; Volume 11; Issue 3; Pages: 34 (2022)
|
|
Abstract:
Voice loss constitutes a crucial disorder which is highly associated with social isolation. The use of multimodal information sources, such as, audiovisual information, is crucial since it can lead to the development of straightforward personalized word prediction models which can reproduce the patient’s original voice. In this work we designed a multimodal approach based on audiovisual information from patients before loss-of-voice to develop a system for automated lip-reading in the Greek language. Data pre-processing methods, such as, lip-segmentation and frame-level sampling techniques were used to enhance the quality of the imaging data. Audio information was incorporated in the model to automatically annotate sets of frames as words. Recurrent neural networks were trained on four different video recordings to develop a robust word prediction model. The model was able to correctly identify test words in different time frames with 95% accuracy. To our knowledge, this is the first word prediction model that is trained to recognize words from video recordings in the Greek language.
|
|
Keyword:
deep learning; lip reading; multimodal interfaces; tracheostomy
|
|
URL: https://doi.org/10.3390/computers11030034
|
|
BASE
|
|
Hide details
|
|
37 |
FedQAS: Privacy-Aware Machine Reading Comprehension with Federated Learning
|
|
|
|
In: Applied Sciences; Volume 12; Issue 6; Pages: 3130 (2022)
|
|
BASE
|
|
Show details
|
|
38 |
Beyond the Edge: Markerless Pose Estimation of Speech Articulators from Ultrasound and Camera Images Using DeepLabCut
|
|
|
|
In: Sensors; Volume 22; Issue 3; Pages: 1133 (2022)
|
|
BASE
|
|
Show details
|
|
39 |
Literacy Acquisition Trajectories in Bilingual Language Minority Children and Monolingual Peers with Similar or Different SES: A Three-Year Longitudinal Study
|
|
|
|
In: Brain Sciences; Volume 12; Issue 5; Pages: 563 (2022)
|
|
BASE
|
|
Show details
|
|
40 |
Remote Dyslexia Screening for Bilingual Children
|
|
|
|
In: Multimodal Technologies and Interaction; Volume 6; Issue 1; Pages: 7 (2022)
|
|
BASE
|
|
Show details
|
|
|
|