1 |
Finding the best way to put media bias research into practice via an annotation app ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Multimodal Lip-Reading for Tracheostomy Patients in the Greek Language
|
|
|
|
In: Computers; Volume 11; Issue 3; Pages: 34 (2022)
|
|
Abstract:
Voice loss constitutes a crucial disorder which is highly associated with social isolation. The use of multimodal information sources, such as, audiovisual information, is crucial since it can lead to the development of straightforward personalized word prediction models which can reproduce the patient’s original voice. In this work we designed a multimodal approach based on audiovisual information from patients before loss-of-voice to develop a system for automated lip-reading in the Greek language. Data pre-processing methods, such as, lip-segmentation and frame-level sampling techniques were used to enhance the quality of the imaging data. Audio information was incorporated in the model to automatically annotate sets of frames as words. Recurrent neural networks were trained on four different video recordings to develop a robust word prediction model. The model was able to correctly identify test words in different time frames with 95% accuracy. To our knowledge, this is the first word prediction model that is trained to recognize words from video recordings in the Greek language.
|
|
Keyword:
deep learning; lip reading; multimodal interfaces; tracheostomy
|
|
URL: https://doi.org/10.3390/computers11030034
|
|
BASE
|
|
Hide details
|
|
3 |
Towards Portuguese Sign Language Identification Using Deep Learning
|
|
|
|
BASE
|
|
Show details
|
|
4 |
SonAmi: A Tangible Creativity Support Tool for Productive Procrastination
|
|
|
|
In: C&C ’21 - 13th ACM Conference on Creativity & Cognition ; https://hal.inria.fr/hal-03442565 ; C&C ’21 - 13th ACM Conference on Creativity & Cognition, Jun 2021, Virtual Event, Italy. pp.1-10, ⟨10.1145/3450741.3465250⟩ (2021)
|
|
BASE
|
|
Show details
|
|
5 |
A global-scale screening of non-native aquatic organisms to identify potentially invasive species under current and future climate conditions
|
|
|
|
In: ISSN: 0048-9697 ; EISSN: 1879-1026 ; Science of the Total Environment ; https://hal.univ-lorraine.fr/hal-03544887 ; Science of the Total Environment, Elsevier, 2021, 788, pp.147868. ⟨10.1016/j.scitotenv.2021.147868⟩ (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Toucher le son d’avant d’écrire (Touching Sound, Learning Writing)
|
|
|
|
In: Actes ERGO'IA 2021 ; ERGO'IA 2021 ; https://hal.archives-ouvertes.fr/hal-03365473 ; ERGO'IA 2021, Oct 2021, Bidart, France (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Potential yield simulated by global gridded crop models: using a process-based emulator to explain their differences
|
|
|
|
In: ISSN: 1991-959X ; Geoscientific Model Development ; https://hal.archives-ouvertes.fr/hal-03188035 ; Geoscientific Model Development, European Geosciences Union, 2021, 14, pp.1639 - 1656. ⟨10.5194/gmd-14-1639-2021⟩ (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Dictionaries Integrated into English Learning Apps: Critical Comments and Suggestions for Improvement
|
|
|
|
In: Lexikos; Vol. 31 (2021); 68-92 ; 2224-0039 (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Clitics are not enough: on agreement and null subjects in Brazilian Venetan
|
|
|
|
In: Glossa: a journal of general linguistics; Vol 6, No 1 (2021); 86 ; 2397-1835 (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Minimax Feature Merge: The Featural Linguistic Turing Machine ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Minimax Feature Merge: The Featural Linguistic Turing Machine ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Enter the Matrix: What the new brain-computer interfaces teach us about agency, privacy, and human subjectivity
|
|
|
|
In: The iJournal: Graduate Student Journal of the Faculty of Information; Vol 6 No 2 (2021): Spring 2021 ; 2561-7397 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Voice-user interfaces for TESOL: Potential and receptiveness among native and non-native English speaking instructors
|
|
Kent, David. - : University of Hawaii National Foreign Language Resource Center, 2021. : Center for Language & Technology, 2021. : (co-sponsored by Center for Open Educational Resources and Language Learning, University of Texas at Austin), 2021
|
|
BASE
|
|
Show details
|
|
14 |
Evolution of human computer interaction
|
|
|
|
In: Sci. Visualization ; Scientific Visualization (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Natural Language Processing for Lexical Corpus Analysis
|
|
|
|
In: Doctoral Dissertations (2021)
|
|
BASE
|
|
Show details
|
|
16 |
Audio-driven Character Animation
|
|
|
|
In: Doctoral Dissertations (2021)
|
|
BASE
|
|
Show details
|
|
17 |
Zeitgeist: Modelando um projeto editorial com interface digital
|
|
|
|
In: Pandaemonium Germanicum: Revista de Estudos Germanísticos, Vol 24, Iss 42 (2021) (2021)
|
|
BASE
|
|
Show details
|
|
18 |
Dictionaries Integrated into English Learning Apps: Critical Comments and Suggestions for Improvement
|
|
|
|
In: Lexikos, Vol 31, Pp 68-92 (2021) (2021)
|
|
BASE
|
|
Show details
|
|
19 |
Explaining Rainfall Accumulations over Several Days in the French Alps Using Low-Dimensional Atmospheric Predictors Based on Analogy
|
|
|
|
In: ISSN: 1558-8424 ; EISSN: 1558-8432 ; Journal of Applied Meteorology and Climatology ; https://hal.archives-ouvertes.fr/hal-03087661 ; Journal of Applied Meteorology and Climatology, American Meteorological Society, 2020, 59 (2), pp.237-250. ⟨10.1175/JAMC-D-19-0112.1⟩ (2020)
|
|
BASE
|
|
Show details
|
|
20 |
Voks: Digital instruments for chironomic control of voice samples
|
|
|
|
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-03009712 ; Speech Communication, Elsevier : North-Holland, 2020, 125, pp.97 - 113. ⟨10.1016/j.specom.2020.10.002⟩ (2020)
|
|
BASE
|
|
Show details
|
|
|
|