DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...81
Hits 1 – 20 of 1.603

1
Learning and controlling the source-filter representation of speech with a variational autoencoder
In: https://hal.archives-ouvertes.fr/hal-03650569 ; 2022 (2022)
BASE
Show details
2
Genetic Neural Architecture Search for automatic assessment of human sperm images
In: ISSN: 0957-4174 ; Expert Systems with Applications ; https://hal.archives-ouvertes.fr/hal-03585035 ; Expert Systems with Applications, Elsevier, 2022 (2022)
BASE
Show details
3
Unsupervised quantification of entity consistency between photos and text in real-world news ...
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
BASE
Show details
4
Multi language Email Classification Using Transfer learning
BASE
Show details
5
Jibes & Delights: A Dataset of Targeted Insults and Compliments to Tackle Online Abuse​ ...
BASE
Show details
6
Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach ...
BASE
Show details
7
Lexicon-Based vs. Bert-Based Sentiment Analysis: A Comparative Study in Italian
In: Electronics; Volume 11; Issue 3; Pages: 374 (2022)
BASE
Show details
8
COVID-19 Vaccination-Related Sentiments Analysis: A Case Study Using Worldwide Twitter Dataset
In: Healthcare; Volume 10; Issue 3; Pages: 411 (2022)
BASE
Show details
9
A Novel Pathological Voice Identification Technique through Simulated Cochlear Implant Processing Systems
In: Applied Sciences; Volume 12; Issue 5; Pages: 2398 (2022)
BASE
Show details
10
Considering Commonsense in Solving QA: Reading Comprehension with Semantic Search and Continual Learning
In: Applied Sciences; Volume 12; Issue 9; Pages: 4099 (2022)
BASE
Show details
11
Multimodal Lip-Reading for Tracheostomy Patients in the Greek Language
In: Computers; Volume 11; Issue 3; Pages: 34 (2022)
Abstract: Voice loss constitutes a crucial disorder which is highly associated with social isolation. The use of multimodal information sources, such as, audiovisual information, is crucial since it can lead to the development of straightforward personalized word prediction models which can reproduce the patient’s original voice. In this work we designed a multimodal approach based on audiovisual information from patients before loss-of-voice to develop a system for automated lip-reading in the Greek language. Data pre-processing methods, such as, lip-segmentation and frame-level sampling techniques were used to enhance the quality of the imaging data. Audio information was incorporated in the model to automatically annotate sets of frames as words. Recurrent neural networks were trained on four different video recordings to develop a robust word prediction model. The model was able to correctly identify test words in different time frames with 95% accuracy. To our knowledge, this is the first word prediction model that is trained to recognize words from video recordings in the Greek language.
Keyword: deep learning; lip reading; multimodal interfaces; tracheostomy
URL: https://doi.org/10.3390/computers11030034
BASE
Hide details
12
Identifying Learners’ Interaction Patterns in an Online Learning Community
In: International Journal of Environmental Research and Public Health; Volume 19; Issue 4; Pages: 2245 (2022)
BASE
Show details
13
Analysis of the Full-Size Russian Corpus of Internet Drug Reviews with Complex NER Labeling Using Deep Learning Neural Networks and Language Models
In: Applied Sciences; Volume 12; Issue 1; Pages: 491 (2022)
BASE
Show details
14
An Evolution Gaining Momentum—The Growing Role of Artificial Intelligence in the Diagnosis and Treatment of Spinal Diseases
In: Diagnostics; Volume 12; Issue 4; Pages: 836 (2022)
BASE
Show details
15
Artificial Intelligence in Digestive Endoscopy—Where Are We and Where Are We Going?
In: Diagnostics; Volume 12; Issue 4; Pages: 927 (2022)
BASE
Show details
16
Detection of Chinese Deceptive Reviews Based on Pre-Trained Language Model
In: Applied Sciences; Volume 12; Issue 7; Pages: 3338 (2022)
BASE
Show details
17
Transformer-Based Abstractive Summarization for Reddit and Twitter: Single Posts vs. Comment Pools in Three Languages
In: Future Internet; Volume 14; Issue 3; Pages: 69 (2022)
BASE
Show details
18
The Sustainable Development of Intangible Cultural Heritage with AI: Cantonese Opera Singing Genre Classification Based on CoGCNet Model in China
In: Sustainability; Volume 14; Issue 5; Pages: 2923 (2022)
BASE
Show details
19
Artificial Intelligence and Machine Learning in the Diagnosis and Management of Gastroenteropancreatic Neuroendocrine Neoplasms—A Scoping Review
In: Diagnostics; Volume 12; Issue 4; Pages: 874 (2022)
BASE
Show details
20
Automatic Classification Framework of Tongue Feature Based on Convolutional Neural Networks
In: Micromachines; Volume 13; Issue 4; Pages: 501 (2022)
BASE
Show details

Page: 1 2 3 4 5...81

Catalogues
2
0
0
0
0
0
2
Bibliographies
3
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1.598
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern