1 |
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Estimating Vocal Tract Resonances of Synthesized High-Pitched Vowels Using CNN ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Characterizing News Portrayal of Civil Unrest in Hong Kong, 1998–2020 ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Jibes & Delights: A Dataset of Targeted Insults and Compliments to Tackle Online Abuse ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Face Biometric Spoof Detection Method Using a Remote Photoplethysmography Signal
|
|
|
|
In: Sensors; Volume 22; Issue 8; Pages: 3070 (2022)
|
|
BASE
|
|
Show details
|
|
8 |
ECG Recurrence Plot-Based Arrhythmia Classification Using Two-Dimensional Deep Residual CNN Features
|
|
|
|
In: Sensors; Volume 22; Issue 4; Pages: 1660 (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Automatic Classification Framework of Tongue Feature Based on Convolutional Neural Networks
|
|
|
|
In: Micromachines; Volume 13; Issue 4; Pages: 501 (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Short Text Aspect-Based Sentiment Analysis Based on CNN + BiGRU
|
|
|
|
In: Applied Sciences; Volume 12; Issue 5; Pages: 2707 (2022)
|
|
BASE
|
|
Show details
|
|
11 |
Impact of Sentence Representation Matching in Neural Machine Translation
|
|
|
|
In: Applied Sciences; Volume 12; Issue 3; Pages: 1313 (2022)
|
|
BASE
|
|
Show details
|
|
12 |
A Combined Text-Based and Metadata-Based Deep-Learning Framework for the Detection of Spam Accounts on the Social Media Platform Twitter
|
|
|
|
In: Processes; Volume 10; Issue 3; Pages: 439 (2022)
|
|
BASE
|
|
Show details
|
|
13 |
Application of Modern Digital Systems and Approaches to Business Process Management
|
|
|
|
In: Sustainability; Volume 14; Issue 3; Pages: 1697 (2022)
|
|
BASE
|
|
Show details
|
|
14 |
BenSignNet: Bengali Sign Language Alphabet Recognition Using Concatenated Segmentation and Convolutional Neural Network
|
|
|
|
In: Applied Sciences; Volume 12; Issue 8; Pages: 3933 (2022)
|
|
Abstract:
Sign language recognition is one of the most challenging applications in machine learning and human-computer interaction. Many researchers have developed classification models for different sign languages such as English, Arabic, Japanese, and Bengali; however, no significant research has been done on the general-shape performance for different datasets. Most research work has achieved satisfactory performance with a small dataset. These models may fail to replicate the same performance for evaluating different and larger datasets. In this context, this paper proposes a novel method for recognizing Bengali sign language (BSL) alphabets to overcome the issue of generalization. The proposed method has been evaluated with three benchmark datasets such as ‘38 BdSL’, ‘KU-BdSL’, and ‘Ishara-Lipi’. Here, three steps are followed to achieve the goal: segmentation, augmentation, and Convolutional neural network (CNN) based classification. Firstly, a concatenated segmentation approach with YCbCr, HSV and watershed algorithm was designed to accurately identify gesture signs. Secondly, seven image augmentation techniques are selected to increase the training data size without changing the semantic meaning. Finally, the CNN-based model called BenSignNet was applied to extract the features and classify purposes. The performance accuracy of the model achieved 94.00%, 99.60%, and 99.60% for the BdSL Alphabet, KU-BdSL, and Ishara-Lipi datasets, respectively. Experimental findings confirmed that our proposed method achieved a higher recognition rate than the conventional ones and accomplished a generalization property in all datasets for the BSL domain.
|
|
Keyword:
38-BdSL; Bengali sign language (BSL); concatenated segmentation; Convolutional neural network (CNN); Hue saturation value (HSV); Ishara-Lipi; KU-BdSL; Luminance blue red (YCbCr)
|
|
URL: https://doi.org/10.3390/app12083933
|
|
BASE
|
|
Hide details
|
|
15 |
Bangla Sign Language (BdSL) Alphabets and Numerals Classification Using a Deep Learning Model
|
|
|
|
In: Sensors; Volume 22; Issue 2; Pages: 574 (2022)
|
|
BASE
|
|
Show details
|
|
16 |
TASE: Task-Aware Speech Enhancement for Wake-Up Word Detection in Voice Assistants
|
|
|
|
In: Applied Sciences; Volume 12; Issue 4; Pages: 1974 (2022)
|
|
BASE
|
|
Show details
|
|
17 |
Factors Influencing Students’ Intention to Use E-Textbooks and Their Impact on Academic Achievement in Bilingual Environment: An Empirical Study Jordan
|
|
|
|
In: Information; Volume 13; Issue 5; Pages: 233 (2022)
|
|
BASE
|
|
Show details
|
|
18 |
Active Binaural Auditory Perceptual System for a Socially Interactive Humanoid Robot
|
|
|
|
In: Engineering Proceedings; Volume 12; Issue 1; Pages: 83 (2022)
|
|
BASE
|
|
Show details
|
|
19 |
Rethinking the Methods and Algorithms for Inner Speech Decoding and Making Them Reproducible
|
|
|
|
In: NeuroSci; Volume 3; Issue 2; Pages: 226-244 (2022)
|
|
BASE
|
|
Show details
|
|
20 |
Multitask Pointer Network for Multi-Representational Parsing
|
|
|
|
BASE
|
|
Show details
|
|
|
|