1 | The Impact of Removing Head Movements on Audio-visual Speech Enhancement
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
BASE
2 | An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
BASE
3 | BBC-Oxford British Sign Language Dataset
In: https://hal.archives-ouvertes.fr/hal-03516444 ; 2022 (2022)
BASE
4 | Can machines learn to see without visual databases?
In: https://hal.archives-ouvertes.fr/hal-03526569 ; 2022 (2022)
BASE
5 | Large-scale Bilingual Language-Image Contrastive Learning ...
BASE
6 | Bridging Video-text Retrieval with Multiple Choice Questions ...
BASE
9 | Towards a Perceptual Model for Estimating the Quality of Visual Speech ...
BASE
10 | An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production ...
BASE
11 | Expression-preserving face frontalization improves visually assisted speech processing ...
BASE
12 | WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language ...
BASE
13 | Modeling Intensification for Sign Language Generation: A Computational Approach ...
BASE
14 | Keypoint based Sign Language Translation without Glosses ...
BASE
15 | A Transformer-Based Contrastive Learning Approach for Few-Shot Sign Language Recognition ...
BASE
16 | A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation ...
BASE
17 | Including Facial Expressions in Contextual Embeddings for Sign Language Generation ...
BASE
18 | Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production ...
BASE
19 | Statistical and Spatio-temporal Hand Gesture Features for Sign Language Recognition using the Leap Motion Sensor ...
BASE
20 | Multi-View Spatial-Temporal Network for Continuous Sign Language Recognition ...
Abstract:
Sign language is a beautiful visual language and is also the primary language used by people with speech and hearing impairments. However, sign language has many complex expressions that are difficult for the general public to understand and master. Sign language recognition algorithms will significantly facilitate communication between hearing-impaired people and hearing people. Traditional continuous sign language recognition often uses a sequence learning method based on a Convolutional Neural Network (CNN) and a Long Short-Term Memory network (LSTM). These methods learn spatial and temporal features separately and therefore cannot capture the complex spatial-temporal features of sign language. LSTMs also have difficulty learning long-term dependencies. To alleviate these problems, this paper proposes a multi-view spatial-temporal continuous sign language recognition network. The network consists of three parts. The first part is a Multi-View Spatial-Temporal Feature Extractor Network (MSTN), which can directly extract the ... : 12 pages, 4 figures ...
Keyword:
Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences; I.2.7; I.2.10
URL: https://dx.doi.org/10.48550/arxiv.2204.08747 https://arxiv.org/abs/2204.08747
BASE