1 |
The "Fat Face" illusion: A robust adaptation for processing pairs of faces
|
|
|
|
In: ISSN: 0042-6989 ; EISSN: 0042-6989 ; Vision Research ; https://hal.archives-ouvertes.fr/hal-03579276 ; Vision Research, Elsevier, 2022, 195, pp.108015. ⟨10.1016/j.visres.2022.108015⟩ (2022)
|
|
BASE
|
|
2 |
The Impact of Removing Head Movements on Audio-visual Speech Enhancement
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
|
|
BASE
|
|
3 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
4 |
BBC-Oxford British Sign Language Dataset
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03516444 ; 2022 (2022)
|
|
BASE
|
|
5 |
Can machines learn to see without visual databases?
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03526569 ; 2022 (2022)
|
|
BASE
|
|
6 |
Unsupervised quantification of entity consistency between photos and text in real-world news ...
|
|
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
|
|
BASE
|
|
7 |
Large-scale Bilingual Language-Image Contrastive Learning ...
|
|
|
|
BASE
|
|
8 |
Bridging Video-text Retrieval with Multiple Choice Questions ...
|
|
|
|
BASE
|
|
9 |
Towards a Perceptual Model for Estimating the Quality of Visual Speech ...
|
|
|
|
BASE
|
|
10 |
An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production ...
|
|
|
|
BASE
|
|
11 |
Expression-preserving face frontalization improves visually assisted speech processing ...
|
|
|
|
BASE
|
|
12 |
WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language ...
|
|
|
|
BASE
|
|
13 |
Modeling Intensification for Sign Language Generation: A Computational Approach ...
|
|
|
|
BASE
|
|
14 |
Keypoint based Sign Language Translation without Glosses ...
|
|
|
|
Abstract:
Sign Language Translation (SLT) has received relatively little study compared with Sign Language Recognition (SLR). However, SLR only recognizes the unique grammar of sign language, which differs from that of spoken language, and its output is not easy for non-disabled people to interpret. We therefore address the problem of translating sign language video directly into spoken language. To this end, we propose a new keypoint normalization method that performs translation based on the signer's skeleton points and robustly normalizes these points for sign language translation. Normalization customized to each body part contributed to improved performance. In addition, we propose a stochastic frame selection method that enables frame augmentation and sampling at the same time. Finally, translation into spoken language is performed by an attention-based translation model. Our method can be applied to various datasets in a way that ... : 14 pages, 5 figures, IEEE Sensors Journals ...
|
|
Keyword:
Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2204.10511 ; https://dx.doi.org/10.48550/arxiv.2204.10511
|
|
BASE
|
|
15 |
A Transformer-Based Contrastive Learning Approach for Few-Shot Sign Language Recognition ...
|
|
|
|
BASE
|
|
16 |
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation ...
|
|
|
|
BASE
|
|
17 |
Including Facial Expressions in Contextual Embeddings for Sign Language Generation ...
|
|
|
|
BASE
|
|
18 |
Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production ...
|
|
|
|
BASE
|
|
19 |
Statistical and Spatio-temporal Hand Gesture Features for Sign Language Recognition using the Leap Motion Sensor ...
|
|
|
|
BASE
|
|
20 |
Multi-View Spatial-Temporal Network for Continuous Sign Language Recognition ...
|
|
|
|
BASE
|
|