Page: 1 2 3 4 5 6 7 8... 870
61 |
Word separation in continuous sign language using isolated signs and post-processing ...
|
|
|
|
BASE
|
|
Show details
|
|
62 |
Exploring Sub-skeleton Trajectories for Interpretable Recognition of Sign Language ...
|
|
|
|
BASE
|
|
Show details
|
|
63 |
ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language ...
|
|
|
|
BASE
|
|
Show details
|
|
64 |
TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials ...
|
|
|
|
Abstract:
Thai Finger Spelling (TFS) sign recognition could benefit a community of hearing-difficulty people in bridging to a major hearing population. With a relatively large number of alphabets, TFS employs multiple signing schemes. Two schemes of more common signing -- static and dynamic single-hand signing, widely used in other sign languages -- have been addressed in several previous works. To complete the TFS sign recognition, the remaining two of quite distinct signing schemes -- static and dynamic point-on-hand signing -- need to be sufficiently addressed. With the advent of many off-the-shelf hand skeleton prediction models and that training a model to recognize a sign language from scratch is expensive, we explore an approach building upon recently launched MediaPipe Hands (MPH). MPH is a high-precision well-trained model for hand-keypoint detection. We have investigated MPH on three TFS schemes: static-single-hand (S1), simplified dynamic-single-hand (S2) and static-point-on-hand (P1) schemes. Our results ... : 19 pages, 10 figures ...
|
|
Keyword:
Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences; Human-Computer Interaction cs.HC
|
|
URL: https://dx.doi.org/10.48550/arxiv.2201.03170 https://arxiv.org/abs/2201.03170
|
|
BASE
|
|
Hide details
|
|
65 |
Sign Language Video Retrieval with Free-Form Textual Queries ...
|
|
|
|
BASE
|
|
Show details
|
|
68 |
Sign Language Recognition System using TensorFlow Object Detection API ...
|
|
|
|
BASE
|
|
Show details
|
|
69 |
Τρισδιάστατη ανακατασκευή ανθρωπίνου σώματος, χεριών και προσώπου με εφαρμογές στην αναγνώριση νοηματικής γλώσσας ...
|
|
|
|
BASE
|
|
Show details
|
|
70 |
Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
71 |
hate-alert@DravidianLangTech-ACL2022: Ensembling Multi-Modalities for Tamil TrollMeme Classification ...
|
|
|
|
BASE
|
|
Show details
|
|
72 |
Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework ...
|
|
|
|
BASE
|
|
Show details
|
|
73 |
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
74 |
3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos ...
|
|
|
|
BASE
|
|
Show details
|
|
76 |
EnvEdit: Environment Editing for Vision-and-Language Navigation ...
|
|
|
|
BASE
|
|
Show details
|
|
77 |
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
80 |
IterVM: Iterative Vision Modeling Module for Scene Text Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8... 870
|
|