Page: 1 2 3 4 5 6 7 8... 870
61 |
Word separation in continuous sign language using isolated signs and post-processing ...
|
|
|
|
BASE
|
|
Show details
|
|
62 |
Exploring Sub-skeleton Trajectories for Interpretable Recognition of Sign Language ...
|
|
|
|
BASE
|
|
Show details
|
|
63 |
ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language ...
|
|
|
|
BASE
|
|
Show details
|
|
64 |
TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials ...
|
|
|
|
BASE
|
|
Show details
|
|
65 |
Sign Language Video Retrieval with Free-Form Textual Queries ...
|
|
|
|
BASE
|
|
Show details
|
|
68 |
Sign Language Recognition System using TensorFlow Object Detection API ...
|
|
|
|
BASE
|
|
Show details
|
|
69 |
Τρισδιάστατη ανακατασκευή ανθρωπίνου σώματος, χεριών και προσώπου με εφαρμογές στην αναγνώριση νοηματικής γλώσσας ...
|
|
|
|
BASE
|
|
Show details
|
|
70 |
Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
71 |
hate-alert@DravidianLangTech-ACL2022: Ensembling Multi-Modalities for Tamil TrollMeme Classification ...
|
|
|
|
Abstract:
Social media platforms often act as breeding grounds for various forms of trolling or malicious content targeting users or communities. One way of trolling users is by creating memes, which in most cases unites an image with a short piece of text embedded on top of it. The situation is more complex for multilingual(e.g., Tamil) memes due to the lack of benchmark datasets and models. We explore several models to detect Troll memes in Tamil based on the shared task, "Troll Meme Classification in DravidianLangTech2022" at ACL-2022. We observe while the text-based model MURIL performs better for Non-troll meme classification, the image-based model VGG16 performs better for Troll-meme classification. Further fusing these two modalities help us achieve stable outcomes in both classes. Our fusion model achieved a 0.561 weighted average F1 score and ranked second in this task. ... : Accepted at ACL 2022 DravidianLangTech Workshop ...
|
|
Keyword:
Computation and Language cs.CL; Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences; Machine Learning cs.LG; Multimedia cs.MM
|
|
URL: https://arxiv.org/abs/2204.12587 https://dx.doi.org/10.48550/arxiv.2204.12587
|
|
BASE
|
|
Hide details
|
|
72 |
Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework ...
|
|
|
|
BASE
|
|
Show details
|
|
73 |
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
74 |
3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos ...
|
|
|
|
BASE
|
|
Show details
|
|
76 |
EnvEdit: Environment Editing for Vision-and-Language Navigation ...
|
|
|
|
BASE
|
|
Show details
|
|
77 |
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
80 |
IterVM: Iterative Vision Modeling Module for Scene Text Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8... 870
|
|