Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5 6 7 8...870

Hits 61 – 80 of 17.396

61	Word separation in continuous sign language using isolated signs and post-processing ...
	Rastgoo, Razieh; Kiani, Kourosh; Escalera, Sergio. - : arXiv, 2022
	BASE
	Show details

62	Exploring Sub-skeleton Trajectories for Interpretable Recognition of Sign Language ...
	Gudmundsson, Joachim; Seybold, Martin P.; Pfeifer, John. - : arXiv, 2022
	BASE
	Show details

63	ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language ...
	de Amorim, Cleison Correia; Zanchettin, Cleber. - : arXiv, 2022
	BASE
	Show details

64	TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials ...
	Sanalohit, Jinnavat; Katanyukul, Tatpong. - : arXiv, 2022
	BASE
	Show details

65	Sign Language Video Retrieval with Free-Form Textual Queries ...
	Duarte, Amanda; Albanie, Samuel; Giró-i-Nieto, Xavier. - : arXiv, 2022
	BASE
	Show details

66	All You Need In Sign Language Production ...
	Rastgoo, Razieh; Kiani, Kourosh; Escalera, Sergio. - : arXiv, 2022
	BASE
	Show details

67	Towards Zero-shot Sign Language Recognition ...
	Bilge, Yunus Can; Cinbis, Ramazan Gokberk; Ikizler-Cinbis, Nazli. - : arXiv, 2022
	BASE
	Show details

68	Sign Language Recognition System using TensorFlow Object Detection API ...
	Srivastava, Sharvani; Gangwar, Amisha; Mishra, Richa. - : arXiv, 2022
	BASE
	Show details

69	Τρισδιάστατη ανακατασκευή ανθρωπίνου σώματος, χεριών και προσώπου με εφαρμογές στην αναγνώριση νοηματικής γλώσσας ...
	Kratimenos, Angelos. - : National Technological University of Athens, 2022
	BASE
	Show details

70	Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation ...
	Chang, Xiaoguang; Wang, Teng; Sun, Changyin. - : arXiv, 2022
	BASE
	Show details

71	hate-alert@DravidianLangTech-ACL2022: Ensembling Multi-Modalities for Tamil TrollMeme Classification ...
	Das, Mithun; Banerjee, Somnath; Mukherjee, Animesh. - : arXiv, 2022
	BASE
	Show details

72	Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework ...
	Gu, Jiaxi; Meng, Xiaojun; Lu, Guansong. - : arXiv, 2022
	BASE
	Show details

73	SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition ...
	Huang, Mingxin; Liu, Yuliang; Peng, Zhenghao. - : arXiv, 2022
	BASE
	Show details

74	3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos ...
	Gupta, Vikram; Mittal, Trisha; Mathur, Puneet. - : arXiv, 2022
	BASE
	Show details

75	Taking an Emotional Look at Video Paragraph Captioning ...
	Li, Qinyu; Li, Tengpeng; Wang, Hanli; Chen, Chang Wen. - : arXiv, 2022
	Abstract: Translating visual data into natural language is essential for machines to understand the world and interact with humans. In this work, a comprehensive study is conducted on video paragraph captioning, with the goal to generate paragraph-level descriptions for a given video. However, current researches mainly focus on detecting objective facts, ignoring the needs to establish the logical associations between sentences and to discover more accurate emotions related to video contents. Such a problem impairs fluent and abundant expressions of predicted captions, which are far below human language tandards. To solve this problem, we propose to construct a large-scale emotion and logic driven multilingual dataset for this task. This dataset is named EMVPC (standing for "Emotional Video Paragraph Captioning") and contains 53 widely-used emotions in daily life, 376 common scenes corresponding to these emotions, 10,291 high-quality videos and 20,582 elaborated paragraph captions with English and Chinese versions. ...
	Keyword: Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences
	URL: https://dx.doi.org/10.48550/arxiv.2203.06356 https://arxiv.org/abs/2203.06356
	BASE
	Hide details

76	EnvEdit: Environment Editing for Vision-and-Language Navigation ...
	Li, Jialu; Tan, Hao; Bansal, Mohit. - : arXiv, 2022
	BASE
	Show details

77	IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages ...
	Bugliarello, Emanuele; Liu, Fangyu; Pfeiffer, Jonas. - : arXiv, 2022
	BASE
	Show details

78	Natural Language Descriptions of Deep Visual Features ...
	Hernandez, Evan; Schwettmann, Sarah; Bau, David. - : arXiv, 2022
	BASE
	Show details

79	Finding Structural Knowledge in Multimodal-BERT ...
	Milewski, Victor; de Lhoneux, Miryam; Moens, Marie-Francine. - : arXiv, 2022
	BASE
	Show details

80	IterVM: Iterative Vision Modeling Module for Scene Text Recognition ...
	Chu, Xiaojie; Wang, Yongtao. - : arXiv, 2022
	BASE
	Show details

Page: 1 2 3 4 5 6 7 8...870

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern