Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...212

Hits 1 – 20 of 4.228

1	The Impact of Removing Head Movements on Audio-visual Speech Enhancement
	Kang, Zhiqi; Sadeghi, Mostafa; Horaud, Radu...
	In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
	BASE
	Show details

2	An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
	Dey, Spandan; Sahidullah, Md; Saha, Goutam
	In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
	BASE
	Show details

3	BBC-Oxford British Sign Language Dataset
	Albanie, Samuel; Varol, Gül; Momeni, Liliane...
	In: https://hal.archives-ouvertes.fr/hal-03516444 ; 2022 (2022)
	BASE
	Show details

4	Can machines learn to see without visual databases?
	Betti, Alessandro; Gori, Marco; Melacci, Stefano...
	In: https://hal.archives-ouvertes.fr/hal-03526569 ; 2022 (2022)
	BASE
	Show details

5	Large-scale Bilingual Language-Image Contrastive Learning ...
	Ko, Byungsoo; Gu, Geonmo. - : arXiv, 2022
	BASE
	Show details

6	Bridging Video-text Retrieval with Multiple Choice Questions ...
	Ge, Yuying; Ge, Yixiao; Liu, Xihui. - : arXiv, 2022
	BASE
	Show details

7	DanFEVER: claim verification dataset for Danish ...
	Nørregaard, Jeppe; Derczynski, Leon. - : figshare, 2022
	BASE
	Show details

8	DanFEVER: claim verification dataset for Danish ...
	Nørregaard, Jeppe; Derczynski, Leon. - : figshare, 2022
	BASE
	Show details

9	Towards a Perceptual Model for Estimating the Quality of Visual Speech ...
	Aldeneh, Zakaria; Fedzechkina, Masha; Seto, Skyler. - : arXiv, 2022
	BASE
	Show details

10	An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production ...
	Roy, Anwesha; Belagali, Varun; Ghosh, Prasanta Kumar. - : arXiv, 2022
	Abstract: The best performance in Air-tissue boundary (ATB) segmentation of real-time Magnetic Resonance Imaging (rtMRI) videos in speech production is known to be achieved by a 3-dimensional convolutional neural network (3D-CNN) model. However, the evaluation of this model, as well as other ATB segmentation techniques reported in the literature, is done using Dynamic Time Warping (DTW) distance between the entire original and predicted contours. Such an evaluation measure may not capture local errors in the predicted contour. Careful analysis of predicted contours reveals errors in regions like the velum part of contour1 (ATB comprising of upper lip, hard palate, and velum) and tongue base section of contour2 (ATB covering jawline, lower lip, tongue base, and epiglottis), which are not captured in a global evaluation metric like DTW distance. In this work, we automatically detect such errors and propose a correction scheme for the same. We also propose two new evaluation metrics for ATB segmentation separately in ... : accepted for ICASSP 2022 ...
	Keyword: Audio and Speech Processing eess.AS; Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering
	URL: https://dx.doi.org/10.48550/arxiv.2203.06004 https://arxiv.org/abs/2203.06004
	BASE
	Hide details

11	Expression-preserving face frontalization improves visually assisted speech processing ...
	Kang, Zhiqi; Sadeghi, Mostafa; Horaud, Radu. - : arXiv, 2022
	BASE
	Show details

12	WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language ...
	Tavella, Federico; Schlegel, Viktor; Romeo, Marta. - : arXiv, 2022
	BASE
	Show details

13	Modeling Intensification for Sign Language Generation: A Computational Approach ...
	İnan, Mert; Zhong, Yang; Hassan, Sabit. - : arXiv, 2022
	BASE
	Show details

14	Keypoint based Sign Language Translation without Glosses ...
	Kim, Youngmin; Kwak, Minji; Lee, Dain. - : arXiv, 2022
	BASE
	Show details

15	A Transformer-Based Contrastive Learning Approach for Few-Shot Sign Language Recognition ...
	Ferreira, Silvan; Costa, Esdras; Dahia, Márcio. - : arXiv, 2022
	BASE
	Show details

16	A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation ...
	Chen, Yutong; Wei, Fangyun; Sun, Xiao. - : arXiv, 2022
	BASE
	Show details

17	Including Facial Expressions in Contextual Embeddings for Sign Language Generation ...
	Viegas, Carla; İnan, Mert; Quandt, Lorna. - : arXiv, 2022
	BASE
	Show details

18	Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production ...
	Saunders, Ben; Camgoz, Necati Cihan; Bowden, Richard. - : arXiv, 2022
	BASE
	Show details

19	Statistical and Spatio-temporal Hand Gesture Features for Sign Language Recognition using the Leap Motion Sensor ...
	Bird, Jordan J.. - : arXiv, 2022
	BASE
	Show details

20	Multi-View Spatial-Temporal Network for Continuous Sign Language Recognition ...
	Li, Ronghui; Meng, Lu. - : arXiv, 2022
	BASE
	Show details

Page: 1 2 3 4 5...212

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern