Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year
- Medium:
  - Online (989)
- Type
- BLLDB-Access:
  - free (989)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5 6...50

Hits 21 – 40 of 989

21	Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation ...
	Georges, Marc-Antoine; Diard, Julien; Girin, Laurent. - : arXiv, 2022
	BASE
	Show details

22	Can Social Robots Effectively Elicit Curiosity in STEM Topics from K-1 Students During Oral Assessments? ...
	Johnson, Alexander; Martin, Alejandra; Quintero, Marlen. - : arXiv, 2022
	BASE
	Show details

23	An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production ...
	Roy, Anwesha; Belagali, Varun; Ghosh, Prasanta Kumar. - : arXiv, 2022
	Abstract: The best performance in Air-tissue boundary (ATB) segmentation of real-time Magnetic Resonance Imaging (rtMRI) videos in speech production is known to be achieved by a 3-dimensional convolutional neural network (3D-CNN) model. However, the evaluation of this model, as well as other ATB segmentation techniques reported in the literature, is done using Dynamic Time Warping (DTW) distance between the entire original and predicted contours. Such an evaluation measure may not capture local errors in the predicted contour. Careful analysis of predicted contours reveals errors in regions like the velum part of contour1 (ATB comprising of upper lip, hard palate, and velum) and tongue base section of contour2 (ATB covering jawline, lower lip, tongue base, and epiglottis), which are not captured in a global evaluation metric like DTW distance. In this work, we automatically detect such errors and propose a correction scheme for the same. We also propose two new evaluation metrics for ATB segmentation separately in ... : accepted for ICASSP 2022 ...
	Keyword: Audio and Speech Processing eess.AS; Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering
	URL: https://dx.doi.org/10.48550/arxiv.2203.06004 https://arxiv.org/abs/2203.06004
	BASE
	Hide details

24	Expression-preserving face frontalization improves visually assisted speech processing ...
	Kang, Zhiqi; Sadeghi, Mostafa; Horaud, Radu. - : arXiv, 2022
	BASE
	Show details

25	Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition ...
	Soleymanpour, Mohammad; Johnson, Michael T.; Soleymanpour, Rahim. - : arXiv, 2022
	BASE
	Show details

26	Multi Antenna Radar System for American Sign Language (ASL) Recognition Using Deep Learning ...
	MacLaughlin, Gavin; Malcolm, Jack; Hamza, Syed Ali. - : arXiv, 2022
	BASE
	Show details

27	Effect of Kinematics and Fluency in Adversarial Synthetic Data Generation for ASL Recognition with RF Sensors ...
	Rahman, M. M.; Malaia, E.; Gurbuz, A. C.. - : arXiv, 2022
	BASE
	Show details

28	A Hierarchical Model for Spoken Language Recognition ...
	Ferrer, Luciana; Castan, Diego; McLaren, Mitchell. - : arXiv, 2022
	BASE
	Show details

29	Language vs Speaker Change: A Comparative Study ...
	Mishra, Jagabandhu; Prasanna, S. R. Mahadeva. - : arXiv, 2022
	BASE
	Show details

30	Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition ...
	Liu, Qianying; Yang, Yuhang; Gong, Zhuo. - : arXiv, 2022
	BASE
	Show details

31	Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training ...
	Yang, J.; He, Lei. - : arXiv, 2022
	BASE
	Show details

32	Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition ...
	Hernandez, Abner; Pérez-Toro, Paula Andrea; Nöth, Elmar. - : arXiv, 2022
	BASE
	Show details

33	VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge ...
	Brown, Andrew; Huh, Jaesung; Chung, Joon Son. - : arXiv, 2022
	BASE
	Show details

34	Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks ...
	Lu, Yizhou; Huang, Mingkun; Qu, Xinghua. - : arXiv, 2022
	BASE
	Show details

35	Multilingual Simultaneous Speech Translation ...
	Subramanya, Shashank; Niehues, Jan. - : arXiv, 2022
	BASE
	Show details

36	Code-Switching Text Augmentation for Multilingual Speech Processing ...
	Hussein, Amir; Chowdhury, Shammur Absar; Abdelali, Ahmed. - : arXiv, 2022
	BASE
	Show details

37	The 2021 NIST Speaker Recognition Evaluation ...
	Sadjadi, Seyed Omid; Greenberg, Craig; Singer, Elliot. - : arXiv, 2022
	BASE
	Show details

38	Multilingual and Multimodal Abuse Detection ...
	Sharon, Rini; Shah, Heet; Mukherjee, Debdoot. - : arXiv, 2022
	BASE
	Show details

39	Self-supervised Learning with Random-projection Quantizer for Speech Recognition ...
	Chiu, Chung-Cheng; Qin, James; Zhang, Yu. - : arXiv, 2022
	BASE
	Show details

40	BEA-Base: A Benchmark for ASR of Spontaneous Hungarian ...
	Mihajlik, P.; Balog, A.; Gráczi, T. E.. - : arXiv, 2022
	BASE
	Show details

Page: 1 2 3 4 5 6...50

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern