
Search in the Catalogues and Directories

Hits 21 – 40 of 989

21
Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation ...
BASE
22
Can Social Robots Effectively Elicit Curiosity in STEM Topics from K-1 Students During Oral Assessments? ...
BASE
23
An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production ...
Abstract: The best performance in air-tissue boundary (ATB) segmentation of real-time magnetic resonance imaging (rtMRI) videos of speech production is known to be achieved by a 3-dimensional convolutional neural network (3D-CNN) model. However, this model, like other ATB segmentation techniques reported in the literature, is evaluated using the Dynamic Time Warping (DTW) distance between the entire original and predicted contours. Such an evaluation measure may not capture local errors in the predicted contour. Careful analysis of predicted contours reveals errors in regions such as the velum part of contour1 (the ATB comprising the upper lip, hard palate, and velum) and the tongue base section of contour2 (the ATB covering the jawline, lower lip, tongue base, and epiglottis), which a global evaluation metric like DTW distance does not capture. In this work, we automatically detect such errors and propose a correction scheme for them. We also propose two new evaluation metrics for ATB segmentation separately in ... : accepted for ICASSP 2022 ...
Keyword: Audio and Speech Processing eess.AS; Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering
URL: https://dx.doi.org/10.48550/arxiv.2203.06004
https://arxiv.org/abs/2203.06004
BASE
24
Expression-preserving face frontalization improves visually assisted speech processing ...
BASE
25
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition ...
BASE
26
Multi Antenna Radar System for American Sign Language (ASL) Recognition Using Deep Learning ...
BASE
27
Effect of Kinematics and Fluency in Adversarial Synthetic Data Generation for ASL Recognition with RF Sensors ...
BASE
28
A Hierarchical Model for Spoken Language Recognition ...
BASE
29
Language vs Speaker Change: A Comparative Study ...
BASE
30
Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition ...
BASE
31
Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training ...
Yang, J.; He, Lei. - : arXiv, 2022
BASE
32
Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition ...
BASE
33
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge ...
BASE
34
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks ...
BASE
35
Multilingual Simultaneous Speech Translation ...
BASE
36
Code-Switching Text Augmentation for Multilingual Speech Processing ...
BASE
37
The 2021 NIST Speaker Recognition Evaluation ...
BASE
38
Multilingual and Multimodal Abuse Detection ...
BASE
39
Self-supervised Learning with Random-projection Quantizer for Speech Recognition ...
BASE
40
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian ...
BASE


© 2013 - 2024 Lin|gu|is|tik