21 |
Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation ...
|
|
|
|
BASE
|
|
Show details
|
|
22 |
Can Social Robots Effectively Elicit Curiosity in STEM Topics from K-1 Students During Oral Assessments? ...
|
|
|
|
BASE
|
|
Show details
|
|
23 |
An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production ...
|
|
|
|
BASE
|
|
Show details
|
|
24 |
Expression-preserving face frontalization improves visually assisted speech processing ...
|
|
|
|
BASE
|
|
Show details
|
|
25 |
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
26 |
Multi Antenna Radar System for American Sign Language (ASL) Recognition Using Deep Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
27 |
Effect of Kinematics and Fluency in Adversarial Synthetic Data Generation for ASL Recognition with RF Sensors ...
|
|
|
|
Abstract:
RF sensors have been recently proposed as a new modality for sign language processing technology. They are non-contact, effective in the dark, and acquire a direct measurement of signing kinematic via exploitation of the micro-Doppler effect. First, this work provides an in depth, comparative examination of the kinematic properties of signing as measured by RF sensors for both fluent ASL users and hearing imitation signers. Second, as ASL recognition techniques utilizing deep learning requires a large amount of training data, this work examines the effect of signing kinematics and subject fluency on adversarial learning techniques for data synthesis. Two different approaches for the synthetic training data generation are proposed: 1) adversarial domain adaptation to minimize the differences between imitation signing and fluent signing data, and 2) kinematically-constrained generative adversarial networks for accurate synthesis of RF signing signatures. The results show that the kinematic discrepancies ...
|
|
Keyword:
FOS Electrical engineering, electronic engineering, information engineering; Signal Processing eess.SP
|
|
URL: https://dx.doi.org/10.48550/arxiv.2201.00055 https://arxiv.org/abs/2201.00055
|
|
BASE
|
|
Hide details
|
|
30 |
Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
31 |
Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training ...
|
|
|
|
BASE
|
|
Show details
|
|
32 |
Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
33 |
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge ...
|
|
|
|
BASE
|
|
Show details
|
|
34 |
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks ...
|
|
|
|
BASE
|
|
Show details
|
|
36 |
Code-Switching Text Augmentation for Multilingual Speech Processing ...
|
|
|
|
BASE
|
|
Show details
|
|
39 |
Self-supervised Learning with Random-projection Quantizer for Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|