DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7 8...100
Hits 61 – 80 of 1.986

61
Filter-based Discriminative Autoencoders for Children Speech Recognition ...
BASE
Show details
62
Transducer-based language embedding for spoken language identification ...
Shen, Peng; Lu, Xugang; Kawai, Hisashi. - : arXiv, 2022
BASE
Show details
63
Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0 ...
BASE
Show details
64
Multi-sequence Intermediate Conditioning for CTC-based ASR ...
BASE
Show details
65
Code Switched and Code Mixed Speech Recognition for Indic languages ...
BASE
Show details
66
Simple and Effective Unsupervised Speech Synthesis ...
Abstract: We introduce the first unsupervised speech synthesis system based on a simple, yet effective recipe. The framework leverages recent work in unsupervised speech recognition as well as existing neural-based speech synthesis. Using only unlabeled speech audio and unlabeled text as well as a lexicon, our method enables speech synthesis without the need for a human-labeled corpus. Experiments demonstrate the unsupervised system can synthesize speech similar to a supervised counterpart in terms of naturalness and intelligibility measured by human evaluation. ... : preprint, equal contribution from first two authors ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://arxiv.org/abs/2204.02524
https://dx.doi.org/10.48550/arxiv.2204.02524
BASE
Hide details
67
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding ...
BASE
Show details
68
Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech ...
Zhang, Cong; Zeng, Huinan; Liu, Huang. - : arXiv, 2022
BASE
Show details
69
MAESTRO: Matched Speech Text Representations through Modality Matching ...
BASE
Show details
70
Improving Language Identification of Accented Speech ...
Kukk, Kunnar; Alumäe, Tanel. - : arXiv, 2022
BASE
Show details
71
Cross-stitched Multi-modal Encoders ...
BASE
Show details
72
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems ...
BASE
Show details
73
UK-South Korea Prosody Research Network ...
Jeon, Hae-Sung. - : Open Science Framework, 2022
BASE
Show details
74
Speaker Extraction with Co-Speech Gestures Cue ...
Pan, Zexu; Qian, Xinyuan; Li, Haizhou. - : arXiv, 2022
BASE
Show details
75
Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features ...
BASE
Show details
76
Cochlear Implant Results in Older Adults with Post-Lingual Deafness: The Role of “Top-Down” Neurocognitive Mechanisms
In: International Journal of Environmental Research and Public Health; Volume 19; Issue 3; Pages: 1343 (2022)
BASE
Show details
77
MLLP-VRAIN Spanish ASR Systems for the Albayzín-RTVE 2020 Speech-to-Text Challenge: Extension
In: Applied Sciences; Volume 12; Issue 2; Pages: 804 (2022)
BASE
Show details
78
On the Difference of Scoring in Speech in Babble Tests
In: Healthcare; Volume 10; Issue 3; Pages: 458 (2022)
BASE
Show details
79
An Empirical Performance Analysis of the Speak Correct Computerized Interface
In: Processes; Volume 10; Issue 3; Pages: 487 (2022)
BASE
Show details
80
DeepFry: Identifying Vocal Fry Using Deep Neural Networks ...
BASE
Show details

Page: 1 2 3 4 5 6 7 8...100

Catalogues
0
0
61
0
0
0
0
Bibliographies
297
0
0
0
0
0
44
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1.644
1
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern