Search in the Catalogues and Directories

Hits 1 – 20 of 21

1
以《Cofacts 真的假的》資料庫為基礎建立中文科學假訊息之探勘模型 ; Text Mining Model for Detecting Chinese Fake Scientific Messages based on Cofacts Open Data
BASE
2
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition ...
Yi, Cheng; Zhou, Shiyu; Xu, Bo. - : arXiv, 2021
Abstract: End-to-end models have achieved impressive results on the task of automatic speech recognition (ASR). For low-resource ASR tasks, however, labeled data can hardly satisfy the demand of end-to-end models. Self-supervised acoustic pre-training has already shown its amazing ASR performance, while the transcription is still inadequate for language modeling in end-to-end models. In this work, we fuse a pre-trained acoustic encoder (wav2vec2.0) and a pre-trained linguistic encoder (BERT) into an end-to-end ASR model. The fused model only needs to learn the transfer from speech to language during fine-tuning on limited labeled data. The length of the two modalities is matched by a monotonic attention mechanism without additional parameters. Besides, a fully connected layer is introduced for the hidden mapping between modalities. We further propose a scheduled fine-tuning strategy to preserve and utilize the text context modeling ability of the pre-trained linguistic encoder. Experiments show our effective utilizing ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://arxiv.org/abs/2101.06699
https://dx.doi.org/10.48550/arxiv.2101.06699
BASE
3
ISLab System for SMM4H Shared Task 2020 ...
BASE
4
Unsupervised pre-training for sequence to sequence speech recognition ...
Fan, Zhiyun; Zhou, Shiyu; Xu, Bo. - : arXiv, 2019
BASE
5
Unsupervised Neural Machine Translation with Weight Sharing ...
Yang, Zhen; Chen, Wei; Wang, Feng. - : arXiv, 2018
BASE
6
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese ...
Zhou, Shiyu; Dong, Linhao; Xu, Shuang. - : arXiv, 2018
BASE
7
Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages ...
Zhou, Shiyu; Xu, Shuang; Xu, Bo. - : arXiv, 2018
BASE
8
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese ...
Zhou, Shiyu; Dong, Linhao; Xu, Shuang. - : arXiv, 2018
BASE
9
Anticipatory Posturing of the Vocal Tract Reveals Dissociation of Speech Movement Plans from Linguistic Units
Tilsen, Sam; Spincemaille, Pascal; Xu, Bo. - : Public Library of Science, 2016
BASE
10
A-STAR: Toward translating Asian spoken languages
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 27 (2013) 2, 509-527
OLC Linguistik
11
From English pitch accent detection to Mandarin stress detection, where is the difference?
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 26 (2012) 3, 127-148
BLLDB
OLC Linguistik
12
Monaural speech separation based on MAXVQ and CASA for robust speech recognition
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 24 (2010) 1, 30-44
OLC Linguistik
13
Monaural speech separation based on MAXVQ and CASA for robust speech recognition
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 24 (2010) 1, 30-44
BLLDB
OLC Linguistik
14
An approach to automatic acquisition of translation templates based on phrase structure extraction and alignment
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1656-1663
BLLDB
OLC Linguistik
15
Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 6, 2014-2023
BLLDB
OLC Linguistik
16
Formant comparison between whispered and voiced vowels in Mandarin
In: Acta acustica united with Acustica. - Stuttgart : Hirzel 91 (2005) 6, 1079-1085
BLLDB
OLC Linguistik
17
Tone Modeling for Continuous Mandarin Speech Recognition
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 7 (2004) 2-3, 115-128
OLC Linguistik
18
Statistical Approach to Chinese-English Spoken-language Translation
In: http://www.colips.org/conference/iscslp2006/anthology/2000/paper/PsB2/057.pdf (2000)
BASE
19
RULE-BASED POST-PROCESSING OF PINYIN TO CHINESE CHARACTERS CONVERSION SYSTEM
In: http://www.colips.org/conference/iscslp2006/anthology/2000/paper/PSB2/106.pdf
BASE
20
ClassTriphone Acoustic Modeling Based On Decision Tree
In: http://www.colips.org/conference/iscslp2006/anthology/1998/papers/asr-a1.pdf
BASE

© 2013 - 2024 Lin|gu|is|tik