Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4

Hits 1 – 20 of 74

1	Semi-supervised cycle-consistency training for end-to-end ASR using unpaired speech
	Wu, Ningkai. - 2022
	BASE
	Show details

2	Improving multilingual speech recognition systems
	Gao, Heting. - 2021
	BASE
	Show details

3	Enforcing constraints for multi-lingual and cross-lingual speech-to-text systems
	Ni, Junrui. - 2021
	BASE
	Show details

4	Knowledge base integration in biomedical natural language processing applications
	Sakakini, Tarek. - 2021
	BASE
	Show details

5	Learning speech embeddings for speaker adaptation and speech understanding
	Sari, Leda. - 2021
	BASE
	Show details

6	Modeling phones, keywords, topics and intents in spoken languages
	Chen, Wenda. - 2021
	BASE
	Show details

7	Speech technology for unwritten languages
	Scharenborg, Odette; Besacier, Laurent; Black, Alan...
	In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-02480675 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩ (2020)
	BASE
	Show details

8	How Phonotactics Affect Multilingual and Zero-shot ASR Performance ...
	Feng, Siyuan; Żelasko, Piotr; Moro-Velázquez, Laureano. - : arXiv, 2020
	BASE
	Show details

9	That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages ...
	Żelasko, Piotr; Moro-Velázquez, Laureano; Hasegawa-Johnson, Mark. - : arXiv, 2020
	BASE
	Show details

10	Autosegmental Neural Nets: Should Phones and Tones be Synchronous or Asynchronous? ...
	Li, Jialu; Hasegawa-Johnson, Mark. - : arXiv, 2020
	BASE
	Show details

11	Identify Speakers in Cocktail Parties with End-to-End Attention ...
	Zhu, Junzhe; Hasegawa-Johnson, Mark; Sari, Leda. - : arXiv, 2020
	BASE
	Show details

12	Acoustic event, spoken keyword and emotional outburst detection
	Xu, Yijia. - 2019
	BASE
	Show details

13	Automatic speech recognition for low-resource languages and dialects
	Hoffer-Sohn, Yda. - 2019
	BASE
	Show details

14	A sensorimotor basis of speech communication
	Bryan, Jacob. - 2019
	BASE
	Show details

15	The benefits of acoustic perceptual information for speech processing systems
	He, Di. - 2019
	Abstract: The frame-synchronized framework has dominated many speech processing systems, such as ASR and AED targeting human speech activities. These systems have little consideration for the science behind speech and treat the task as a simple statistical classification. The framework also assumes each feature vector to be equally important to the task. However, through some preliminary experiments, this study has found evidence that some concepts defined in speech perception theories such as auditory roughness and acoustic landmarks can act as heuristics to these systems and benefit them in multiple ways. Findings of acoustic landmarks hint that the idea of treating each frame equally might not be optimal. In some cases, landmark information can improve system accuracy through highlighting the more significant frames, or improve the acoustic model accuracy by training through MTL. Further investigation into the topic found experimental evidence suggesting that acoustic landmark information can also benefit end-to-end acoustic models trained through CTC loss. With the help of acoustic landmarks, CTC models can converge with less training data and achieve lower error rate. For the first time, positive results were collected on a mid-size ASR corpus (WSJ) for acoustic landmarks. The results indicate that audio perception information can benefit a broad range of audio processing systems.
	Keyword: Acoustic Landmark; AED; ASR; Auditory Roughness; CTC; FPGA; IoT; MTL
	URL: http://hdl.handle.net/2142/104888
	BASE
	Hide details

16	Dealing with linguistic mismatches for automatic speech recognition
	Yang, Xuesong. - 2019
	BASE
	Show details

17	Modeling DNN as human learner
	Ni, Junrui. - 2019
	BASE
	Show details

18	Bayesian models for unit discovery on a very low resource language
	Ondel, Lucas; Godard, Pierre; Besacier, Laurent...
	In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-01709589 ; IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2018, Calgary, Alberta, Canada (2018)
	BASE
	Show details

19	Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking rosetta” JSALT 2017 workshop
	Scharenborg, Odette; Besacier, Laurent; Black, Alan...
	In: ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01709578 ; ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada (2018)
	BASE
	Show details

20	Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop ...
	Scharenborg, Odette; Besacier, Laurent; Black, Alan. - : arXiv, 2018
	BASE
	Show details

Page: 1 2 3 4

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern