Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2021 (62)
- Medium:
  - Online (61)
  - Print (1)
- Type
- BLLDB-Access:
  - free (62)
  - subject to license (3)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4

Hits 1 – 20 of 62

1	Applying phonetics : speech science in everyday life
	Munro, Murray J.. - Chichester, West Sussex : Wiley Blackwell, 2021
	BLLDB
	UB Frankfurt Linguistik
	Show details

2	Automatic Speech Recognition systems errors for accident-prone sleepiness detection through voice
	Martin, Vincent,; Rouas, Jean-Luc; Boyer, Florian...
	In: EUSIPCO 2021 ; https://hal.archives-ouvertes.fr/hal-03324033 ; EUSIPCO 2021, Aug 2021, Dublin (en ligne), Ireland. ⟨10.23919/EUSIPCO54536.2021.9616299⟩ (2021)
	BASE
	Show details

3	Automatic Speech Recognition systems errors for objective sleepiness detection through voice
	Martin, Vincent,; Rouas, Jean-Luc; Boyer, Florian...
	In: Proceedings Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03328827 ; Interspeech 2021, Aug 2021, Brno (virtual), Czech Republic. pp.2476-2480, ⟨10.21437/Interspeech.2021-291⟩ (2021)
	BASE
	Show details

4	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

5	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

6	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

7	Re-synchronization using the Hand Preceding Model for Multi-modal Fusion in Automatic Continuous Cued Speech Recognition
	Liu, Li; Feng, Gang; Beautemps, Denis...
	In: ISSN: 1520-9210 ; IEEE Transactions on Multimedia ; https://hal.archives-ouvertes.fr/hal-02433830 ; IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2021, 23, pp.292-305. ⟨10.1109/TMM.2020.2976493⟩ (2021)
	BASE
	Show details

8	Identifying Speaker State from Multimodal Cues
	Yang, Zixiaofan. - 2021
	BASE
	Show details

9	Unsupervised Morphological Segmentation and Part-of-Speech Tagging for Low-Resource Scenarios
	Eskander, Ramy. - 2021
	BASE
	Show details

10	Recognizing lexical units in low-resource language contexts with supervised and unsupervised neural networks
	MACAIRE, Cécile
	In: https://hal.archives-ouvertes.fr/hal-03429051 ; [Research Report] LACITO (UMR 7107). 2021 (2021)
	Abstract: Automatic Speech Recognition (ASR) has made significant progress thanks to the advent of deep neural networks (DNNs). In the context of under-resourced languages, for which few resources are available, spectacular achievements has been reported. ASR systems are a step forward for language documentation, as the annotation cost is considerably reduced for field linguists (manually annotated an audio file can take a tremendous amount of time), and the language is preserved and perpetuated through documentation. Previous `standard' deep neural networks reached very good performances for phonemic transcription (such as with Kaldi and ESPnet approaches).However, these methods only rely on the phoneme-level. In this thesis, we explore recently published ASR approaches which have shown to be effective on low-resource languages to produce word-level audio-aligned transcriptions. The first approach, based on self-supervised learning, is a speech model that uses a Connectionist Temporal Classification (CTC). The second, entitled wav2vec-U, proposes a framework intended to build an ASR system in a fully unsupervised fashion. With few resources at our disposal, we try to assess the usability that can be made from dictionaries. We conducted experiments on two low-resource corpora, the Yongning Na and the Japhug from the Pangloss Collection. The experimental results from the first approach demonstrate powerful word-level transcriptions with competitive error rates. Preliminary results are reported on the second approach. By a coverage measure of dictionaries on the available transcriptions, we show that these resources are not yet usable in the conducted approaches.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Automatic Speech Recognition ASR; deep learning; Machine learning; Neural networks
	URL: https://hal.archives-ouvertes.fr/hal-03429051/file/Macaire2021_RecognizingLexicalUnits.pdf https://hal.archives-ouvertes.fr/hal-03429051/document https://hal.archives-ouvertes.fr/hal-03429051
	BASE
	Hide details

11	Speech Normalization and Data Augmentation Techniques Based on Acoustical and Physiological Constraints and Their Applications to Child Speech Recognition
	Yeung, Gary Joseph. - : eScholarship, University of California, 2021
	BASE
	Show details

12	Large vocabulary automatic speech recognition: from hybrid to end-to-end approaches ; Reconnaissance automatique de la parole à large vocabulaire : des approches hybrides aux approches End-to-End
	Heba, Abdelwahab. - : HAL CCSD, 2021
	In: https://hal.archives-ouvertes.fr/tel-03269807 ; Son [cs.SD]. Université toulouse 3 Paul Sabatier, 2021. Français (2021)
	BASE
	Show details

13	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

14	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

15	The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
	BASE
	Show details

16	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

17	The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
	BASE
	Show details

18	Enhancing Speech Privacy with Slicing
	Maouche, Mohamed; Srivastava, Brij Mohan Lal; Vauquier, Nathalie...
	In: https://hal.inria.fr/hal-03369137 ; 2021 (2021)
	BASE
	Show details

19	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

20	Training RNN Language Models on Uncertain ASR Hypotheses in Limited Data Scenarios
	Sheikh, Imran,; Vincent, Emmanuel; Illina, Irina
	In: https://hal.inria.fr/hal-03327306 ; 2021 (2021)
	BASE
	Show details

Page: 1 2 3 4

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern