Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 14 of 14

1	Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling ...
	Peng, Puyuan; Harwath, David. - : arXiv, 2022
	BASE
	Show details

2	Learning Audio-Video Language Representations
	Rouditchenko, Andrew. - : Massachusetts Institute of Technology, 2021
	BASE
	Show details

3	Cascaded Multilingual Audio-Visual Learning from Videos ...
	Rouditchenko, Andrew; Boggust, Angie; Harwath, David. - : arXiv, 2021
	BASE
	Show details

4	Text-Free Image-to-Speech Synthesis Using Learned Segmental Units ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Glass, James; Harwath, David. - : Underline Science Inc., 2021
	BASE
	Show details

5	Fast-Slow Transformer for Visually Grounding Speech ...
	Peng, Puyuan; Harwath, David. - : arXiv, 2021
	BASE
	Show details

6	Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech ...
	Harwath, David; Hsu, Wei-Ning; Glass, James. - : arXiv, 2019
	BASE
	Show details

7	Transfer Learning from Audio-Visual Grounding to Speech Recognition ...
	Hsu, Wei-Ning; Harwath, David; Glass, James. - : arXiv, 2019
	BASE
	Show details

8	Unsupervised learning of spoken language with visual context
	Harwath, David; Torralba, Antonio; Glass, James R.
	In: Neural Information Processing Systems (NIPS) (2019)
	BASE
	Show details

9	Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech ...
	Harwath, David; Chuang, Galen; Glass, James. - : arXiv, 2018
	BASE
	Show details

10	Learning spoken language through vision
	Harwath, David F. (David Frank). - : Massachusetts Institute of Technology, 2018
	BASE
	Show details

11	Learning Word-Like Units from Joint Audio-Visual Analysis ...
	Harwath, David; Glass, James R.. - : arXiv, 2017
	BASE
	Show details

12	Unsupervised modeling of latent topics and lexical units in speech audio
	Harwath, David F. (David Frank). - : Massachusetts Institute of Technology, 2013
	BASE
	Show details

13	A Summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition
	Jansen, Aren; Dupoux, Emmanuel; Seltzer, Mike. - : Piscataway, NJ : IEEE, 2013
	BASE
	Show details

14	Phonetic Landmark Detection for Automatic Language Identification
	Harwath, David. - 2010
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern