Hits 1 – 20 of 20

1	Unsupervised learning of spoken language with visual context
	Harwath, David; Torralba, Antonio; Glass, James R.
	In: Neural Information Processing Systems (NIPS) (2019)
	Abstract: Humans learn to speak before they can read or write, so why can't computers do the same? In this paper, we present a deep neural network model capable of rudimentary spoken language acquisition using untranscribed audio training data, whose only supervision comes in the form of contextually relevant visual images. We describe the collection of our data comprised of over 120,000 spoken audio captions for the Places image dataset and evaluate our model on an image search and annotation task. We also provide some visualizations which suggest that our model is learning to recognize meaningful words within the caption spectrograms.
	URL: https://hdl.handle.net/1721.1/124455
	BASE

2	Learning Word-Like Units from Joint Audio-Visual Analysis ...
	Harwath, David; Glass, James R. - arXiv, 2017
	BASE

3	Unsupervised Lexicon Discovery from Acoustic Input
	Lee, Chia-ying; O'Donnell, Timothy John; Glass, James R.
	In: Transactions of the Association for Computational Linguistics (2015)
	BASE

4	Learning lexicons from speech using a pronunciation mixture model
	McGraw, Ian; Badr, Ibrahim; Glass, James R.
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 21 (2013) 2, 357-366
	OLC Linguistik

5	Combining missing-feature theory, speech enhancement, and speaker-dependent/-independent modeling for speech separation
	Ming, Ji; Hazen, Timothy J.; Glass, James R.
	In: Computer speech and language. - Amsterdam [et al.] : Elsevier 24 (2010) 1, 67-76
	OLC Linguistik

6	Combining missing-feature theory, speech enhancement, and speaker-dependent/-independent modeling for speech separation
	Glass, James R.; Hazen, Timothy J.; Ming, Ji
	In: Computer speech and language. - Amsterdam [et al.] : Elsevier 24 (2010) 1, 67-76
	BLLDB
	OLC Linguistik

7	Speech rhythm guided syllable nuclei detection
	Glass, James R.; Zhang, Yaodong
	In: IEEE (2009)
	BASE

8	On the phonetic information in ultrasonic microphone signals
	Glass, James R.; Zhu, Bo; Livescu, Karen
	In: IEEE (2009)
	BASE

9	Multistream Articulatory Feature-Based Models for Visual Speech Recognition
	Glass, James R.; Saenko, Ekaterina; Livescu, Karen...
	In: IEEE (2009)
	BASE

10	Research Developments and Directions in Speech Recognition and Understanding, Part 1
	Baker, Janet M.; Glass, James R.; Khudanpur, Sanjeev...
	In: IEEE (2009)
	BASE

11	Unsupervised pattern discovery in speech
	Glass, James R.; Park, Alex S.
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 16 (2008) 1, 186-197
	BLLDB
	OLC Linguistik

12	An implementation of rational wavelets and filter design for phonetic classification
	Choueiter, Ghinwa F.; Glass, James R.
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 3, 939-948
	BLLDB

13	Robust speaker recognition in noisy conditions
	Glass, James R.; Hazen, Timothy J.; Reynolds, Douglas A....
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 5, 1711-1723
	BLLDB
	OLC Linguistik

14	Mathematical foundations of speech and language processing
	Weber, Katrin (contrib.); Kedem, Benjamin (contrib.); Khudanpur, Sanjeev (ed.). - New York [et al.] : Springer, 2004
	BLLDB
	UB Frankfurt Linguistik

15	A probabilistic framework for segment-based speech recognition
	Glass, James R.
	In: Computer speech and language. - Amsterdam [et al.] : Elsevier 17 (2003) 2, 137-152
	OLC Linguistik

16	New computational paradigms for acoustic modeling in speech recognition
	Russell, Martin J. (ed.); Bilmes, Jeff A. (ed.); Lefevre, Fabrice (contrib.)...
	In: Computer speech and language. - Amsterdam [et al.] : Elsevier 17 (2003) 2-3, 107-305
	BLLDB

17	Finding Acoustic Regularities in Speech: Applications to Phonetic Recognition
	Glass, James R.
	In: DTIC AND NTIS (1988)
	BASE

18	Finding Acoustic Regularities in Speech: Applications to Phonetic Recognition.
	Glass, James R.
	In: DTIC AND NTIS (1988)
	BASE

19	Speech Communication
	Stevens, Kenneth N.; Allen, Jonathan; Halle, Morris. - Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT), 1987
	BASE

20	Speech Communication
	Stevens, Kenneth N.; Allen, Jonathan; Halle, Morris. - Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT), 1987
	BASE
