
Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Asynchronous Speech Recognition Affects Physician Editing of Notes
Lybarger, Kevin J.; Ostendorf, Mari; Riskin, Eve. Georg Thieme Verlag KG, 2018
BASE
2
Low-Rank RNN Adaptation for Context-Aware Language Modeling
Jaech, Aaron. 2018
Abstract: Thesis (Ph.D.)--University of Washington, 2018. A long-standing weakness of statistical language models is that their performance degrades drastically when they are used on data that varies even slightly from the data on which they were trained. In practice, applications require adaptation methods that adjust the model's predictions to match the local context. For instance, in a speech recognition application, a single static language model cannot handle all the different ways people speak to their voice assistants, such as when selecting music or sending a message to a friend. An adapted model conditions its predictions on knowledge of who is speaking and what task they are trying to do. The current standard approach to recurrent neural network language model adaptation is to apply a simple linear shift to the recurrent and/or output layer bias vector. Although this is helpful, it does not go far enough. This thesis introduces a new approach to adaptation, which we call the FactorCell, that generates a custom recurrent network for each context by applying a low-rank transformation. The FactorCell allows a more substantial change to the recurrent layer weights. Unlike previous approaches, the introduction of a rank hyperparameter gives control over how different or similar the adapted models should be. In our experiments on several datasets and multiple types of context, the increased adaptation of the recurrent layer is always helpful, as measured by perplexity, the standard metric for evaluating language models. We also demonstrate the impact on two applications, personalized query completion and context-specific text generation, and find that the enhanced adaptation benefits both. We further show that the FactorCell provides a more effective text classification model; more importantly, the classification results reveal differences between the models that are not captured by perplexity. The classification metric is particularly relevant for the text generation application.
Keywords: Computer science; Electrical engineering; language modeling; natural language processing; Statistics
URL: http://hdl.handle.net/1773/42292
BASE
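To make the adaptation idea in the abstract concrete, the following is a minimal numpy sketch of a low-rank, context-dependent update to a shared recurrent weight matrix. The dimensions, variable names (W_base, Z_left, Z_right), and the plain tanh cell are illustrative assumptions, not the thesis's actual FactorCell parameterization; the sketch only shows how a context embedding can generate a rank-r weight update rather than a simple bias shift.

import numpy as np

# All sizes below are illustrative assumptions, not values from the thesis.
embed_dim = 32     # word embedding size
hidden_dim = 64    # recurrent state size
context_dim = 8    # context embedding size
rank = 4           # rank hyperparameter controlling how far adapted weights may move

rng = np.random.default_rng(0)

# Shared base recurrent weights, as in an ordinary RNN cell:
# they map the concatenated [input, previous state] to the new state.
W_base = rng.normal(scale=0.1, size=(embed_dim + hidden_dim, hidden_dim))

# Basis tensors that turn a context embedding into low-rank factors.
Z_left = rng.normal(scale=0.1, size=(context_dim, embed_dim + hidden_dim, rank))
Z_right = rng.normal(scale=0.1, size=(context_dim, rank, hidden_dim))

def adapted_weights(context):
    """Generate context-specific recurrent weights: the context embedding
    selects a rank-`rank` update that is added to the shared weights, so
    each context effectively gets its own recurrent cell."""
    left = np.tensordot(context, Z_left, axes=1)    # (embed_dim + hidden_dim, rank)
    right = np.tensordot(context, Z_right, axes=1)  # (rank, hidden_dim)
    return W_base + left @ right

def rnn_step(x, h, W):
    """One plain tanh RNN step with the given recurrent weights."""
    return np.tanh(np.concatenate([x, h]) @ W)

# Two different contexts yield two different effective cells for the same input.
context_a = rng.normal(size=context_dim)
context_b = rng.normal(size=context_dim)
x = rng.normal(size=embed_dim)
h0 = np.zeros(hidden_dim)
h_a = rnn_step(x, h0, adapted_weights(context_a))
h_b = rnn_step(x, h0, adapted_weights(context_b))

With rank set to 0 the update vanishes and the shared model is recovered, while larger ranks let the adapted weights diverge further from the base model, which is the control the abstract attributes to the rank hyperparameter.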

Catalogues: 0; Bibliographies: 0; Linked Open Data catalogues: 0; Online resources: 0; Open access documents: 2