1 |
Analyzing group behavior from language use with natural language processing and experimental methods : three applications in political science and sociology
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Universal quantification in the nominal domain in American Sign Language
|
|
|
|
BASE
|
|
Show details
|
|
3 |
An investigation of projection and temporal reference in Kaqchikel
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Reproducible Research in Linguistics: A Position Statement on Data Citation and Attribution in Our Field
|
|
|
|
In: Linguistics, 2017. Berlin, Germany: Walter de Gruyter (2017)
|
|
BASE
|
|
Show details
|
|
5 |
"Cause" and affect : evaluative and emotive parameters of meaning among the periphrastic causative verb in English
|
|
|
|
BASE
|
|
Show details
|
|
7 |
When Small Words Foretell Academic Success: The Case of College Admissions Essays
|
|
|
|
BASE
|
|
Show details
|
|
12 |
The French c'est-cleft : empirical studies of its meaning and use
|
|
|
|
BASE
|
|
Show details
|
|
14 |
On the form and meaning of Chinese bare conditionals : not just "whatever"
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Unsupervised partial parsing
|
|
|
|
Abstract:
text ; The subject matter of this thesis is the problem of learning to discover grammatical structure from raw text alone, without access to explicit instruction or annotation -- in particular, by a computer or computational process -- in other words, unsupervised parser induction, or simply, unsupervised parsing. This work presents a method for raw text unsupervised parsing that is simple, but nevertheless achieves state-of-the-art results on treebank-based direct evaluation. The approach to unsupervised parsing presented in this dissertation adopts a different way to constrain learned models than has been deployed in previous work. Specifically, I focus on a sub-task of full unsupervised partial parsing called unsupervised partial parsing. In essence, the strategy is to learn to segment a string of tokens into a set of non-overlapping constituents or chunks which may be one or more tokens in length. This strategy has a number of advantages: it is fast and scalable, based on well-understood and extensible natural language processing techniques, and it produces predictions about human language structure which are useful for human language technologies. The models developed for unsupervised partial parsing recover base noun phrases and local constituent structure with high accuracy compared to strong baselines. Finally, these models may be applied in a cascaded fashion for the prediction of full constituent trees: first segmenting a string of tokens into local phrases, then re-segmenting to predict higher-level constituent structure. This simple strategy leads to an unsupervised parsing model which produces state-of-the-art results for constituent parsing of English, German and Chinese. This thesis presents, evaluates and explores these models and strategies. ; Linguistics
|
|
Keyword:
Chunking; Computational linguistics; Natural language processing; Parsing; Text processing; Unsupervised
|
|
URL: http://hdl.handle.net/2152/ETD-UT-2011-08-3991
|
|
BASE
|
|
Hide details
|
|
|
|