DE eng

Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
An Experimental Investigation of Complexity-Based Ordering
Teddiman, Laura G. H.. - : University of Alberta. Department of Linguistics., 2012
BASE
Show details
2
Valence-Increasing Morphology in Temne
Kanu, Sullay Mohamed. - : University of Alberta. Department of Linguistics., 2012
BASE
Show details
3
An Experimental Investigation of Complexity-Based Ordering
Teddiman, Laura G. H.. - : University of Alberta. Department of Linguistics., 2012
BASE
Show details
4
Valence-Increasing Morphology in Temne
Kanu, Sullay Mohamed. - : University of Alberta. Department of Linguistics., 2012
BASE
Show details
5
Indexing and Querying Natural Language Text
Chubak, Pirooz. - : University of Alberta. Department of Computing Science., 2012
Abstract: Degree: Doctor of Philosophy ; Abstract: Natural language text is a prominent source of representing and communicating information and knowledge. It is often desirable to search in granularities of text that are smaller than a document or to query the syntactic roles and relationships within syntactically annotated text sentences, often represented by parse trees. In this thesis, we study the problems of efficiently indexing and querying natural language text in the scenarios where (1) text is modelled as flat sequences of words and (2) text is modelled as collections of syntactically annotated trees. In the first scenario, we study some of the index structures that are capable of answering the class of queries referred to here as wild card queries and perform an analysis of their performance. Our experimental results on a large class of queries from different sources (including query logs and parse trees) and with various datasets reveal some of the performance barriers of these indexes. We present Word Permuterm Index (WPI) and show that it supports a wide range of wild card queries, is quick to construct and is highly scalable. Our experimental results comparing WPI to alternative methods on a wide range of wild card queries show a few orders of magnitude performance improvement for WPI while the memory usage is kept the same for all compared systems. In the second scenario, we study index structures and access methods that improve the performance of querying over syntactically parsed sentences. We propose a novel indexing scheme over unique subtrees as index keys. We also introduce the root-split coding scheme that concisely stores structural subtree information, making it possible to perform exact axes matching over subtrees. We theoretically study the properties of our coding and the limitations it imposes over query processing. Our extensive set of experiments show that root-split coding reduces the index size of a baseline index which stores the interval codes of all nodes by a factor of up to 5 (i.e. 80% reduction) and speeds up querying runtime by more than 6 times on average, when subtrees of sizes 1, . . . , 5 are indexed.
Keyword: Index Structures; Indexing; Natural Language Text; Querying; Subtree Index; Tree Pattern Matching
URL: http://hdl.handle.net/10402/era.25328
https://era.library.ualberta.ca/items/bed0e282-8b01-4bc3-936e-9f47b2f86fb4
https://doi.org/10.7939/R3GQ6Q
BASE
Hide details
6
A Conversational Study of the Particle ne in Mandarin Chinese
Qin, Longlu. - : University of Alberta. Department of East Asian Studies., 2012
BASE
Show details
7
Indexing and Querying Natural Language Text
Chubak, Pirooz. - : University of Alberta. Department of Computing Science., 2012
BASE
Show details
8
A Conversational Study of the Particle ne in Mandarin Chinese
Qin, Longlu. - : University of Alberta. Department of East Asian Studies., 2012
BASE
Show details

Catalogues
Bibliographies
Linked Open Data catalogues
Online resources
Open access documents
8
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern