DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5
Hits 1 – 20 of 94

1
Towards Parallel Algorithms for Abstract Dialectical Frameworks ...
Hofer, Mathias. - : TU Wien, 2022
BASE
Show details
2
Detecting Signal Corruptions in Voice Recordings for Speech Therapy ; Igenkänning av Signalproblem i Röstinspelningar för Logopedi
Nylén, Helmer. - : KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021
BASE
Show details
3
Noise-skipping Earley parsing and in-order tree extraction from shared packed parse forests
Dohmann, Jeremy. - 2021
Abstract: In this thesis, I identify 3 shortcomings of state of the art Earley parsing and offer a unified, end-to-end Earley parsing algorithm which remedies these shortcomings. In particular I address the following issues: 1. The Earley algorithm traditionally returns parses without any rank-ordering associated with them. In addition to not ranking the parses during parse time, the data structure used to represent parse results, the shared-packed parse forest (SPPF), doesn’t have any intrinsic way to extract trees in a particular order. This is a major shortcoming because, for all but the most trivial grammars and inputs, the number of potential parses for an input is intractably large, making the parser effectively useless barring some external mechanism to filter results. 2. The Earley algorithm requires that parses explain contiguous spans of tokens without any unexplained out-of-vocabulary items or tokens interceding. In other words, traditional Earley parsing parses only whole strings of tokens, as opposed to permitting parsing only discontinuous subsequences of the input tokens. This is a shortcoming in applications where the input contains spurious tokens or where the grammar is only intended to describe a subset of the input. 3. The Earley algorithm’s run time is proportional to grammar size. Current state of the art methods substantially restrict the size of grammars which can be practically parsed with reasonable memory and time constraints. This is a major shortcoming because it can be difficult to represent sufficiently expressive languages for many applications using anything short of a massive grammar. The narrative arc of this thesis can be understood as an attempt to redress the three issues above by extending the Earley algorithm to enable parse ranking and quick extraction, make it robust to noise skipping, and make parsing with massive grammars tractable. In chapter two I address issue 1 by introducing a framework for ranking parses as a function of their intrinsic attributes. I present this approach as an extension of the work done in [41] and [21]. [41] introduces parsing as a form of deductive reasoning on a logical system defined by a context-free grammar, while [21] unifies a number of concepts from deductive parsing (e.g. recognition, assigning probabilities to parses, etc.) as performing calculations on a suitably defined semiring. Thus chapter two introduces a semiring I refer to as the ‘attribute semiring’ and show how Earley parsing can be used to derive the attributes of the best derivations for Earley states - where “best” is evaluated by applying a utility function f (·) to a state’s attributes. I use the semiring formalism to show which types of attributes and utility functions can be used for this purpose. In chapters three and four, I address the first two issues above by introducing extensions to the standard Earley parser and standard SPPF-construction algorithm to handle skipping tokens in noisy inputs as well innovations to associate each Earley state and SPPF node with the attributes of its best derivation. In chapter 3, I prove that the modified noise-skipping parsing algorithm runs in O(w · n 2 ) time where w is the number of tokens allowed to be skipped between any explained tokens, also referred to as the ‘skip width’. Chapter four details the construction of a novel variant of the SPPF called an ordered-SPPF which, by design, permits us to associate the best derivation attributes with SPPF nodes and makes extracting the best tree from a forest trivially easy. In that chapter, I show that construction of the ordered-SPPF can be done in O(w 2 n 3 · log(wn)) time In chapter five, I address issue two, by showing that the ordered SPPF from chapter four can be used to not only efficiently extract the best tree in the forest but also the top k parses ranked by applying the utility function to their attributes. I show my efficient top-k algorithm can extract the top k parses in O(kn 2 ·log(kn) time. In appendix A, I address the third issue, by showing an algorithm to filter grammars at run time in order to make large-grammar parsing more efficient. Finally, in appendix B, I introduce an extension that allows the Earley parser to run continuously and return ranked parses in a streamed manner. This extension requires nontrivial modifications to the core logic in order to maintain memory efficiency while not losing any valid parses. Thus, this thesis details a unified approach permitting the Earley algorithm to parse valid subsequences of noisy inputs while using massive grammars. The system detailed here is also capable of ranking the parses by customized utility functions over user-defined attributes with the constraints governing such functions detailed in chapter 2.
Keyword: algorithms; Computer science; data structures; Earley parsing; Linguistics; parse ranking; semiring parsing; shared packed parse forest
URL: https://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37367424
BASE
Hide details
4
Temporal Social Network Analysis using Harvard Caselaw Access Project
Trias, Fernando. - 2021
BASE
Show details
5
Going beyond our means: A proposal for improving psycholinguistic methods
BASE
Show details
6
Extracting Human Behaviour and Personality Traits from Social Media
Singh, Ravinder. - 2021
BASE
Show details
7
Towards provably efficient algorithms for learning neural networks ...
Goel, Surbhi. - : The University of Texas at Austin, 2020
BASE
Show details
8
Factions: acts of worldbuilding on social media platforms ...
Little, Dana L.. - : University of Glasgow, 2020
BASE
Show details
9
Parallel text index construction ...
Kurpicz, Florian. - : Technische Universität Dortmund, 2020
BASE
Show details
10
Understanding and generating language with abstract meaning representation
Damonte, Marco. - : The University of Edinburgh, 2020
BASE
Show details
11
Detección de linguaxe misóxino e xenófobo en redes sociais mediante aprendizaxe máquina
BASE
Show details
12
Parallel text index construction
BASE
Show details
13
Towards provably efficient algorithms for learning neural networks
Goel, Surbhi. - 2020
BASE
Show details
14
Emoción, percepción, producción: un estudio psicolingüístico para detectar emociones en el habla
Gibson, M. (Mark); González-Machorro, M. (Mónica). - 2020
BASE
Show details
15
Language Recognition in the Sliding Window Model ... : Formale Sprachen im Sliding-Window-Modell ...
Ganardi, Moses. - : Universitätsbibliothek Siegen, 2019
BASE
Show details
16
System-Aware Algorithms For Machine Learning
Mendler-Dünner, Celestine. - : ETH Zurich, 2019
BASE
Show details
17
Seguridad del paciente: estudio de factores para su consecución
Figueiredo Escribá, Carlos de. - : Universitat de Barcelona, 2019
In: TDX (Tesis Doctorals en Xarxa) (2019)
BASE
Show details
18
Fast machine translation on parallel and massively parallel hardware
Bogoychev, Nikolay Veselinov. - : The University of Edinburgh, 2019
BASE
Show details
19
Preference inference based on lexicographic and Pareto models
George, Anne-Marie. - : University College Cork, 2019
BASE
Show details
20
Seguridad del paciente: estudio de factores para su consecución
Figueiredo Escribá, Carlos de. - : Universitat de Barcelona, 2019
BASE
Show details

Page: 1 2 3 4 5

Catalogues
0
0
0
0
0
0
1
Bibliographies
0
0
0
0
0
0
0
0
8
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
85
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern