1
Classifying Bias in Large Multilingual Corpora via Crowdsourcing and Topic Modeling
BASE

2
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language ...
BASE

3
A random forest system combination approach for error detection in digital dictionaries ...
BASE

4
Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling ...
BASE

5
A random forest system combination approach for error detection in digital dictionaries
BASE

6
Citation Handling: Processing Citation Texts in Scientific Documents
Abstract:
Citation sentences (sentences that cite other papers) play a key role in the summarization of scientific articles. However, a citation-based summarization system that depends on generic natural language processing components, such as parsers or sentence compressors, will perform poorly if those components cannot handle citations correctly. In this thesis, I examine the effect of citation handling on parsing, sentence compression, and multi-document summarization. There are two types of citations that occur in citation sentences: constituent citations and parenthetical citations. I propose an automatic citation classifier based on training data created through Mechanical Turk tasks. I demonstrate that the use of type-specific citation handling as pre-processing improves the performance of a state-of-the-art generic parser, both for quality of the parse trees and running time. Extrinsic evaluations demonstrate that improving the performance of a parser on citation sentences in turn improves the performance of a sentence compressor, Trimmer (Zajic et al., 2007), and a multi-document summarization system, MASCS, according to several summarization measures.
Keyword:
citation; Computer science; multi-document summarization; parsing; sentence compression
URL: http://hdl.handle.net/1903/13176
BASE

9
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language ...
BASE

10
Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling ...
BASE

11
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language
BASE

12
Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling
BASE

13
Citation Handling for Improved Summarization of Scientific Documents
BASE

14
Error Correction for Arabic Dictionary Lookup
In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valletta, 17-23 May 2010 (2010), 263-268
IDS OBELEX meta

15
Multiple Alternative Sentence Compressions as a Tool for Automatic Summarization Tasks
BASE

16
Headline Generation for Written and Broadcast News
In: DTIC (2005)
BASE

17
Hedge Trimmer: A Parse-and-Trim Approach to Headline Generation
In: DTIC (2003)
BASE