DE eng

Search in the Catalogues and Directories

Hits 1 – 18 of 18

1
Towards a Gold Standard Corpus for Variable Detection and Linking in Social Science Publications
In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC) ; International Conference on Language Resources and Evaluation (LREC) ; 11 (2018)
Abstract: In this paper, we describe our effort to create a new corpus for the evaluation of detecting and linking so-called survey variables in social science publications (e.g., "Do you believe in Heaven?"). The task is to recognize survey variable mentions in a given text, disambiguate them, and link them to the corresponding variable within a knowledge base. Since there are generally hundreds of candidates to link to and due to the wide variety of forms they can take, this is a challenging task within NLP. The contribution of our work is the first gold standard corpus for the variable detection and linking task. We describe the annotation guidelines and the annotation process. The produced corpus is multilingual - German and English - and includes manually curated word and phrase alignments. Moreover, it includes text samples that could not be assigned to any variables, denoted as negative examples. Based on the new dataset, we conduct an evaluation of several state-of-the-art text classification and textual similarity methods. The annotated corpus is made available along with an open-source baseline system for variable mention identification and linking.
Keyword: 30200; 50200; algorithm; Algorithmus; computational linguistics; Computerlinguistik; data; Daten; Information Science; Informationswissenschaft; journalism; Journalismus,Verlagswesen; Linguistics; Linguistik; Literatur; Literature; Literaturwissenschaft; News media; publication; Publikation; publishing; Publizistische Medien; rhetoric and criticism; Rhetorik; Science of Literature; social science; Sozialwissenschaft; Sprachwissenschaft; text mining; semantic textual similarity; paraphrase detection; linking
URL: http://nbn-resolving.org/urn:nbn:de:0168-ssoar-57723-2
https://www.ssoar.info/ssoar/handle/document/57723
BASE
Hide details
2
Sociolinguistically Informed Natural Language Processing: Automating Irony Detection
In: DTIC (2015)
BASE
Show details
3
Semantic Information eXchange Architecture (SIXA)
In: DTIC (2010)
BASE
Show details
4
Novel Topic Impact on Authorship Attribution
In: DTIC (2009)
BASE
Show details
5
Detecting Age in Online Chat
In: DTIC (2009)
BASE
Show details
6
Improving Information Extraction and Translation Using Component Interactions
In: DTIC (2008)
BASE
Show details
7
IIT Kharagpur at TREC 2008 Blog Track
In: DTIC (2008)
BASE
Show details
8
Text Mining the Biomedical Literature
In: DTIC (2007)
BASE
Show details
9
Effectively Using Syntax for Recognizing False Entailment
In: DTIC (2006)
BASE
Show details
10
Sentence Level Information Patterns for Novelty Detection
In: DTIC (2006)
BASE
Show details
11
Toward Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings
In: DTIC (2005)
BASE
Show details
12
Discriminative Slot Detection Using Kernel Methods
In: DTIC (2004)
BASE
Show details
13
Learning to Identify TV News Monologues by Style and Context
In: DTIC (2003)
BASE
Show details
14
DANDE: Deductive Anomaly Detection With Program Synthesis
In: DTIC AND NTIS (2003)
BASE
Show details
15
Integrated Feasibility Experiment for Bio-Security: IFE-Bio, A TIDES Demonstration
In: DTIC (2001)
BASE
Show details
16
Text Detection and Translation from Natural Scenes
In: DTIC (2001)
BASE
Show details
17
Automatic Verification of Multiagent Conversations
In: DTIC (2000)
BASE
Show details
18
Focus of Tipster Phases I and 2
In: DTIC (1996)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
18
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern