1 |
Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory
In: DTIC (2001)
|
|
BASE
|
|
|
|
3 |
Tasks, Domains, and Languages
In: DTIC (1993)
|
|
Abstract:
The Fifth Message Understanding Conference (MUC-5) involved the same tasks, domains and languages as the information extraction portion of the ARPA TIPSTER program. These tasks center on automatically filling object-oriented data structures, called templates, with information extracted from free text in news stories (for discussion of templates and objects, see "Template Design for Information Extraction" in this volume). For each task, a generic type of information that is specified for extraction corresponds to each of the slots in the templates. With text as input, the MUC-5 systems first detect whether the text contains relevant information. If available, the systems extract specific instances of the generic types from the text and output that information by filling the template slots with the appropriately formatted data representations. These slots are then scored by using an automatic scoring program with analyst-produced templates as the keys. Human analysts also prepared development set templates for each domain, which served as training models for system developers (for discussion of the data preparation effort, see "Corpora and Data Preparation" in this volume).
|
|
Keyword:
*INFORMATION RETRIEVAL; *TEMPLATES; *TEXT PROCESSING; ANALYSTS; AUTOMATIC PROGRAMMING; DATA BASES; DATA MANAGEMENT; EXTRACTION; Information Science; Linguistics; MODELS; PERSONNEL; PREPARATION; SCORING; SYSTEMS ENGINEERING; TRAINING
|
|
URL: http://www.dtic.mil/docs/citations/ADA459848 http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA459848
|
|
BASE
|
|
|
|
4 |
Corpora and Data Preparation for Information Extraction
In: DTIC (1993)
|
|
BASE
|
|
|
|
5 |
Tasks, Domains, and Languages for Information Extraction
In: DTIC (1993)
|
|
BASE
|
|
|
|