1 |
Corpora and Data Preparation
|
|
|
|
In: DTIC (1993)
|
|
Abstract:
The data selection and data preparation efforts which led to the TIPSTER and Fifth Message Understanding Conference (MUC-5) evaluation corpora involved substantial effort, time and resources. The Government commitment to these selection and preparation efforts stems from four TIPSTER Program objectives: (1) to provide training data that would promote the development of information extraction technology, (2) to provide accurate test data to evaluate and baseline system performance in an objective manner, (3) to provide a baseline for human performance to understand and interpret machine performance, and (4) to support the larger Natural Language Processing community by making available a unique set of texts and templates in multiple domains and languages under ARPA support. This commitment was demonstrated through the managerial, technical, and administrative support to these efforts from various Government agencies, as well as through the contractual efforts with the Institute for Defense Analyses for data preparation and New Mexico State University for software tool development. ; Presented at the Message Understanding Conference (5th) held in Baltimore, MD on 25-27 August 1993. Pub. in the Message Understanding Conference (5th), p1-5, 1993. ISBN 1-55860-336-0.
|
|
Keyword:
*CORPORA; *DATA MANAGEMENT; *INFORMATION RETRIEVAL; *KNOWLEDGE BASED SYSTEMS; *LANGUAGE TRANSLATION; *MESSAGE UNDERSTANDING; *TEXT PROCESSING; ACCURACY; Cybernetics; DATA EXTRACTION; EXPERIMENTAL DATA; Information Science; LANGUAGE DOMAIN PAIRS; Linguistics; NATURAL LANGUAGE; PERFORMANCE(ENGINEERING); PERFORMANCE(HUMAN); PREPARATION; SOFTWARE ENGINEERING; SOFTWARE TOOLS; SYMPOSIA; TEMPLATES; TIPSTER PROGRAM; TRAINING; USER NEEDS
|
|
URL: http://www.dtic.mil/docs/citations/ADA460923 http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA460923
|
|
BASE
|
|
Hide details
|
|
3 |
Corpora and Data Preparation for Information Extraction
|
|
|
|
In: DTIC (1993)
|
|
BASE
|
|
Show details
|
|
4 |
Tasks, Domains, and Languages for Information Extraction
|
|
|
|
In: DTIC (1993)
|
|
BASE
|
|
Show details
|
|
|
|