DE eng

Search in the Catalogues and Directories

Page: 1...4 5 6 7 8 9 10 11 12...37
Hits 141 – 160 of 723

141
Managing fieldwork data with Toolbox and the Natural Language Toolkit
BASE
Show details
142
Watch out! Unicode is coming
BASE
Show details
143
Dictionary Development Program
Moe, Ronald. - 2007
BASE
Show details
144
Off the cassette tape and onto CD: migrating analog audio data to digital format
Mead, David. - 2007
BASE
Show details
145
Following natural language route instructions
BASE
Show details
146
Clustering methodologies for identifying country core competencies
Kostoff, Ronald N.; del Rio, J. Antonio; Cortes, Héctor D.. - : Sage Publications Ltd, 2007
BASE
Show details
147
Trust in Security-Policy Enforcement Mechanisms
In: DTIC (2006)
BASE
Show details
148
From Cognition To Language: The Modeling Field Theory Approach
In: DTIC (2006)
BASE
Show details
149
Automated Communications Analysis System using Latent Semantic Analysis
In: DTIC (2006)
BASE
Show details
150
A Hybrid Approach for QA Track Definitional Questions
In: DTIC (2006)
BASE
Show details
151
Semantic Web Services with Web Ontology Language (OWL-S) - Specification of Agent-Services for DARPA Agent Markup Language (DAML)
In: DTIC (2006)
BASE
Show details
152
Exploring the Utility of ResearchCyc for Reasoning from Natural Language
In: DTIC (2006)
BASE
Show details
153
3D Photonic Crystals Build Up By Self-Organization Of Nanospheres
In: DTIC (2006)
BASE
Show details
154
A Methodology for End-to-End Evaluation of Arabic Document Image Processing Software
In: DTIC (2006)
Abstract: This paper describes a methodology for end-to-end evaluation of Arabic document image processing software. Various software solutions have been proposed for digitization and understanding of noisy, complex Arabic document images. Optical-character-recognition-based (OCR-based) solutions have been available for decades; however this technology is often tailored to the most common document image type: clean, monolingual documents. Real-world documents often involve multiple languages, handwriting, logos, signatures, pictures, stylized text, and other document aspects. Real-world documents involve noise introduced by document aging, reproduction, or exposure to environment factors. Document image processing solutions are maturing to deal with such complexities. Such systems include image clean-up algorithms and page segmentation, followed by various recognition or digitization algorithms: OCR, handwritten word recognition (HWR), logo identification, signature identification, sub-image or picture identification. Indexing digitized document renditions into a search engine enables ad hoc querying of the collection. Some researchers have proposed semi-automation, a process in which human readers interpret complex documents and record a spoken rendition; the audio recordings are then processed by a spoken document retrieval (SDR) system, employing automatic speech recognition (ASR) for digitization and an information retrieval solution to enable ad hoc queries. To handle foreign language, machine translation may be included in any of the aforementioned document image processing systems. This array of approaches results in widely varying performance. This paper discusses a methodology for evaluating the end-to-end retrieval performance of these systems: the ad-hoc use case. The methodology can be easily tailored to other languages, and to other document formats (e.g., audio and video). ; The original document contains color images.
Keyword: *ARABIC LANGUAGE; *COMPUTER PROGRAMS; *IMAGE PROCESSING; *INFORMATION RETRIEVAL; *METHODOLOGY; *RETRIEVAL PERFORMANCE; *SOFTWARE EVALUATION METHODOLOGY; *TEST AND EVALUATION; AD HOC USE CASE; ALGORITHMS; ASR(AUTOMATIC SPEECH RECOGNITION); CLEAN-UP ALGORITHMS; Computer Programming and Software; DIGITAL IMAGES; DIGITIZATION; Equipment and Methods; HWR(HANDWRITTEN WORD RECOGNITION); ILLEGIBLE DOCUMENTS; IMAGE PROCESSING SOFTWARE; Linguistics; MACHINE TRANSLATION; MULTILINGUAL DOCUMENTS; NOISY DOCUMENT IMAGES; OPTICAL CHARACTER RECOGNITION; PAGE SEGMENTATION; PERFORMANCE(ENGINEERING); PICTURES; PRECISION; RETRIEVAL PRECISION; RETRIEVAL RECALL; SDR(SPOKEN DOCUMENT RETRIEVAL); SPEECH RECOGNITION; SYMBOLS; Test Facilities; TREC MEASURES; TREC(TEXT RETRIEVAL CONFERENCES); WEAR; WORD RECOGNITION
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA468394
http://www.dtic.mil/docs/citations/ADA468394
BASE
Hide details
155
A new mass elicitation technique: the dictionary development program
Berg, René van den; Shore, Susan. - : Linguistic Society of the Philippines and SIL International, 2006
BASE
Show details
156
One dictionary, one language, one team, but different locations? Version control and file management turn chaos into quality
Grimes, Charles E.. - : Linguistic Society of the Philippines and SIL International, 2006
BASE
Show details
157
The SIL FieldWorks language explorer approach to morphological parsing
BASE
Show details
158
Ensuring that digital data last: the priority of archival form over working form and presentation form
Simons, Gary F.. - : SIL, 2006
BASE
Show details
159
A CAI program for teaching Filipino
McFarland, Curtis D.. - : Linguistic Society of the Philippines and SIL International, 2006
BASE
Show details
160
Extending WebCrow into English
BASE
Show details

Page: 1...4 5 6 7 8 9 10 11 12...37

Catalogues
0
0
0
0
0
0
2
Bibliographies
0
0
0
0
0
0
0
0
19
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
702
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern