1 |
Document Understanding: Research Directions
|
|
|
|
In: http://www.cedar.buffalo.edu/Publications/Postscript/Survey.ps (1992)
|
|
Abstract:
A document image is a visual representation of a printed page such as a journal article page, a facsimile cover page, a technical document, an office letter, etc. Document understanding as a research endeavor consists of studying all processes involved in taking a document through various representations: from a scanned physical document to high-level semantic descriptions of the document. Some of the types of representation that are useful are: editable descriptions, descriptions that enable exact reproductions and high-level semantic descriptions about document content. This report is a definition of five research subdomains within document understanding as pertaining to predominantly printed documents. The topics described are: modular architectures for document understanding; decomposition and structural analysis of documents; model-based OCR; table, diagram and image understanding; and performance evaluation under distortion and noise. 1 Each of the main sections of this paper we.
|
|
Keyword:
Contents
|
|
URL: http://www.cedar.buffalo.edu/Publications/Postscript/Survey.ps http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.47.7718
|
|
BASE
|
|
Hide details
|
|
|
|