1 |
CNN-Based Page Segmentation and Object Classification for Counting Population in Ottoman Archival Documentation
|
|
|
|
In: Journal of Imaging ; Volume 6 ; Issue 5 (2020)
|
|
BASE
|
|
Show details
|
|
2 |
Adaptive Algorithms for Automated Processing of Document Images
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Portable Language-Independent Adaptive Translation from OCR. Phase 1
|
|
|
|
In: DTIC (2009)
|
|
BASE
|
|
Show details
|
|
4 |
PLATO: Portable Language-Independent Adaptive Translation from OCR
|
|
|
|
In: DTIC (2008)
|
|
BASE
|
|
Show details
|
|
5 |
A Methodology for End-to-End Evaluation of Arabic Document Image Processing Software
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
6 |
Parsing And Tagging Of Bilingual Dictionary
|
|
|
|
In: http://www.umiacs.umd.edu/lamp/pubs/TechReports/LAMP_106/LAMP_106.pdf (2003)
|
|
BASE
|
|
Show details
|
|
7 |
Parsing And Tagging Of Bilingual Dictionary
|
|
|
|
In: http://www.cs.umd.edu/Library/TRs/CS-TR-4529/CS-TR-4529.pdf (2003)
|
|
Abstract:
Bilingual dictionaries hold great potential as a source of lexical resources for training and testing automated systems for optical character recognition, machine translation, and cross-language information retrieval. In this paper, we describe a system for extracting term lexicons from printed bilingual dictionaries. Our work was divided into three phases - dictionary segmentation, entry tagging, and generation. In segmentation, pages are divided into logical entries based on structural features learned from selected examples. The extracted entries are associated with functional labels and passed to a tagging module which associates linguistic labels with each word or phrase in the entry. The output of the system is a structure that represents the entries from the dictionary. We have used this approach to parse a variety of dictionaries with both Latin and non-Latin alphabets, and demonstrate the results of term lexicon generation for retrieval from a collection of French news stories using English queries.
|
|
Keyword:
Bilingual Dictionaries; Cross-Language IR; Logical Analysis; OCR; Page Segmentation
|
|
URL: http://www.cs.umd.edu/Library/TRs/CS-TR-4529/CS-TR-4529.pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.5.4616
|
|
BASE
|
|
Hide details
|
|
8 |
PARSING AND TAGGING OF BILINGUAL DICTIONARY
|
|
|
|
In: http://www.cs.umd.edu/Library/TRs/CS-TR-4529/CS-TR-4529.pdf (2003)
|
|
BASE
|
|
Show details
|
|
10 |
Parsing and Tagging of Binlingual Dictionary
|
|
|
|
In: DTIC (2003)
|
|
BASE
|
|
Show details
|
|
11 |
Text extraction in complex color documents
|
|
|
|
In: http://ipml.ee.duth.gr/~papamark/color_documents.pdf (2002)
|
|
BASE
|
|
Show details
|
|
12 |
Performance evaluation of document layout analysis algorithms on the UW data set
|
|
|
|
In: http://isl.ee.washington.edu/~jliang/Postscript/spie97-1.ps.gz (1997)
|
|
BASE
|
|
Show details
|
|
13 |
Extraction of Text-Related Features for Condensing Image Documents
|
|
|
|
In: http://www.parc.xerox.com/istl/members/fchen/././projects/qca/dimsum/spie96dimsum.ps.Z (1996)
|
|
BASE
|
|
Show details
|
|
14 |
OPTICAL CHARACTER RECOGNITION OF HISTORICAL TEXTS: END-USER FOCUSED RESEARCH FOR SLOVENIAN BOOKS AND NEWSPAPERS FROM THE 18TH AND 19TH CENTURY
|
|
|
|
In: http://nl.ijs.si/imp/bib/NCD21117.pdf
|
|
BASE
|
|
Show details
|
|
15 |
G.3 [Probability and Statistics]: Distribution Functions General Terms
|
|
|
|
In: http://www2009.org/proceedings/pdf/p1165.pdf
|
|
BASE
|
|
Show details
|
|
|
|