DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Using SMT for OCR error correction of historical texts
In: Afli, Haithem orcid:0000-0002-7449-4707 , Qui, Zhengwei, Way, Andy orcid:0000-0001-5736-5930 and Sheridan, Páraic (2016) Using SMT for OCR error correction of historical texts. In: Tenth International Conference on Language Resources and Evaluation (LREC 2016), 23-28 May 2016, Portorož, Slovenia. ISBN 978-2-9517408-9-1 (2016)
Abstract: A trend to digitize historical paper-based archives has emerged in recent years, with the advent of digital optical scanners. A lot of paper-based books, textbooks, magazines, articles, and documents are being transformed into electronic versions that can be manipulated by a computer. For this purpose, Optical Character Recognition (OCR) systems have been developed to transform scanned digital text into editable computer text. However, different kinds of errors in the OCR system output text can be found, but Automatic Error Correction tools can help in performing the quality of electronic texts by cleaning and removing noises. In this paper, we perform a qualitative and quantitative comparison of several error-correction techniques for historical French documents. Experimentation shows that our Machine Translation for Error Correction method is superior to other Language Modelling correction techniques, with nearly 13% relative improvement compared to the initial baseline.
Keyword: Language Modelling; Machine translating; Optical Character Recognition; SpeechToSpeech Translation
URL: http://doras.dcu.ie/23226/
BASE
Hide details
2
Domain adaptation for social localisation-based SMT: a Case study using the Trommons platform
In: Du, Jinhua orcid:0000-0002-3267-4881 , Way, Andy orcid:0000-0001-5736-5930 , Qui, Zhengwei, Wasala, Asanka and Schäler, Reinhard (2015) Domain adaptation for social localisation-based SMT: a Case study using the Trommons platform. In: MT Summit Workshop on Post-Editing Technology and Practice (WPTP4) as part of Machine Translation Summit XV, 3 Oct-3 Nov, 2015, Miami, FL, USA. (2015)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern