DE eng

Search in the Catalogues and Directories

Hits 1 – 16 of 16

1
COVID-19 Twitter Monitor: Aggregating and Visualizing COVID-19 Related Trends in Social Media ...
BASE
Show details
2
UZH@CRAFT-ST: a Sequence-labeling Approach to Concept Recognition ...
Furrer, Lenz; Cornelius, Joseph; Rinaldi, Fabio. - : Association for Computational Linguistics, 2019
BASE
Show details
3
Approaching SMM4H with Merged Models and Multi-task Learning
In: Ellendorff, Tilia; Furrer, Lenz; Colic, Nicola; Aepli, Noëmi; Rinaldi, Fabio (2019). Approaching SMM4H with Merged Models and Multi-task Learning. In: Proceedings of the 4th Social Media Mining for Health Applications (#SMM4H) Workshop & Shared Task, Florence, Italy, 2 August 2019 - 2 August 2019, 58-61. (2019)
BASE
Show details
4
Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus ...
Clematide, Simon; Furrer, Lenz; Volk, Martin. - : Gesellschaft für Sprachtechnologie und Computerlinguistik (GSCL), 2018
BASE
Show details
5
Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus
In: Clematide, Simon; Furrer, Lenz; Volk, Martin (2018). Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus. Journal for Language Technology and Computational Linguistics (JLCL), 33(1):25-47. (2018)
Abstract: Crowdsourcing approaches for post-correction of OCR output (Optical Character Recognition) have been successfully applied to several historical text collections. We report on our crowd-correction platform Kokos, which we built to improve the OCR quality of the digitized yearbooks of the Swiss Alpine Club (SAC) from the 19th century. This multilingual heritage corpus consists of Alpine texts mainly written in German and French, all typeset in Antiqua font. Finding and engaging volunteers for correcting large amounts of pages into high quality text requires a carefully designed user interface, an easy-to-use workflow, and continuous efforts for keeping the participants motivated. More than 180,000 characters on about 21,000 pages were corrected by volunteers in about 7 months, achieving an OCR ground truth with a systematically evaluated accuracy of 99.7 on the word level. The crowdsourced OCR ground truth and the corresponding original OCR recognition results from Abbyy FineReader for each page are available as a resource for machine learning and evaluation. Additionally, the scanned images (300 dpi) of all pages are included to enable tests with other OCR software.
Keyword: 000 Computer science; 410 Linguistics; crowdsourcing; Institute of Computational Linguistics; knowledge & systems; ocr
URL: https://jlcl.org/content/2-allissues/1-heft1-2018/jlcl_2018-1_2.pdf
https://www.zora.uzh.ch/id/eprint/162395/
https://doi.org/10.5167/uzh-162395
https://www.zora.uzh.ch/id/eprint/162395/1/ClematideFurrer2018.pdf
BASE
Hide details
6
OGER: OntoGene’s Entity Recogniser in the BeCalm TIPS Task
In: Furrer, Lenz; Rinaldi, Fabio (2017). OGER: OntoGene’s Entity Recogniser in the BeCalm TIPS Task. In: BioCreative V.5 Challenge Evaluation Workshop, Barcelona, Spain, 26 April 2017 - 27 April 2017, 175-182. (2017)
BASE
Show details
7
Efficient and Accurate Entity Recognition for Biomedical Text
In: Rinaldi, Fabio; Furrer, Lenz; Basaldella, Marco (2017). Efficient and Accurate Entity Recognition for Biomedical Text. In: BioCreative VI Workshop, Bethesda, MD, USA, 18 October 2017 - 20 October 2017, 195-197. (2017)
BASE
Show details
8
Retrospective analysis of 11 years of livestock necropsy data : evaluation for animal health surveillance
In: Faverjon, Céline; Küker, Susanne; Furrer, Lenz; Berezowski, John; Posthaus, Horst; Rinaldi, Fabio; Vial, Flavie (2017). Retrospective analysis of 11 years of livestock necropsy data : evaluation for animal health surveillance. In: 3rd International Conference on Animal Health Surveillance, Rotorua, New Zealand, 30 April 2017 - 4 May 2017, 228-230. (2017)
BASE
Show details
9
Ontogene Term and Relation Recognition for CDR
In: Ellendorff, Tilia Renate; Clematide, Simon; van der Lek, Adrian; Furrer, Lenz; Rinaldi, Fabio (2015). Ontogene Term and Relation Recognition for CDR. In: BioCreative V, Sevilla, 9 September 2015 - 11 September 2015, 305-310. (2015)
BASE
Show details
10
Unsupervised Text Segmentation for Automated Error Reduction
Furrer, Lenz [Verfasser]; Faaß, Gertrud [Herausgeber]. - Hildesheim : Universitätsbibliothek Hildesheim, 2014
DNB Subject Category Language
Show details
11
Unsupervised Text Segmentation for Automated Error Reduction
Furrer, Lenz. - 2014
BASE
Show details
12
Unsupervised Text Segmentation for Automated Error Reduction
In: Furrer, Lenz (2014). Unsupervised Text Segmentation for Automated Error Reduction. In: KONVENS 2014, Hildesheim, 8 October 2014 - 10 October 2014, 178-185. (2014)
BASE
Show details
13
Disambiguation of the Semantics of German Prepositions: a Case Study
In: Clematide, Simon; Klenner, Manfred; Furrer, Lenz (2013). Disambiguation of the Semantics of German Prepositions: a Case Study. In: Proceedings of NLPCS 2013: 10th International Workshop on Natural Language Processing and Cognitive Science, Marseille, France — Octobre 2013, Marseille, France, 15 October 2013 - 16 October 2013, 137-150. (2013)
BASE
Show details
14
Strategies for reducing and correcting OCR errors
In: Volk, Martin; Furrer, Lenz; Sennrich, Rico (2011). Strategies for reducing and correcting OCR errors. In: Sporleder, Caroline; van den Bosch, Antal; Zervanou, Kalliopi. Language Technology for Cultural Heritage. Berlin: Springer, 3-22. (2011)
BASE
Show details
15
Reducing OCR errors in Gothic-script documents
In: Furrer, Lenz; Volk, Martin (2011). Reducing OCR errors in Gothic-script documents. ERCIM News, (86):29-30. (2011)
BASE
Show details
16
Challenges in building a multilingual alpine heritage corpus ...
BASE
Show details

Catalogues
0
0
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
15
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern