Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher:
- Year
- Medium
- Type
- BLLDB-Access:
  - free (8)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 8 of 8

1	Getting at the Cognitive Complexity of Linguistic Metadata Annotation: A Pilot Study Using Eye-Tracking
	Lohman, Steffen; Tomanek, Katrin; Ziegler, Jurgen...
	In: Lohman, Steffen; Tomanek, Katrin; Ziegler, Jurgen; & Hahn, Udo. (2010). Getting at the Cognitive Complexity of Linguistic Metadata Annotation: A Pilot Study Using Eye-Tracking. Proceedings of the Cognitive Science Society, 32(32). Retrieved from: http://www.escholarship.org/uc/item/88h8d92b (2010)
	BASE
	Show details

2	A cognitive cost model of annotations based on eye-tracking data
	Tomanek, Katrin; Hahn, Udo; Lohmann, Steffen...
	In: Association for Computational Linguistics. Proceedings of the conference. - Stroudsburg, Penn. : ACL 48 (2010) 2, 1158-1167
	BLLDB
	Show details

3	Resource-aware annotation through active learning ...
	Tomanek, Katrin. - : Technische Universität Dortmund, 2010
	BASE
	Show details

4	Message Understanding Conference 7 Timed (MUC7_T)
	Tomanek, Katrin; Hahn, Udo. - : Linguistic Data Consortium, 2010. : https://www.ldc.upenn.edu, 2010
	BASE
	Show details

5	Message Understanding Conference 7 Timed (MUC7_T) ...
	Tomanek, Katrin; Hahn, Udo. - : Linguistic Data Consortium, 2010
	BASE
	Show details

6	Resource-aware annotation through active learning
	Tomanek, Katrin. - 2010
	Abstract: The annotation of corpora has become a crucial prerequisite for information extraction systems which heavily rely on supervised machine learning techniques and therefore require large amounts of annotated training material. Annotation, however, requires human intervention and is thus an extremely costly, labor-intensive, and error-prone process. The burden of annotation is one of the major obstacles when well-established information extraction systems are to be applied to real-world problems and so a pressing research question is how annotation can be made more efficient. Most annotated corpora are built by collecting the documents to be annotated on a random sampling basis or based on simple keyword search. Only recently, more sophisticated approaches to select the base material in order to reduce annotation effort are being investigated. One promising direction is known as Active Learning (AL) where only examples of high utility for classifier training are selected for manual annotation. Because of this intelligent selection, classifiers of a certain target performance can be yieled with less labeled data points. This thesis centers around the question how AL can be applied as resource-aware strategy for linguistic annotation. A set of requirements is defined and several approaches and adaptations to the standard form of AL are proposed to meet these requirements. This includes: (1) a novel method to monitor and stop the AL-driven annotation process; (2) an approach to semi-supervised AL where only highly critical tokens have to actually be manually annotated while the rest is automatically tagged; (3) a discussion and empirical investigation of the reusability of actively drawn samples; (4) a comparative study how class imbalance can be reduced right upfront during AL-driven data acquisition; (5) two methods for selective sampling of examples which are useful for multiple learning problems; (6) an extensive evaluation of the proposed approaches to AL for Named Entity Recognition with respect to both savings in corpus size and actual annotation time; and finally (7) three methods how these approaches can be made cost-conscious so as to reduce annotation time even more.
	Keyword: Active learning; Corpus annotation; ddc:004; Information extraction; Machine learning; Named entity recognition; Natural language processing
	URL: http://hdl.handle.net/2003/27172 https://nbn-resolving.org/urn:nbn:de:hbz:290-2003/27172-1 https://doi.org/10.17877/DE290R-15670
	BASE
	Hide details

7	BootStrep annotation scheme encoding information for text mining
	Piao, Scott; Buyko, Ekaterina; Tsuruoka, Yoshimasa. - 2007
	BASE
	Show details

8	An annotation type system for a data-driven NLP pipeline
	Hahn, Udo; Buyko, Ekaterina; Tomanek, Katrin. - 2007
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern