2 |
Quality and Efficiency of Manual Annotation: Data from the Pre-annotation Bias Experiment (part of the PDT-C 2.0 project)
|
|
|
|
BASE
|
|
Show details
|
|
5 |
PDT-Vallex: Czech Valency lexicon linked to treebanks 4.0 (PDT-Vallex 4.0)
|
|
|
|
BASE
|
|
Show details
|
|
8 |
FAUST 0.5
|
|
|
|
Abstract:
Syntactic (including deep-syntactic - tectogrammatical) annotation of user-generated noisy sentences. The annotation was made on Czech-English and English-Czech Faust Dev/Test sets. The English data includes manual annotations of English reference translations of Czech source texts. This texts were translated independently by two translators. After some necessary cleanings, 1000 segments were randomly selected for manual annotation. Both the reference translations were annotated, which means 2000 annotated segments in total. The Czech data includes manual annotations of Czech reference translations of English source texts. This texts were translated independently by three translators. After some necessary cleanings, 1000 segments were randomly selected for manual annotation. All three reference translations were annotated, which means 3000 annotated segments in total. Faust is part of PDT-C 1.0 (http://hdl.handle.net/11234/1-3185).
|
|
Keyword:
noisy texts; parallel corpus; tectogrammatics; treebank
|
|
URL: http://hdl.handle.net/11234/1-3308
|
|
BASE
|
|
Hide details
|
|
11 |
Search for the Relation of Form and Function Using the ForFun Database
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics , Vol 110, Iss 1, Pp 71-84 (2018) (2018)
|
|
BASE
|
|
Show details
|
|
14 |
Difference between Written and Spoken Czech: The Case of Verbal Nouns Denoting an Action
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics , Vol 107, Iss 1, Pp 19-38 (2017) (2017)
|
|
BASE
|
|
Show details
|
|
|
|