3 |
Prague Dependency Treebank 2.0 ...
|
|
Sgall, Petr; Pajas, Petr; Mikulová, Marie; Hajič, Jan; Panevová, Jarmila; Hajičová, Eva; Štěpánek, Jan; Havelka, Jiří; Žabokrtský, Zdeněk; Ševčíková-Razímová, Magda; Urešová, Zdeňka. - : Linguistic Data Consortium, 2006
|
|
Abstract:
Introduction The Prague Dependency Treebank 2.0 (PDT 2.0) was developed by Charles University and contains approximately 2 million words of Czech text with complex and interlinked morphological, syntactic, and complex semantic annotation. In addition, certain properties of sentence information structure and coreference relations are annotated at the semantic level. PDT 2.0 follows Prague Dependency Treebank 1.0 (LDC2001T10) and is based on the long-standing Praguian linguistic tradition, adapted for the current Computational Linguistics research needs. The corpus itself uses the latest annotation technology. Software tools for corpus search, annotation, and language analysis are included. Extensive documentation (in English) is provided as well. Data The data in this corpus comes from four sources: - Lidové Noviny (daily newspapers), 1991, 1994, 1995
- Mladá ...
|
|
URL: https://catalog.ldc.upenn.edu/LDC2006T01 https://dx.doi.org/10.35111/e6p0-9s32
|
|
BASE
|
|
Hide details
|
|
|
|