DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 28

1
SYN v9: large corpus of written Czech
Abstract: Corpus of contemporary written (printed) Czech sized 4.7 GW (i.e. 5.7 billion tokens). It covers mostly the 1990-2019 period and features rich metadata including detailed bibliographical information, text-type classification etc. SYN v9 contains a wide variety of text types (fiction, non-fiction, newspapers), but the newspapers prevail noticeably. The corpus is lemmatized and morphologically tagged by the new CNC tagset first utilized for the annotation of the SYN2020 corpus. SYN v9 is provided in a CoNLL-U-like vertical format used as an input to the Manatee query engine. The data thus correspond to the corpus available via the KonText query interface to the registered users of CNC at http://www.korpus.cz with one important exception: the corpus is shuffled, i.e. divided into blocks sized max. 100 words (respecting the sentence boundaries) with ordering randomized within the given document.
Keyword: corpus; written language
URL: http://hdl.handle.net/11234/1-4635
BASE
Hide details
2
Universal Dependencies 2.9
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2021
BASE
Show details
3
Universal Dependencies 2.8.1
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2021
BASE
Show details
4
Universal Dependencies 2.8
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2021
BASE
Show details
5
Universal Dependencies 2.7
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2020
BASE
Show details
6
Universal Dependencies 2.6
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2020
BASE
Show details
7
Universal Dependencies 2.5
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2019
BASE
Show details
8
AKCES-GEC Grammatical Error Correction Dataset for Czech
Šebesta, Karel; Bedřichová, Zuzanna; Šormová, Kateřina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2019
BASE
Show details
9
Universal Dependencies 2.4
Nivre, Joakim; Abrams, Mitchell; Agić, Željko. - : Universal Dependencies Consortium, 2019
BASE
Show details
10
Universal Dependencies 2.2
In: https://hal.archives-ouvertes.fr/hal-01930733 ; 2018 (2018)
BASE
Show details
11
Universal Dependencies 2.3
Nivre, Joakim; Abrams, Mitchell; Agić, Željko. - : Universal Dependencies Consortium, 2018
BASE
Show details
12
Universal Dependencies 2.2
Nivre, Joakim; Abrams, Mitchell; Agić, Željko. - : Universal Dependencies Consortium, 2018
BASE
Show details
13
Universal Dependencies 2.1
In: https://hal.inria.fr/hal-01682188 ; 2017 (2017)
BASE
Show details
14
CzeSL Grammatical Error Correction Dataset (CzeSL-GEC)
Šebesta, Karel; Bedřichová, Zuzanna; Šormová, Kateřina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2017
BASE
Show details
15
FicTree 1.0
Jelínek, Tomáš; Hnátková, Milena; Skoumalová, Hana. - : Charles University, Faculty of Arts, Institute of Theoretical and Computational Linguistics, 2017
BASE
Show details
16
Universal Dependencies 2.1
Nivre, Joakim; Agić, Željko; Ahrenberg, Lars. - : Universal Dependencies Consortium, 2017
BASE
Show details
17
SYN v4: large corpus of written Czech
Křen, Michal; Cvrček, Václav; Čapka, Tomáš. - : Charles University, Faculty of Arts, Institute of the Czech National Corpus, 2016
BASE
Show details
18
SYN2015: representative corpus of written Czech
Křen, Michal; Cvrček, Václav; Čapka, Tomáš. - : Faculty of Arts, Institute of the Czech National Corpus, Charles University in Prague, 2015
BASE
Show details
19
AKCES 5 (CzeSL-SGT) Release 2
BASE
Show details
20
SYN2013PUB: corpus of written Czech newspapers
Křen, Michal; Hnátková, Milena; Jelínek, Tomáš. - : Faculty of Arts, Institute of the Czech National Corpus, Charles University in Prague, 2014
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
1
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
27
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern