Search in the Catalogues and Directories

Page: 1 2
Hits 21 – 26 of 26

21
CMC training corpus Janes-Syn 1.0
Arhar Holdt, Špela; Erjavec, Tomaž; Fišer, Darja. - : Jožef Stefan Institute, 2017
BASE
22
Universal Dependencies 1.4
Nivre, Joakim; Agić, Željko; Ahrenberg, Lars. - : Universal Dependencies Consortium, 2016
BASE
23
Universal Dependencies 1.3
Nivre, Joakim; Agić, Željko; Ahrenberg, Lars. - : Universal Dependencies Consortium, 2016
BASE
24
Training corpus ssj500k 1.4
Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2016
BASE
25
Universal Dependencies 1.2
Nivre, Joakim; Agić, Željko; Aranzabe, Maria Jesus. - : Universal Dependencies Consortium, 2015
BASE
26
Training corpus ssj500k 1.3
Krek, Simon; Erjavec, Tomaž; Dobrovoljc, Kaja; Može, Sara; Ledinek, Nina; Holz, Nanika. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
Abstract: The ssj500k training corpus is based on two training corpora built within the JOS project (http://nl.ijs.si/jos/). It contains the jos100k corpus and additional material from the jos1M corpus, forming a training corpus of 500,000 words that has been manually checked and annotated on the levels of tokenization, segmentation, morphosyntactic tagging, syntactic dependency parsing and named entities. The ssj500k corpus uses the JOS morphosyntactic tagset with 1,902 tags and dependencies with 10 labels. The part of the corpus annotated with dependency relations contains 11,411 sentences; named entities are annotated in the original jos100k part of the corpus.
Keyword: dependency treebank; manual annotation; named entities; parsing; tagging; TEI; tokenisation
URL: http://hdl.handle.net/11356/1029
BASE

© 2013 - 2024 Lin|gu|is|tik