Hits 21 – 26 of 26

21	CMC training corpus Janes-Syn 1.0
	Arhar Holdt, Špela; Erjavec, Tomaž; Fišer, Darja. - : Jožef Stefan Institute, 2017
	BASE

22	Universal Dependencies 1.4
	Nivre, Joakim; Agić, Željko; Ahrenberg, Lars. - : Universal Dependencies Consortium, 2016
	BASE

23	Universal Dependencies 1.3
	Nivre, Joakim; Agić, Željko; Ahrenberg, Lars. - : Universal Dependencies Consortium, 2016
	BASE

24	Training corpus ssj500k 1.4
	Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2016
	BASE

25	Universal Dependencies 1.2
	Nivre, Joakim; Agić, Željko; Aranzabe, Maria Jesus. - : Universal Dependencies Consortium, 2015
	BASE

26	Training corpus ssj500k 1.3
	Krek, Simon; Erjavec, Tomaž; Dobrovoljc, Kaja; Može, Sara; Ledinek, Nina; Holz, Nanika. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
	Abstract: The ssj500k training corpus is based on two training corpora built within the JOS project (http://nl.ijs.si/jos/). It contains the jos100k corpus and additional material from the jos1M corpus, forming a training corpus of 500,000 words, manually checked and annotated on the levels of tokenization, segmentation, morphosyntactic tagging, syntactic dependency parsing and named entities. The ssj500k corpus uses the JOS morphosyntactic tagset with 1,902 tags and dependencies with 10 labels. The part of the corpus annotated with dependency relations contains 11,411 sentences; named entities are annotated in the original jos100k part of the corpus.
	Keyword: dependency treebank; manual annotation; named entities; parsing; tagging; TEI; tokenisation
	URL: http://hdl.handle.net/11356/1029
	BASE
