Home Catalogue search

eng

Refine your search:
- Keyword:
- Creator / Publisher:
- Year:
  - 2018 (1)
  - 2017 (8)
  - 2016 (2)
  - 2015 (1)
- Medium:
  - Online (12)
- Type:
  - Article (12)
- BLLDB-Access:
  - free (12)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 12 of 12

1	Training corpus ssj500k 2.1
	Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2018
	BASE
	Show details

2	CMC training corpus Janes-Tag 2.0
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2017
	BASE
	Show details

3	Croatian Twitter training corpus ReLDI-NormTag-hr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

4	Serbian Twitter training corpus ReLDI-NormTag-sr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

5	Croatian Twitter training corpus ReLDI-NormTag-hr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

6	Serbian Twitter training corpus ReLDI-NormTagNER-sr 2.0
	Ljubešić, Nikola; Erjavec, Tomaž; Miličević, Maja. - : Jožef Stefan Institute, 2017
	BASE
	Show details

7	Training corpus ssj500k 2.0
	Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2017
	BASE
	Show details

8	Serbian Twitter training corpus ReLDI-NormTag-sr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

9	Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.0
	Ljubešić, Nikola; Erjavec, Tomaž; Miličević, Maja. - : Jožef Stefan Institute, 2017
	BASE
	Show details

10	Training corpus ssj500k 1.4
	Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž; Može, Sara; Ledinek, Nina; Holz, Nanika. - : Centre for Language Resources and Technologies, University of Ljubljana, 2016
	Abstract: The ssj500k training corpus contains 500,000 words, manually annotated on the levels of tokenization, sentence segmentation, morphosyntactic tagging, lemmatisation, named entities, and, partially, syntactic dependencies. The ssj500k corpus uses the MULTEXT-East / JOS morphosyntactic tagset and the JOS dependency schema and is based on the jos100k and jos1M corpora. Note that this entry updates ssj500k 1.3 by fixing many annotation errors.
	Keyword: dependency treebank; manual annotation; named entities; parsing; tagging; TEI; tokenisation
	URL: http://hdl.handle.net/11356/1052
	BASE
	Hide details

11	CMC training corpus Janes-Tag 1.2
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2016
	BASE
	Show details

12	Training corpus ssj500k 1.3
	Krek, Simon; Erjavec, Tomaž; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern