Home Catalogue search

eng

Refine your search:
- Keyword:
- Creator / Publisher:
- Year:
  - 2021 (2)
  - 2019 (3)
  - 2018 (4)
  - 2017 (7)
  - 2016 (2)
- Medium:
  - Online (18)
- Type:
  - Article (18)
- BLLDB-Access:
  - free (18)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 18 of 18

1	Choice of plausible alternatives dataset in Croatian COPA-HR
	Ljubešić, Nikola. - : Jožef Stefan Institute, 2021
	BASE
	Show details

2	Slovenian Twitter hate speech dataset IMSyPP-sl
	Kralj Novak, Petra; Mozetič, Igor; Ljubešić, Nikola. - : Jožef Stefan Institute, 2021
	BASE
	Show details

3	Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.1
	Ljubešić, Nikola; Erjavec, Tomaž; Batanović, Vuk. - : Jožef Stefan Institute, 2019
	BASE
	Show details

4	CMC training corpus Janes-Tag 2.1
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2019
	BASE
	Show details

5	Serbian Twitter training corpus ReLDI-NormTagNER-sr 2.1
	Ljubešić, Nikola; Erjavec, Tomaž; Batanović, Vuk. - : Jožef Stefan Institute, 2019
	BASE
	Show details

6	Training corpus SETimes.SR 1.0
	Batanović, Vuk; Ljubešić, Nikola; Samardžić, Tanja. - : Regional Linguistic Data Initiative Centre ReLDI, 2018
	BASE
	Show details

7	Bilingual terminology extraction dataset KAS-biterm 1.0
	Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2018
	BASE
	Show details

8	Terminology identification dataset KAS-term 1.0
	Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2018
	BASE
	Show details

9	Training corpus hr500k 1.0
	Ljubešić, Nikola; Agić, Željko; Klubička, Filip. - : Jožef Stefan Institute, 2018
	BASE
	Show details

10	CMC training corpus Janes-Tag 2.0
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2017
	BASE
	Show details

11	Croatian Twitter training corpus ReLDI-NormTag-hr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

12	Serbian Twitter training corpus ReLDI-NormTag-sr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

13	Croatian Twitter training corpus ReLDI-NormTag-hr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

14	Serbian Twitter training corpus ReLDI-NormTagNER-sr 2.0
	Ljubešić, Nikola; Erjavec, Tomaž; Miličević, Maja. - : Jožef Stefan Institute, 2017
	BASE
	Show details

15	Serbian Twitter training corpus ReLDI-NormTag-sr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip; Erjavec, Tomaž; Miličević, Maja; Vuković, Teodora. - : Jožef Stefan Institute, 2017
	Abstract: ReLDI-NormTag-sr 1.1 is a manually annotated corpus of Serbian tweets. It is meant as a gold-standard training and testing dataset for tokenisation, sentence segmentation, word normalisation, morphosyntactic tagging and lemmatisation of non-standard Serbian. Each tweet is also annotated for its automatically assigned standardness levels (T = technical standardness, L = linguistic standardness). As an update to version 1.0, 1.1 corrects some minor errors. The corpus construction is (partially) described in: MILIČEVIĆ, Maja, LJUBEŠIĆ, Nikola. Tviterasi, tviteraši or twitteraši? Producing and analysing a normalised dataset of Croatian and Serbian tweets. Slovenščina 2.0: empirical, applied and interdisciplinary research, 4/2, 2016. ISSN 2335-2736. http://dx.doi.org/10.4312/slo2.0.2016.2.156-188
	Keyword: computer-mediated communication; lemmatisation; manual annotation; tagging; TEI; tokenisation; word normalisation
	URL: http://hdl.handle.net/11356/1120
	BASE
	Hide details

16	Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.0
	Ljubešić, Nikola; Erjavec, Tomaž; Miličević, Maja. - : Jožef Stefan Institute, 2017
	BASE
	Show details

17	Dataset of normalised Slovene text KonvNormSl 1.0
	Ljubešić, Nikola; Zupan, Katja; Fišer, Darja. - : Jožef Stefan Institute, 2016
	BASE
	Show details

18	CMC training corpus Janes-Tag 1.2
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2016
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern