Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3

Hits 41 – 55 of 55

41	Frequency lists of words from the Gigafida 2.0 corpus
	Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
	BASE
	Show details

42	Avtomatsko pridobivanje besednih zvez iz korpusa z uporabo leksikona SSJ
	Arcan, Mihael; Arhar Holdt, Špela. - : Centre for Slovene as a Second and Foreign Language, Univerity of Ljubljana, 2019
	BASE
	Show details

43	Simplicity matters: user evaluation of the Slovene reference corpus [<Journal>]
	Arhar Holdt, Špela [Verfasser]; Dobrovoljc, Kaja [Sonstige]; Logar, Nataša [Sonstige]
	DNB Subject Category Language
	Show details

44	Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)
	Ramisch, Carlos; Cordeiro, Silvio Ricardo; Savary, Agata. - : PARSEME, 2018
	BASE
	Show details

45	Developmental corpus of Slovene (without language corrections) Šolar-Clear
	Rozman, Tadeja; Stritar Kučuk, Mojca; Kosem, Iztok. - : Trojina, Institute for Applied Slovene Studies, 2018
	BASE
	Show details

46	Training corpus ssj500k 2.1
	Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2018
	BASE
	Show details

47	Thesaurus of Modern Slovene 1.0
	Krek, Simon; Laskowski, Cyprian; Robnik-Šikonja, Marko. - : Centre for Language Resources and Technologies, University of Ljubljana, 2018
	BASE
	Show details

48	Terminology identification dataset KAS-term 1.0
	Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2018
	BASE
	Show details

49	Value of Language-Related Questions and Comments in Digital Media for Lexicographical User Research
	Arhar Holdt, Špela; Čibej, Jaka; Zwitter Vitez, Ana
	In: International Journal of Lexicography 30 (2017) 3, 285-308
	IDS OBELEX meta
	Show details

50	CMC training corpus Janes-Tag 2.0
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka; Arhar Holdt, Špela; Ljubešić, Nikola; Zupan, Katja. - : Jožef Stefan Institute, 2017
	Abstract: Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is meant as a gold-standard training and testing dataset for tokenisation, sentence segmentation, word normalisation, morphosyntactic tagging, lemmatisation and named entity annotation of non-standard Slovene. As the corpus has been carefully manually annotated, it is also suitable for detailed linguistic explorations which require highly accurate and reliable annotations. As an update to version 1.2, 2.0 corrects some minor errors and includes named entity annotation. A slightly older version of this corpus is described in: ERJAVEC, Tomaž, ČIBEJ, Jaka, ARHAR HOLDT, Špela, LJUBEŠIĆ, Nikola, FIŠER, Darja. Gold-standard datasets for annotation of Slovene computer-mediated communication. In Proceedings of RASLAN 2016: Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2016, pp. 29-40, https://nlp.fi.muni.cz/raslan/raslan16.pdf Note that a related corpus, Janes-Norm is also available, cf. http://hdl.handle.net/11356/1084.
	Keyword: computer-mediated communication; lemmatisation; manual annotation; named entities; tagging; TEI; tokenisation; word normalisation
	URL: http://hdl.handle.net/11356/1123
	BASE
	Hide details

51	CMC training corpus Janes-Syn 1.0
	Arhar Holdt, Špela; Erjavec, Tomaž; Fišer, Darja. - : Jožef Stefan Institute, 2017
	BASE
	Show details

52	Dictionary User Typology: The Slovenian Case
	Arhar Holdt, Špela; Kosem, Iztok; Gantar, Polona
	In: Proceedings of the 17th EURALEX International Congress: Lexicography and Linguistic Diversity. Tbilisi, Georgia 6 - 10 September 2016 (2016), 179-187
	IDS OBELEX meta
	Show details

53	CMC training corpus Janes-Tag 1.2
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2016
	BASE
	Show details

54	CMC training corpus Janes-Norm 1.2
	Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2016
	BASE
	Show details

55	Learners' corpus Šolar 1.0
	Rozman, Tadeja; Stritar Kučuk, Mojca; Kosem, Iztok. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
	BASE
	Show details

Page: 1 2 3

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern