Hits 1 – 9 of 9

1	Corpus of Written Standard Slovene Gigafida 2.0
	Krek, Simon; Erjavec, Tomaž; Repar, Andraž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2021
	BASE

2	Brexit stance annotated tweets
	Grčar, Miha; Cherepnalkoski, Darko; Mozetič, Igor. - : Jožef Stefan Institute, 2017
	BASE

3	Dataset of European Parliament roll-call votes and Twitter activities MEP 1.0
	Cherepnalkoski, Darko; Karpf, Andreas; Mozetič, Igor. - : Jožef Stefan Institute, 2016
	BASE

4	Twitter sentiment for 15 European languages
	Mozetič, Igor; Grčar, Miha; Smailović, Jasmina. - : Jožef Stefan Institute, 2016
	BASE

5	Multilingual Twitter Sentiment Classification: The Role of Human Annotators ...
	Mozetic, Igor; Grcar, Miha; Smailovic, Jasmina. - : arXiv, 2016
	BASE

6	Written corpus ccGigafida 1.0
	Logar, Nataša; Erjavec, Tomaž; Krek, Simon; Grčar, Miha; Holozan, Peter. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
	Abstract: The ccGigafida corpus consists of paragraph samples from 31,722 documents, each with information about the source (e.g. newspapers, magazines), year of publication, text type (e.g. fiction, newspaper), and the title and author where known. The corpus is annotated with morphosyntactic descriptions (PoS-tagged) and lemmatised. It is encoded in XML TEI format (Text Encoding Initiative P5). The ccGigafida corpus contains approximately 9% of the Gigafida corpus, a reference corpus of Slovene: http://eng.slovenscina.eu/korpusi/gigafida. The corpus is available in source TEI-like XML and in the simpler and smaller vertical format used by various concordancers. The XML file has PoS (MSD) tags in Slovenian only, while the vertical file has tags in both Slovenian and English. The corpus is also available as plain text, one file per text.
	Keyword: TEI
	URL: http://hdl.handle.net/11356/1035
	BASE

7	Written corpus ccKres 1.0
	Logar, Nataša; Erjavec, Tomaž; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
	BASE

8	Stream-based active learning for sentiment analysis in the financial domain
	Smailović, Jasmina; Grčar, Miha; Lavrač, Nada...
	In: Information sciences. - New York, NY : Elsevier Science Inc. 285 (2014), 181-203
	OLC Linguistik

9	Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
	Popović, Marko; Štefančić, Hrvoje; Sluban, Borut. - : Public Library of Science, 2014
	BASE

© 2013 - 2024 Lin|gu|is|tik