
Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Corpus of Written Standard Slovene Gigafida 2.0
Krek, Simon; Erjavec, Tomaž; Repar, Andraž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2021
BASE
2
Brexit stance annotated tweets
Grčar, Miha; Cherepnalkoski, Darko; Mozetič, Igor. - : Jožef Stefan Institute, 2017
BASE
3
Dataset of European Parliament roll-call votes and Twitter activities MEP 1.0
Cherepnalkoski, Darko; Karpf, Andreas; Mozetič, Igor. - : Jožef Stefan Institute, 2016
BASE
4
Twitter sentiment for 15 European languages
Mozetič, Igor; Grčar, Miha; Smailović, Jasmina. - : Jožef Stefan Institute, 2016
BASE
5
Multilingual Twitter Sentiment Classification: The Role of Human Annotators ...
BASE
6
Written corpus ccGigafida 1.0
Logar, Nataša; Erjavec, Tomaž; Krek, Simon; Grčar, Miha; Holozan, Peter. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
Abstract: The ccGigafida corpus consists of paragraph samples from 31,722 documents, each with information about the source (e.g. newspapers, magazines), year of publication, text type (fiction, newspaper), and the title and author if they are known. The corpus is annotated with morphosyntactic descriptions (PoS-tagged) and lemmatised. It is encoded in XML TEI format (Text Encoding Initiative P5). The ccGigafida corpus contains approximately 9% of the Gigafida corpus, a reference corpus of Slovene: http://eng.slovenscina.eu/korpusi/gigafida. The corpus is available in source TEI-like XML and in the simpler and smaller vertical format used by various concordancers. The XML file has PoS (MSD) tags in Slovenian only, while the vertical file has tags in both Slovenian and English. The corpus is also available as plain text, one file per text.
Keyword: TEI
URL: http://hdl.handle.net/11356/1035
BASE
7
Written corpus ccKres 1.0
Logar, Nataša; Erjavec, Tomaž; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
BASE
8
Stream-based active learning for sentiment analysis in the financial domain
In: Information sciences. - New York, NY : Elsevier Science Inc. 285 (2014), 181-203
OLC Linguistik
9
Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources
Popović, Marko; Štefančić, Hrvoje; Sluban, Borut. - : Public Library of Science, 2014
BASE
