4 |
Clitics in the wild : Empirical studies on the microvariation of the pronominal, reflexive and verbal clitics in Bosnian, Croatian and Serbian ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Historical development and contemporary usage of discourse structuring elements based on verba dicendi in Croatian
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0
|
|
|
|
BASE
|
|
Show details
|
|
10 |
24sata news comment dataset 1.0
|
|
|
|
Abstract:
The dataset of user comments provided for research purposes for the EMBEDDIA, a Horizon 2020 project, extracted from the database of user comments from the 24sata.hr news portal. The 24sata.hr is the largest-circulation daily newspaper in Croatia, reaching on average 2 million readers daily. The dataset provides the comments metadata including the link to the relevant article, the ID of the comment author (anonymized), and timestamp. The comments are also labelled if they are blocked by human moderators. Description of the Datasets. The 24sata dataset consists of 11 columns and 21548192 rows. Each row represents one user comment on the 24sata news portal. Comments are added by registered users below the published news article. Columns: 'comment_id' - The internal id of the comment. Unique for each row. 'user_id' - The internal id of the user writing the comment. Unique for each user. '0' for all blocked comments. 'content' - The content (text) of the user comment. 'site' - The site the comment came from. 'reply_to_id' - The 'comment_id' of the parent comment - if this comment was intended as a reply. 'created_date' - The date the comment was created. 'last_change' - The date the comment was last edited. 'article_id' - A public id of the article where this comment was posted. The article itself can be accessed by appending article_id to the site. So an article with article_id 614684 and site 'www.24sata.hr' can be found on 'www.24sata.hr/a-614684'. (note the added 'a-' before the article name) 'infringed_on_rule' - If the user has infringed on rules with this comment, id of the rule is given. The description of the rules is given below. 'like_counts' - A number of times other users have voted in favour of this comment, similar to the Like button. 'dislike_counts' - A number of times other users have voted against this comment, opposite of the Like button.
|
|
Keyword:
comment moderation; croatian comment moderation; news comments; offensive language
|
|
URL: http://hdl.handle.net/11356/1399
|
|
BASE
|
|
Hide details
|
|
11 |
Keyword extraction datasets for Croatian, Estonian, Latvian and Russian 1.0
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Multilingual comparable corpora of parliamentary debates ParlaMint 2.0
|
|
|
|
BASE
|
|
Show details
|
|
14 |
The semantic profile of the verbal prefix do- in Bulgarian and Croatian ; Семантический профиль глагольного префикса до- в болгарском и хорватском языках
|
|
|
|
In: Slověne = Словѣне. International Journal of Slavic Studies; Vol 10, No 2 (2021); 252-276 ; 2305-6754 ; 2304-0785 (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Glottolog 4.4 Resources for Croatian Standard
|
|
: Max Planck Institute for Evolutionary Anthropology, 2021
|
|
BASE
|
|
Show details
|
|
16 |
Factors contributing to prefixation of biaspectual verbs in Croatian Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Factors contributing to prefixation of biaspectual verbs in Croatian Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
English as a Lingua Franca (ELF): Croatian L1 Students' Perspectives ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
English as a Lingua Franca (ELF): Croatian L1 Students' Perspectives ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Mental simulation of the illusory and the factual in negation processing ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|