45 |
Corpus extraction tool LIST 1.2
|
|
Krsnik, Luka; Arhar Holdt, Špela; Čibej, Jaka. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Faculty of Computer and Information Science, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
|
|
BASE
|
|
Show details
|
|
49 |
Frequency lists of character-level n-grams from the Gigafida 2.0 corpus
|
|
|
|
BASE
|
|
Show details
|
|
51 |
Developmental corpus (without language corrections) Šolar 2.0 Clear
|
|
Kosem, Iztok; Arhar Holdt, Špela; Stritar Kučuk, Mojca; Krek, Simon; Krapš Vodopivec, Irena; Stabej, Marko; Kocjančič, Polonca; Laskowski, Cyprian; Klemenc, Bojan; Pori, Eva; Rozman, Tadeja. - : Trojina, Institute for Applied Slovene Studies, 2019. : Centre for Language Resources and Technologies, University of Ljubljana, 2019
|
|
Abstract:
Šolar 2.0 Clear is an adapted version of the Šolar 2.0 corpus, cf. http://hdl.handle.net/11356/1214. The Šolar 2.0 Clear corpus consists of texts written by students in Slovene primary and secondary schools. School essays form the majority of the corpus while other material includes texts created during lessons, such as text recapitulations or descriptions, examples of formal applications etc. For each text, the information on school (elementary or secondary), subject, level (grade or year), type of text, region and date of production is provided. Unlike the original Šolar 2.0 corpus (http://hdl.handle.net/11356/1214), Šolar 2.0 Clear includes student texts only: error annotations and other types of feedback from the teachers have been removed. The corpus can thus be used for processing tasks where the inclusion of corrections hinders or complicates the procedures (e.g. for comparative data extraction, training of language models etc).
|
|
Keyword:
developmental corpus; student writing
|
|
URL: http://hdl.handle.net/11356/1219
|
|
BASE
|
|
Hide details
|
|
52 |
Frequency lists of word-level n-grams from the Gigafida 2.0 corpus
|
|
|
|
BASE
|
|
Show details
|
|
53 |
Frequency lists of word-level n-grams from the GOS 1.0 corpus
|
|
|
|
BASE
|
|
Show details
|
|
54 |
Frequency lists of character-level n-grams from the GOS 1.0 corpus
|
|
|
|
BASE
|
|
Show details
|
|
55 |
Corpus extraction tool LIST 1.0
|
|
Krsnik, Luka; Arhar Holdt, Špela; Čibej, Jaka. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Faculty of Computer and Information Science, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
|
|
BASE
|
|
Show details
|
|
60 |
The ELEXIS Interface for Interoperable Lexical Resources ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|