1 |
The Orange workflow for observing collocation trends ColTrend 1.0
BASE
3 |
Slovene ontology of semantic types for nouns SLONEST-noun 1.0
6 |
Multiword Expressions lexicon extracted from the Gigafida 2.1 corpus
7 |
The Orange workflow for observing collocation clusters ColEmbed 1.0
9 |
Frequency lists of collocations from the Gigafida 2.1 corpus
11 |
Language Teachers and Crowdsourcing: Insights from a Cross-European Survey
In: Rasprave Instituta za hrvatski jezik i jezikoslovlje, 2020, 46 (1), pp. 1-28. ISSN: 1331-6745; EISSN: 1849-0379. DOI: 10.31724/rihjj.46.1.1. https://hal.inria.fr/hal-02974069
14 |
Frequency lists of character-level n-grams from the GOS 1.0 corpus 1.1
Abstract:
Frequency lists of character-level n-grams were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227). The lists contain character n-grams of length 1 to 5 that occur in the corpus, along with their absolute and relative frequencies, percentages, and distribution across the text types in the corpus taxonomy. Character-level n-grams were extracted from lemmas (5 files), lower-case word forms (5 files), and standardized word forms (5 files). Compared to the previous version (http://hdl.handle.net/11356/1268), this version fixes several typos and replaces all instances of "normalized forms" with the more appropriate term "standardized forms" (as used in the SSJ project).
Keyword:
characters; frequency list; n-grams; Slovenian language; spoken corpus
URL: http://hdl.handle.net/11356/1363
19 |
Frequency lists of word-level n-grams from the GOS 1.0 corpus 1.1