
Search in the Catalogues and Directories

Hits 41 – 60 of 160

41
Training corpus ssj500k 2.2
Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
42
Frequency lists of word parts from the Gigafida 2.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
43
Morphological lexicon Sloleks 2.0
Dobrovoljc, Kaja; Krek, Simon; Holozan, Peter. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
44
Frequency lists of words from the GOS 1.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
Abstract: Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227). The lists contain all words occurring in the corpus along with their absolute and relative frequencies, percentages, and distribution across the text-types included in the corpus taxonomy. The lists were extracted for each part-of-speech category. For each part-of-speech, two lists were extracted: 1) one containing lemmas and their text-type distribution, 2) one containing lower-case word forms as well as their normalized forms, lemmas, and morphosyntactic tags along with their text-type distribution. In addition, four lists were extracted from all words (regardless of their part-of-speech category): 1) a list of all lemmas along with their part-of-speech category and text-type distribution; 2) a list of all lower-case word forms with their lemmas, part-of-speech categories, and text-type distribution; 3) a list of all lower-case word forms with their normalized word forms, lemmas, part-of-speech categories, and text-type distribution; 4) a list of all morphosyntactic tags and their text-type distribution (the tags are also split into several columns).
Keyword: frequency list; lemmas; normalized forms; Slovenian language; spoken corpus; words
URL: http://hdl.handle.net/11356/1269
BASE
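As a rough illustration of the kind of list the abstract above describes (lemmas with absolute frequencies, percentages, and text-type distribution), a minimal Python sketch follows. The function name `lemma_frequency_list`, the toy tokens, and the text-type labels are all hypothetical; this is not the LIST tool's implementation or its output format.

```python
from collections import Counter, defaultdict


def lemma_frequency_list(tokens):
    """Build a frequency list from (lemma, text_type) pairs.

    Returns one row per lemma, sorted by descending absolute
    frequency and then alphabetically by lemma.
    """
    totals = Counter()               # lemma -> absolute frequency
    by_type = defaultdict(Counter)   # lemma -> text type -> count
    for lemma, text_type in tokens:
        totals[lemma] += 1
        by_type[lemma][text_type] += 1

    n_tokens = sum(totals.values())
    rows = []
    for lemma, absolute in sorted(totals.items(), key=lambda kv: (-kv[1], kv[0])):
        rows.append({
            "lemma": lemma,
            "absolute": absolute,
            "percentage": 100.0 * absolute / n_tokens,
            "by_text_type": dict(by_type[lemma]),
        })
    return rows


# Hypothetical toy data: (lemma, text type) pairs, not taken from GOS.
toy_tokens = [
    ("biti", "public"), ("biti", "private"), ("biti", "private"),
    ("in", "public"), ("dan", "private"),
]

for row in lemma_frequency_list(toy_tokens):
    print(row)
```

The per-part-of-speech and word-form lists mentioned in the abstract would follow the same pattern with a different grouping key.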
45
Frequency lists of word parts from the GOS 1.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
46
Corpus extraction tool LIST 1.2
Krsnik, Luka; Arhar Holdt, Špela; Čibej, Jaka. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Faculty of Computer and Information Science, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
47
Developmental corpus Šolar 2.0
Kosem, Iztok; Arhar Holdt, Špela; Stritar Kučuk, Mojca. - : Trojina, Institute for Applied Slovene Studies, 2019. : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
48
Character-level part-of-speech tagger of Slovene language
Belej, Primož; Robnik-Šikonja, Marko; Krek, Simon. - : Faculty of Computer and Information Science, University of Ljubljana, 2019. : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
49
Collocations Dictionary of Modern Slovene KSSS 1.0
Kosem, Iztok; Gantar, Polona; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
50
Frequency lists of character-level n-grams from the Gigafida 2.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
51
Error-annotated developmental corpus Šolar 2.0 Error
Arhar Holdt, Špela; Goli, Teja; Lavrič, Polona. - : Trojina, Institute for Applied Slovene Studies, 2019. : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
52
Developmental corpus (without language corrections) Šolar 2.0 Clear
Kosem, Iztok; Arhar Holdt, Špela; Stritar Kučuk, Mojca. - : Trojina, Institute for Applied Slovene Studies, 2019. : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
53
Frequency lists of word-level n-grams from the Gigafida 2.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
54
Frequency lists of word-level n-grams from the GOS 1.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
55
Frequency lists of character-level n-grams from the GOS 1.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
56
Corpus extraction tool LIST 1.0
Krsnik, Luka; Arhar Holdt, Špela; Čibej, Jaka. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Faculty of Computer and Information Science, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
57
Frequency lists of words from the Gigafida 2.0 corpus
Čibej, Jaka; Arhar Holdt, Špela; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
58
Training corpus jos1M 1.2
Erjavec, Tomaž; Krek, Simon; Dobrovoljc, Kaja. - : Jožef Stefan Institute, 2019
BASE
59
The ELEXIS interface for interoperable lexical resources
BASE
60
Towards a Global Lexicographic Infrastructure ...
BASE


© 2013 - 2024 Lin|gu|is|tik