DE eng

Search in the Catalogues and Directories

Hits 1 – 19 of 19

1
Universal Dependencies 1.2
Nivre, Joakim; Agić, Željko; Aranzabe, Maria Jesus. - : Universal Dependencies Consortium, 2015
BASE
Show details
2
Morphological lexicon Sloleks 1.0
Dobrovoljc, Kaja; Krek, Simon; Holozan, Peter. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
BASE
Show details
3
Spoken corpus Gos 1.0
Zwitter Vitez, Ana; Zemljarič Miklavčič, Jana; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
BASE
Show details
4
Written corpus ccGigafida 1.0
Logar, Nataša; Erjavec, Tomaž; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
BASE
Show details
5
MULTEXT-East free lexicons 4.0
Erjavec, Tomaž; Bruda, Ştefan; Derzhanski, Ivan. - : Jožef Stefan Institute, 2015
BASE
Show details
6
Training corpus jos1M 1.1
Erjavec, Tomaž; Krek, Simon. - : Jožef Stefan Institute, 2015
BASE
Show details
7
Reference corpus of historical Slovene goo300k 1.2
Erjavec, Tomaž. - : Jožef Stefan Institute, 2015
Abstract: goo300k is a manually annotated reference corpus of historical Slovene. It contains 1,100 pages (about 300,000 tokens) sampled from 89 texts from the period 1584-1899. Each text contains extensive meta-data and per-page links to facsimiles, while the word tokens in the texts are annotated with their modernised word-form, lemma, part-of-speech, and, for archaic words, their nearest modern synonyms or short explanation. The corpus is available in source TEI P5 XML and in the simpler and smaller vertical format, used by various concordancers. Note that the vertical format does not contain all the information from the source TEI.
Keyword: historical language; lemmatisation; manual annotation; part-of-speech tagging; TEI; word modernisation
URL: http://hdl.handle.net/11356/1025
BASE
Hide details
8
Morphological lexicon Sloleks 1.2
Dobrovoljc, Kaja; Krek, Simon; Holozan, Peter. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
BASE
Show details
9
MULTEXT-East "1984" document corpus 4.0
Erjavec, Tomaž; Bruda, Ştefan; Dimitrova, Ludmila. - : Jožef Stefan Institute, 2015
BASE
Show details
10
MULTEXT-East non-commercial lexicons 4.0
Erjavec, Tomaž; Derzhanski, Ivan; Divjak, Dagmar. - : Jožef Stefan Institute, 2015
BASE
Show details
11
Written corpus ccKres 1.0
Logar, Nataša; Erjavec, Tomaž; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
BASE
Show details
12
Lexicon of historical Slovene imp25k 1.1
Erjavec, Tomaž. - : Jožef Stefan Institute, 2015
BASE
Show details
13
Training corpus ssj500k 1.3
Krek, Simon; Erjavec, Tomaž; Dobrovoljc, Kaja. - : Centre for Language Resources and Technologies, University of Ljubljana, 2015
BASE
Show details
14
Digital library and corpus of historical Slovene IMP 1.1
Erjavec, Tomaž. - : Jožef Stefan Institute, 2015
BASE
Show details
15
MULTEXT-East "1984" annotated corpus 4.0
Erjavec, Tomaž; Barbu, Ana-Maria; Derzhanski, Ivan. - : Jožef Stefan Institute, 2015
BASE
Show details
16
Japanese web corpus with difficulty levels jpWaC-L 1.0
Erjavec, Tomaž; Hmeljak Sangawa, Kristina; Kawamura, Yoshiko. - : Jožef Stefan Institute, 2015
BASE
Show details
17
Main results of MONDILEX project
In: Cognitive Studies | Études cognitives; No 11 (2011); 265-290 ; 2392-2397 (2015)
BASE
Show details
18
MONDILEX – towards the research infrastructure for digital resources in Slavic lexicography
In: Cognitive Studies | Études cognitives; No 10 (2010); 147-162 ; 2392-2397 (2015)
BASE
Show details
19
The Japanese-Slovene dictionary jaSlo: its development, enhancement and use
In: Cognitive Studies | Études cognitives; No 10 (2010); 203-216 ; 2392-2397 (2015)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
19
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern