124 |
Written corpus ccKres 1.0
Abstract:
The ccKres corpus consists of 9,376 documents, each annotated with its source (e.g. newspapers, magazines), year of publication, text type (fiction, newspaper), and, where known, its title and author. The corpus is POS-tagged and lemmatised, and is encoded in XML following the TEI P5 (Text Encoding Initiative) guidelines. ccKres contains approximately 9% of Kres, a balanced corpus of Slovene: http://eng.slovenscina.eu/korpusi/kres.
|
|
Keyword:
TEI
|
|
URL: http://hdl.handle.net/11356/1034
|
|
BASE
|
|
126 |
Cross-lingual Dependency Parsing of Related Languages with Rich Morphosyntactic Tagsets
BASE
|
|
127 |
Slovene Lexical Database
In: Natural Language Processing, Multilinguality. Sixth International Conference, Modra, Slovakia, 20 - 21 October 2011 (2011), 72-80
|
|
IDS OBELEX meta