DE eng

Search in the Catalogues and Directories

Hits 1 – 19 of 19

1
The Janes project: language resources and tools for Slovene user generated content [<Journal>]
Fišer, Darja [Verfasser]; Ljubešić, Nikola [Sonstige]; Erjavec, Tomaž [Sonstige]
DNB Subject Category Language
Show details
2
Universal Dependencies 2.2
In: https://hal.archives-ouvertes.fr/hal-01930733 ; 2018 (2018)
BASE
Show details
3
Universal Dependencies 2.3
Nivre, Joakim; Abrams, Mitchell; Agić, Željko. - : Universal Dependencies Consortium, 2018
BASE
Show details
4
Universal Dependencies 2.2
Nivre, Joakim; Abrams, Mitchell; Agić, Željko. - : Universal Dependencies Consortium, 2018
BASE
Show details
5
Dictionary of Twitterese Janes-Dict 1.0
Gantar, Polona; Škrjanec, Iza; Fišer, Darja. - : Faculty of Arts, University of Ljubljana, 2018
BASE
Show details
6
English-Montenegrin parallel corpus of subtitles Opus-MontenegrinSubs 1.0
Božović, Petar; Erjavec, Tomaž; Tiedemann, Jörg. - : Jožef Stefan Institute, 2018
BASE
Show details
7
Training corpus SETimes.SR 1.0
Batanović, Vuk; Ljubešić, Nikola; Samardžić, Tanja; Erjavec, Tomaž. - : Regional Linguistic Data Initiative Centre ReLDI, 2018
Abstract: The SETimes.SR training corpus contains 86 726 tokens manually annotated on the levels of tokenisation, sentence segmentation, morphosyntactic tagging, lemmatisation, syntactic dependencies, and named entities. The annotations (and other aspects) of the corpus are documented in the teiHeader and back element of the TEI encoded corpus. In short, they follow (1) the MULTEXT-East V5 morphosyntactic specifications, http://nl.ijs.si/ME/V5/msd/, (2) the UDv2 Guidelines, http://universaldependencies.org/guidelines.html, and (3) the Janes annotation guidelines for named entities, http://nl.ijs.si/janes/wp-content/uploads/2017/09/SlovenianNER-eng-v1.1.pdf.
Keyword: dependency treebank; manual annotation; named entities; parsing; part-of-speech tagging; TEI; tokenisation
URL: http://hdl.handle.net/11356/1200
BASE
Hide details
8
Spoken corpus Gos VideoLectures 3.0 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2018
BASE
Show details
9
Automatically constructed multiword lexicon slMWELex v0.5
Ljubešić, Nikola; Krek, Simon; Dobrovoljc, Kaja. - : Jožef Stefan Institute, 2018
BASE
Show details
10
Dataset and baseline model of moderated content FRENK-MMC-RTV 1.0
Ljubešić, Nikola; Erjavec, Tomaž; Fišer, Darja. - : Jožef Stefan Institute, 2018
BASE
Show details
11
JRC EU DGT Translation Memory Parsebank DGT-UD 1.0
Ljubešić, Nikola; Erjavec, Tomaž. - : Jožef Stefan Institute, 2018
BASE
Show details
12
Training corpus ssj500k 2.1
Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2018
BASE
Show details
13
Word embeddings CLARIN.SI-embed.sl 1.0
Ljubešić, Nikola; Erjavec, Tomaž. - : Jožef Stefan Institute, 2018
BASE
Show details
14
Bilingual terminology extraction dataset KAS-biterm 1.0
Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2018
BASE
Show details
15
Terminology identification dataset KAS-term 1.0
Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2018
BASE
Show details
16
Croatian language corpus Riznica 0.1
Brozović Rončević, Dunja; Ćavar, Damir; Ćavar, Małgorzata. - : Institute of Croatian Language and Linguistics, 2018
BASE
Show details
17
Training corpus hr500k 1.0
Ljubešić, Nikola; Agić, Željko; Klubička, Filip. - : Jožef Stefan Institute, 2018
BASE
Show details
18
Dataset and baseline model of moderated content FRENK-STYRIA-24sata 1.0
Ljubešić, Nikola; Erjavec, Tomaž; Fišer, Darja. - : Jožef Stefan Institute, 2018
BASE
Show details
19
hr500k – A Reference Training Corpus of Croatian.
In: Conference papers (2018)
BASE
Show details

Catalogues
0
0
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
18
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern