DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7...10
Hits 41 – 60 of 191

41
SimLex-999 Slovenian translation SimLex-999-sl 1.0
Pollak, Senja; Vulić, Ivan; Pelicon, Andraž. - : University of Ljubljana, 2021
BASE
Show details
42
The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs
In: Behav Res Methods (2021)
BASE
Show details
43
The KAS corpus of Slovenian academic writing [<Journal>]
Erjavec, Tomaž [Verfasser]; Fišer, Darja [Verfasser]; Ljubešić, Nikola [Verfasser]
DNB Subject Category Language
Show details
44
Universal Dependencies 2.7
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2020
BASE
Show details
45
Universal Dependencies 2.6
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2020
BASE
Show details
46
The CLASSLA-StanfordNLP model for lemmatisation of standard Macedonian 1.0
Ljubešić, Nikola; Zdravkova, Katerina; Erjavec, Tomaž. - : Jožef Stefan Institute, 2020
BASE
Show details
47
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.0
Ljubešić, Nikola; Zdravkova, Katerina; Stojanoska, Sanja. - : Jožef Stefan Institute, 2020
BASE
Show details
48
Semantic hypergraph corpus SemCRO 1.0
Vasić, Daniel; Žitko, Branko; Gašpar, Angelina. - : University of Mostar, 2020. : University of Split, 2020. : Jožef Stefan Institute, 2020
BASE
Show details
49
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Serbian 1.1
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
50
The CLASSLA-StanfordNLP model for JOS dependency parsing of standard Slovenian 1.0
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
51
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Croatian 1.1
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
52
Multilingual comparable corpora of parliamentary debates ParlaMint 1.0
Abstract: ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starting at the end of 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in the corpora are marked as belonging to the COVID-19 period (after October 2019), or being "reference" (before that date). The corpora have extensive meta-data about the speakers (name, gender, party affiliation, MP status), are structured into time-stamped terms, sessions and meetings, with each speech being marked by its speaker and their role (chair, regular speaker). The speeches also contain marked-up transcriber comments, such as gaps in the transcription, interruptions, applause, etc. The corpora are encoded according to the Parla-CLARIN TEI recommendation, but have been validated to the compatible but much stricter ParlaMint schemas. The schemas are included in the distribution, along with scripts to convert the corpora into other formats. The ZIP files with the TEI encoded corpora also include the automatically derived plain text version of the corpus, along with metadata on the speeches. In addition to the ParlaMint TEI encoded corpora, their linguistically encoded variants (".ana") are also available. The annotation includes named entities, lemmatisation, part-of-speech tagging, and morphological features and syntactic parses according to the Universal Dependencies recommendations. State-of-the-art tools have been used to perform the annotations. The .ana.zip corpora include the ParlaMint encoded XML, as well as derived formats, in particular, CoNLL-U and vertical files.
Keyword: Bulgarian Parliament; COVID-19; Croatian Parliament; Parla-CLARIN; parliamentary debates; Polish Parliament; Slovenian Parliament; TEI
URL: http://hdl.handle.net/11356/1345
BASE
Hide details
53
Word embeddings CLARIN.SI-embed.mk 0.1
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
54
The CLASSLA-StanfordNLP model for named entity recognition of standard Slovenian 1.0
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
55
The CLASSLA-StanfordNLP model for named entity recognition of non-standard Croatian 1.0
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
56
The CLASSLA-StanfordNLP model for lemmatisation of standard Slovenian 1.1
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
57
The CLASSLA-StanfordNLP model for named entity recognition of standard Croatian 1.0
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
58
The CLASSLA-StanfordNLP model for lemmatisation of standard Serbian 1.2
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
59
The CLASSLA-StanfordNLP model for lemmatisation of standard Serbian 1.1
Ljubešić, Nikola. - : Jožef Stefan Institute, 2020
BASE
Show details
60
The CLASSLA-StanfordNLP model for lemmatisation of non-standard Serbian 1.1
Ljubešić, Nikola; Štefanec, Vanja. - : Jožef Stefan Institute, 2020
BASE
Show details

Page: 1 2 3 4 5 6 7...10

Catalogues
0
0
0
0
5
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
1
0
0
1
Open access documents
183
0
2
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern