85 |
Dataset and baseline model of moderated content FRENK-STYRIA-24sata 1.0
BASE
86 |
hr500k – A Reference Training Corpus of Croatian.
In: Conference papers (2018)
Abstract:
In this paper we present hr500k, a Croatian reference training corpus of 500 thousand tokens, segmented at document, sentence and word level, and annotated for morphosyntax, lemmas, dependency syntax, named entities, and semantic roles. We present each annotation layer via basic label statistics and describe the final encoding of the resource in CoNLL and TEI formats. We also give a description of the rather turbulent history of the resource and give insights into the topic and genre distribution in the corpus. Finally, we discuss further enrichments of the corpus with additional layers, which are already underway.
Keyword:
annotation; computational linguistics; Croatian; Digital Humanities; linguistic resource; machine learning; reference corpus; Slavic Languages and Societies
URL: https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1254&context=scschcomcon https://arrow.tudublin.ie/scschcomcon/244
BASE
88 |
Integrating corpora of computer-mediated communication into the language resources landscape: Initiatives and best practices from French, German, Italian and Slovenian projects
DNB Subject Category Language
90 |
TEI-Lex0 guidelines for the encoding of dictionary information on written and spoken forms
In: Electronic Lexicography in the 21st Century: Proceedings of ELex 2017 Conference, Sep 2017, Leiden, Netherlands (2017) ; https://hal.inria.fr/hal-01757108
BASE
91 |
Universal Dependencies 2.1
In: https://hal.inria.fr/hal-01682188 ; 2017 (2017)
BASE
92 |
Closing a gap in the language resources landscape: Groundwork and best practices from projects on computer-mediated communication in four European countries.
In: CLARIN Annual Conference 2016, Oct 2016, Aix-en-Provence, France. Selected papers from the CLARIN Annual Conference 2016, Linköping Electronic Conference Proceedings 136, pp. 1-19, ISBN 978-91-7685-499-0 (2017) ; https://hal.archives-ouvertes.fr/hal-01379621 ; http://www.ep.liu.se/ecp/contents.asp?issue=136
BASE
95 |
Universal Dependencies 2.0 – CoNLL 2017 Shared Task Development and Test Data
BASE