6 |
Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers
|
|
Glavas, Goran; Agic, Zeljko; Vulic, Ivan. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.345, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
11 |
Training corpus hr500k 1.0
|
|
|
|
Abstract:
The hr500k training corpus contains about 500,000 tokens manually annotated on the levels of tokenisation, sentence segmentation, morphosyntactic tagging, lemmatisation and named entities. About half of the corpus is also manually annotated with syntactic dependencies. Furthermore, about a fifth of the corpus is annotated with semantic role labels. The annotations (and other aspects) of the hr500k corpus are documented in the teiHeader and back element of the TEI encoded corpus. In short, they follow (1) the MULTEXT-East V5 morphosyntactic specifications for Croatian, http://nl.ijs.si/ME/V5/msd/, (2) the UDv2 Guidelines, http://universaldependencies.org/guidelines.html, and (3) the Janes annotation guidelines for named entities, http://nl.ijs.si/janes/wp-content/uploads/2017/09/SlovenianNER-eng-v1.1.pdf, while (4) the semantic role labelling annotation guidelines are currently in the publication process.
|
|
Keyword:
dependency treebank; manual annotation; named entities; parsing; part-of-speech tagging; semantic role labelling; TEI; tokenisation
|
|
URL: http://hdl.handle.net/11356/1183
|
|
BASE
|
|
Hide details
|
|
12 |
hr500k – A Reference Training Corpus of Croatian.
|
|
|
|
In: Conference papers (2018)
|
|
BASE
|
|
Show details
|
|
13 |
Parsing Universal Dependencies without training
|
|
|
|
In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, ; EACL 2017 - 15th Conference of the European Chapter of the Association for Computational Linguistics ; https://hal.inria.fr/hal-01677405 ; EACL 2017 - 15th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2017, Valencia, Spain. pp.229 - 239 ; http://eacl2017.org/ (2017)
|
|
BASE
|
|
Show details
|
|
16 |
Universal Dependencies 2.0 – CoNLL 2017 Shared Task Development and Test Data
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages
|
|
|
|
In: Samardžić, Tanja; Starović, Mirjana; Agić, Željko; Ljubešić, Nikola (2017). Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages. In: Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, Valencia, Spain, 4 April 2017, Association for Computational Linguistic. (2017)
|
|
BASE
|
|
Show details
|
|
20 |
Multilingual Projection for Parsing Truly Low-Resource Languageš
|
|
|
|
In: EISSN: 2307-387X ; Transactions of the Association for Computational Linguistics ; https://hal.inria.fr/hal-01426754 ; Transactions of the Association for Computational Linguistics, The MIT Press, 2016 (2016)
|
|
BASE
|
|
Show details
|
|
|
|