43 |
Universal Dependencies 2.0 – CoNLL 2017 Shared Task Development and Test Data
|
|
|
|
BASE
|
|
|
|
47 |
Deltacorpus 1.1
|
|
|
|
Abstract:
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia). Changes in version 1.1: 1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset. 2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0. 3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by a classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
|
|
Keyword:
cross-language; part of speech; semi-supervised; tagging
|
|
URL: http://hdl.handle.net/11234/1-1743
|
|
BASE
|
|
|
|
52 |
Gibbs Sampling Segmentation of Parallel Dependency Trees for Tree-Based Machine Translation
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics, Vol 105, Iss 1, Pp 101-110 (2016)
|
|
BASE
|
|