Page: 1 2 3 4 5 6 7 8 9... 29
81 |
Literal readings of multiword expressions: as scarce as hen's teeth
|
|
|
|
In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16), Jan 2018, Prague, Czech Republic ; https://hal.archives-ouvertes.fr/hal-01694995 ; Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16), Jan 2018, Prague, Czech Republic, Jan 2018, Prague, Czech Republic. pp.64 - 72 (2018)
|
|
BASE
|
|
Show details
|
|
82 |
Establishing a Language by Annotating a Corpus ; Establishing a Language by Annotating a Corpus: The Case of Naija, a Post-creole Spoken in Nigeria
|
|
|
|
In: annDH 2018 Annotation in Digital Humanities ; https://halshs.archives-ouvertes.fr/halshs-01958330 ; annDH 2018 Annotation in Digital Humanities, Aug 2018, Sofia, Bulgaria. pp.7-11 ; http://ceur-ws.org/Vol-2155/ (2018)
|
|
BASE
|
|
Show details
|
|
83 |
An Experiment on Plato’s Gorgias as an Introduction to Textometry
|
|
|
|
In: Classics@ ; https://halshs.archives-ouvertes.fr/halshs-01730373 ; Classics@, Center for Hellenic Studies/Harvard University, In press, Digital text analysis in Classics ; https://chs.harvard.edu/CHS/article/display/1167.classics-introduction-to-journal (2018)
|
|
BASE
|
|
Show details
|
|
90 |
CoNLL 2018 Shared Task - UDPipe Baseline Models and Supplementary Materials
|
|
Straka, Milan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2018
|
|
BASE
|
|
Show details
|
|
93 |
Training corpus hr500k 1.0
|
|
|
|
Abstract:
The hr500k training corpus contains about 500,000 tokens manually annotated on the levels of tokenisation, sentence segmentation, morphosyntactic tagging, lemmatisation and named entities. About half of the corpus is also manually annotated with syntactic dependencies. Furthermore, about a fifth of the corpus is annotated with semantic role labels. The annotations (and other aspects) of the hr500k corpus are documented in the teiHeader and back element of the TEI encoded corpus. In short, they follow (1) the MULTEXT-East V5 morphosyntactic specifications for Croatian, http://nl.ijs.si/ME/V5/msd/, (2) the UDv2 Guidelines, http://universaldependencies.org/guidelines.html, and (3) the Janes annotation guidelines for named entities, http://nl.ijs.si/janes/wp-content/uploads/2017/09/SlovenianNER-eng-v1.1.pdf, while (4) the semantic role labelling annotation guidelines are currently in the publication process.
|
|
Keyword:
dependency treebank; manual annotation; named entities; parsing; part-of-speech tagging; semantic role labelling; TEI; tokenisation
|
|
URL: http://hdl.handle.net/11356/1183
|
|
BASE
|
|
Hide details
|
|
94 |
Introduction. The added value of diachronic treebanks for historical linguistics
|
|
|
|
BASE
|
|
Show details
|
|
95 |
Challenges in Converting the Index Thomisticus Treebank into Universal Dependencies
|
|
|
|
BASE
|
|
Show details
|
|
96 |
Discovering syntactic phenomena with and within precision grammars
|
|
|
|
BASE
|
|
Show details
|
|
97 |
Lemmatising Treebanks. Corpus Annotation with Knowledge Bases
|
|
|
|
In: RAEL: revista electrónica de lingüística aplicada, ISSN 1885-9089, null 17, Nº. 1, 2018, pags. 99-120 (2018)
|
|
BASE
|
|
Show details
|
|
100 |
Universal Dependencies 2.1
|
|
|
|
In: https://hal.inria.fr/hal-01682188 ; 2017 (2017)
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9... 29
|
|