DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
MULTEXT-East free lexicons 4.0
Erjavec, Tomaž; Bruda, Ştefan; Derzhanski, Ivan. - : Jožef Stefan Institute, 2015
BASE
Show details
2
MULTEXT-East "1984" annotated corpus 4.0
Abstract: The novel "1984" by George Orwell is the central component of the MULTEXT-East corpus. This parallel and sentence aligned corpus contains the novel in the English original (about 100,000 words in length), and its translations into a number of languages. This version of the corpus contains the linguistically annotated texts, with each word tagged by its lemma and its MULTEXT(-East) morphosyntactic description (MSD, i.e., a fine-grained feature-structure based PoS tag). The structurally annotated texts are a separate submission (http://hdl.handle.net/11356/1044), also with somewhat different languages.
Keyword: manual annotation; multilingual; parallel corpus; part-of-speech tagging; Slavic languages; TEI
URL: http://hdl.handle.net/11356/1043
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern