DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Universal Segmentations 1.0 (UniSegments 1.0)
Abstract: Universal Segmentations (UniSegments) is a collection of lexical resources capturing morphological segmentations harmonised into a cross-linguistically consistent annotation scheme for many languages. The annotation scheme consists of simple tab-separated columns that stores a word and its morphological segmentations, including pieces of information about the word and the segmented units, e.g., part-of-speech categories, type of morphs/morphemes etc. The current public version of the collection contains 38 harmonised segmentation datasets covering 30 different languages.
Keyword: Armenian language; Bengali language; Catalan language; Croatian language; Czech language; English language; Erzya language; Finnish language; French language; German language; Hindi language; Hungarian language; Italian language; Kannada language; Komi-Zyrian language; Latin language; Malayalam language; Marathi language; Mari (Russia) language; Moksha language; Mongolian language; morph; morphemes; morphological dictionary; morphological segmentation; morphology; multilingual; Persian language; Polish language; Portuguese language; Russian language; segmentation; Serbo-Croatian language; Spanish language; Swedish language; Tajik language; Udmurt language; unisegments; universal segmentations; word segmentation
URL: http://hdl.handle.net/11234/1-4629
BASE
Hide details
2
DeriNet 2.1
Vidra, Jonáš; Žabokrtský, Zdeněk; Kyjánek, Lukáš. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern