1 |
Universal Segmentations 1.0 (UniSegments 1.0)
|
|
Žabokrtský, Zdeněk; Bafna, Nyati; Bodnár, Jan; Kyjánek, Lukáš; Svoboda, Emil; Ševčíková, Magda; Vidra, Jonáš; Angle, Sachi; Ansari, Ebrahim; Arkhangelskiy, Timofey; Batsuren, Khuyagbaatar; Bella, Gábor; Bertinetto, Pier Marco; Bonami, Olivier; Celata, Chiara; Daniel, Michael; Fedorenko, Alexei; Filko, Matea; Giunchiglia, Fausto; Haghdoost, Hamid; Hathout, Nabil; Khomchenkova, Irina; Khurshudyan, Victoria; Levonian, Dmitri; Litta, Eleonora; Medvedeva, Maria; Muralikrishna, S. N.; Namer, Fiammetta; Nikravesh, Mahshid; Padó, Sebastian; Passarotti, Marco; Plungian, Vladimir; Polyakov, Alexey; Potapov, Mihail; Pruthwik, Mishra; Rao B, Ashwath; Rubakov, Sergei; Samar, Husain; Sharma, Dipti Misra; Šnajder, Jan; Šojat, Krešimir; Štefanec, Vanja; Talamo, Luigi; Tribout, Delphine; Vodolazsky, Daniil; Vydrin, Arseniy; Zakirova, Aigul; Zeller, Britta. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2022
|
|
Abstract:
Universal Segmentations (UniSegments) is a collection of lexical resources capturing morphological segmentations harmonised into a cross-linguistically consistent annotation scheme for many languages. The annotation scheme consists of simple tab-separated columns that stores a word and its morphological segmentations, including pieces of information about the word and the segmented units, e.g., part-of-speech categories, type of morphs/morphemes etc. The current public version of the collection contains 38 harmonised segmentation datasets covering 30 different languages.
|
|
Keyword:
Armenian language; Bengali language; Catalan language; Croatian language; Czech language; English language; Erzya language; Finnish language; French language; German language; Hindi language; Hungarian language; Italian language; Kannada language; Komi-Zyrian language; Latin language; Malayalam language; Marathi language; Mari (Russia) language; Moksha language; Mongolian language; morph; morphemes; morphological dictionary; morphological segmentation; morphology; multilingual; Persian language; Polish language; Portuguese language; Russian language; segmentation; Serbo-Croatian language; Spanish language; Swedish language; Tajik language; Udmurt language; unisegments; universal segmentations; word segmentation
|
|
URL: http://hdl.handle.net/11234/1-4629
|
|
BASE
|
|
Hide details
|
|
4 |
Members of the Polish Language Council on the Problems of Linguistic Diversity and Linguistic Inclusion in Poland
|
|
|
|
In: Social Inclusion ; 9 ; 1 ; 63-74 ; Social Inclusion and Multilingualism: The Impact of Linguistic Justice, Economy of Language and Language Policy (2022)
|
|
BASE
|
|
Show details
|
|
5 |
W Sejmie : Ślōnskiego języka nie ma, ale może być etnolekt ; In the Polish Parliamentthe Silesian language does not exist, but the Silesian ethnolect may
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Niemieckie zaniechania ; The German minority leadership's resignations from securing this monority's cultural and linguistic rights in postcommunist Poland
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Der Altersfaktor beim fortgeschrittenen Zweitspracherwerb : Die Wortstellung im Deutschen bei polnisch-deutsch bilingualen Kindern
|
|
|
|
BLLDB
|
|
UB Frankfurt Linguistik
|
|
Show details
|
|
12 |
Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Multilingual comparable corpora of parliamentary debates ParlaMint 2.0
|
|
|
|
BASE
|
|
Show details
|
|
16 |
WALS Online Resources for Polish
|
|
: Max Planck Institute for Evolutionary Anthropology, 2021
|
|
BASE
|
|
Show details
|
|
17 |
‘Our cat has the power’: the polysemy of a third language in maintaining the power/solidarity equilibrium in family interactions
|
|
|
|
BASE
|
|
Show details
|
|
18 |
The Dialect of the Siberian Hollanders: Materials from Field Research in 2015 ; Говор сибирских голендров по материалам экспедиции 2015 г.
|
|
|
|
In: Slověne = Словѣне. International Journal of Slavic Studies; Vol 10, No 1 (2021); 425-449 ; 2305-6754 ; 2304-0785 (2021)
|
|
BASE
|
|
Show details
|
|
19 |
Glottolog 4.4 Resources for Polish Sign Language
|
|
: Max Planck Institute for Evolutionary Anthropology, 2021
|
|
BASE
|
|
Show details
|
|
20 |
Glottolog 4.4 Resources for Polish
|
|
: Max Planck Institute for Evolutionary Anthropology, 2021
|
|
BASE
|
|
Show details
|
|
|
|