2 |
A morph-based and a word-based treebank for Beja
|
|
|
|
In: SyntaxFest ; TLT 2021 - 20th International Workshop on Treebanks and Linguistic Theories ; https://hal.archives-ouvertes.fr/hal-03494462 ; TLT 2021 - 20th International Workshop on Treebanks and Linguistic Theories, Mar 2022, Sofia, Bulgaria (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Generación de flexión morfológica con UniMorph.: Evaluación con base de datos relacional y pautas de entrenamiento
|
|
|
|
In: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 68, 2022, pags. 61-70 (2022)
|
|
BASE
|
|
Show details
|
|
9 |
A morph-based and a word-based treebank for Beja
|
|
|
|
In: SyntaxFest ; https://hal.archives-ouvertes.fr/hal-03494462 ; SyntaxFest, In press (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Old Catalan Morphosyntax: developing an annotated corpus
|
|
|
|
In: EISSN: 2059-481X ; Journal of Open Humanities Data ; https://hal.archives-ouvertes.fr/hal-03617737 ; Journal of Open Humanities Data, Ubiquity Press, 2021, 7, pp.30. ⟨10.5334/johd.54⟩ (2021)
|
|
Abstract:
International audience ; This paper presents a full procedure for the development of a Part-of-Speech (POS) tagged corpus of Old Catalan. As an extremely low-resource language with rich inflection and frequent homographs, Old Catalan poses non-trivial problems in the development of a searchable constituency-based treebank. We demonstrate, however, that a carefully designed, semi-supervised method of incrementally building training data using both neural and memory-based taggers together with the Pyrrha annotation tool is highly efficient and yields accurate results. We propose that this simple and effective method could easily be extended to other low-resource historical languages for which no NLP tools exist yet.
|
|
Keyword:
[SHS.LANGUE]Humanities and Social Sciences/Linguistics; Historical treebank; Old Catalan; POS tagging
|
|
URL: https://hal.archives-ouvertes.fr/hal-03617737 https://doi.org/10.5334/johd.54
|
|
BASE
|
|
Hide details
|
|
|
|