1 |
Morphological Processing of Low-Resource Languages: Where We Are and What's Next ...
BASE
2 |
Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Underdocumented Languages ...
3 |
Translating the Unseen? Yoruba-English MT in Low-Resource, Morphologically-Unmarked Settings ...
5 |
Do RNN States Encode Abstract Phonological Alternations? ...
7 |
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection ...
8 |
One Model to Pronounce Them All: Multilingual Grapheme-to-Phoneme Conversion With a Transformer Ensemble ...
9 |
Noise Isn't Always Negative: Countering Exposure Bias in Sequence-to-Sequence Inflection Models ...
10 |
UniMorph 3.0: Universal Morphology
Kirov, Christo; Nicolai, Garrett; Arkhangelskij, Timofey; Ernštreits, Valts; Nidhi, Amrit; Krizhanovsky, Natalya; Krizhanovsky, Andrew; Cotterell, Ryan; Mansfield, John; Vylomova, Ekaterina; Grella, Matteo; Pinter, Yuval; Xia, Patrick; McCarthy, Arya D.; Klyachko, Elena; Gorman, Kyle; Mielke, Sabrina J.; Hulden, Mans; Yarowsky, David; Jacobs, Cassandra L.; Silfverberg, Miikka; Sorokin, Alexey
In: Proceedings of the 12th Language Resources and Evaluation Conference (2020)
Abstract:
The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological paradigms for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. We have implemented several improvements to the extraction pipeline which creates most of our data, so that it is both more complete and more correct. We have added 66 new languages, as well as new parts of speech for 12 languages. We have also amended the schema in several ways. Finally, we present three new community tools: two to validate data for resource creators, and one to make morphological data available from the command line. UniMorph is based at the Center for Language and Speech Processing (CLSP) at Johns Hopkins University in Baltimore, Maryland. This paper details advances made to the schema, tooling, and dissemination of project resources since the UniMorph 2.0 release described at LREC 2018.
Keyword:
lexical database; morphology; multilinguality
URL: https://hdl.handle.net/20.500.11850/462327 https://doi.org/10.3929/ethz-b-000462327
13 |
The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection ...
15 |
Marrying Universal Dependencies and Universal Morphology ...
16 |
Sound Analogies with Phoneme Embeddings
In: Proceedings of the Society for Computation in Linguistics (2018)