1 |
[Class elicitation session on noun subjects (incomplete)] ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
[Class elicitation session on lexicon and noun-of-noun constructions] ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Collaborative transcription in Australian Aboriginal communities
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Bootstrapping Techniques for Polysynthetic Morphological Analysis ...
|
|
|
|
Abstract:
Polysynthetic languages have exceptionally large and sparse vocabularies, thanks to the number of morpheme slots and combinations in a word. This complexity, together with a general scarcity of written data, poses a challenge to the development of natural language technologies. To address this challenge, we offer linguistically-informed approaches for bootstrapping a neural morphological analyzer, and demonstrate its application to Kunwinjku, a polysynthetic Australian language. We generate data from a finite state transducer to train an encoder-decoder model. We improve the model by "hallucinating" missing linguistic structure into the training data, and by resampling from a Zipf distribution to simulate a more natural distribution of morphemes. The best model accounts for all instances of reduplication in the test set and achieves an accuracy of 94.7% overall, a 10 percentage point improvement over the FST baseline. This process demonstrates the feasibility of bootstrapping a neural morph analyzer from ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2005.00956 https://arxiv.org/abs/2005.00956
|
|
BASE
|
|
Hide details
|
|
10 |
Enabling Interactive Transcription in an Indigenous Community ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Multidimensional Exploration of Online Linguistic Field Data
|
|
|
|
In: North East Linguistics Society (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Evaluating phonemic transcription of low-resource tonal languages for language documentation
|
|
|
|
In: LREC 2018 (Language Resources and Evaluation Conference) ; https://halshs.archives-ouvertes.fr/halshs-01709648 ; LREC 2018 (Language Resources and Evaluation Conference), May 2018, Miyazaki, Japan. pp.3356-3365 (2018)
|
|
BASE
|
|
Show details
|
|
14 |
Evaluating phonemic transcription of low-resource tonal languages for language documentation
|
|
|
|
In: LREC 2018 (Language Resources and Evaluation Conference) ; https://halshs.archives-ouvertes.fr/halshs-01709648 ; LREC 2018 (Language Resources and Evaluation Conference), May 2018, Miyazaki, Japan. pp.3356-3365 (2018)
|
|
BASE
|
|
Show details
|
|
15 |
Documenting Recipes
|
|
|
|
In: Fifth International Conference on Language Documentation and Conservation (ICLDC5) ; https://halshs.archives-ouvertes.fr/halshs-01514911 ; Fifth International Conference on Language Documentation and Conservation (ICLDC5), Mar 2017, Honolulu, United States ; http://icldc5.icldc-hawaii.org/ (2017)
|
|
BASE
|
|
Show details
|
|
16 |
Treasure Language Storytelling: Cross-cultural Language Recognition and Wellbeing
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Expressing language resource metadata as Linked Data: The case of the Open Language Archives Community
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Treasure Language Storytelling: Cross-cultural Language Recognition and Wellbeing
|
|
|
|
BASE
|
|
Show details
|
|
|
|