Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher:
- Year
- Medium
- Type
- BLLDB-Access:
  - free (4)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 4 of 4

1	One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia ...
	Aji, Alham Fikri; Winata, Genta Indra; Koto, Fajri. - : arXiv, 2022
	BASE
	Show details

2	IndoNLI: A Natural Language Inference Dataset for Indonesian ...
	Mahendra, Rahmad; Aji, Alham Fikri; Louvan, Samuel; Rahman, Fahrurrozi; Vania, Clara. - : arXiv, 2021
	Abstract: We present IndoNLI, the first human-elicited NLI dataset for Indonesian. We adapt the data collection protocol for MNLI and collect nearly 18K sentence pairs annotated by crowd workers and experts. The expert-annotated data is used exclusively as a test set. It is designed to provide a challenging test-bed for Indonesian NLI by explicitly incorporating various linguistic phenomena such as numerical reasoning, structural changes, idioms, or temporal and spatial reasoning. Experiment results show that XLM-R outperforms other pre-trained models in our data. The best performance on the expert-annotated data is still far below human performance (13.4% accuracy gap), suggesting that this test set is especially challenging. Furthermore, our analysis shows that our expert-annotated data is more diverse and contains fewer annotation artifacts than the crowd-annotated data. We hope this dataset can help accelerate progress in Indonesian NLP research. ... : Accepted at EMNLP 2021 main conference ...
	Keyword: Computation and Language cs.CL; FOS Computer and information sciences
	URL: https://dx.doi.org/10.48550/arxiv.2110.14566 https://arxiv.org/abs/2110.14566
	BASE
	Hide details

3	IndoNLI: A Natural Language Inference Dataset for Indonesian ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Aji, Alham Fikri; Louvan, Samuel. - : Underline Science Inc., 2021
	BASE
	Show details

4	Multi-Task Active Learning for Neural Semantic Role Labeling on Low Resource Conversational Corpus ...
	Ikhwantri, Fariz; Louvan, Samuel; Kurniawan, Kemal. - : arXiv, 2018
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern