Clustering Monolingual Vocabularies to Improve Cross-Lingual Generalization ...
Abstract: Multilingual language models exhibit better performance for some languages than for others, and many languages do not seem to benefit from multilingual sharing at all, presumably as a result of poor multilingual segmentation. This work explores the idea of learning multilingual language models based on clustering of monolingual segments. We show significant improvements over standard multilingual segmentation across nine languages on a question answering task, both in a small model regime and for a model of the size of BERT-base. ...
Keywords: Computational Linguistics; Language Models; Machine Learning; Machine Learning and Data Mining; Natural Language Processing
URL: https://underline.io/lecture/39629-clustering-monolingual-vocabularies-to-improve-cross-lingual-generalization
https://dx.doi.org/10.48448/9x7f-w807
BASE