1 |
PROST: Physical Reasoning about Objects through Space and Time ...
|
|
|
|
BASE
|
|
|
|
2 |
Don't Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data ...
|
|
|
|
Abstract:
High-performing machine translation (MT) systems can help overcome language barriers while making it possible for everyone to communicate and use language technologies in the language of their choice. However, such systems require large amounts of parallel sentences for training, and translators can be difficult to find and expensive. Here, we present a data collection strategy for MT which, in contrast, is cheap and simple, as it does not require bilingual speakers. Based on the insight that humans pay specific attention to movements, we use graphics interchange formats (GIFs) as a pivot to collect parallel sentences from monolingual annotators. We use our strategy to collect data in Hindi, Tamil and English. As a baseline, we also collect data using images as a pivot. We perform an intrinsic evaluation by manually evaluating a subset of the sentence pairs and an extrinsic evaluation by finetuning mBART (Liu et al., 2020) on the collected ...
Read paper: https://www.aclanthology.org/2021.acl-short.139
|
|
Keyword:
Computational Linguistics; Condensed Matter Physics; Deep Learning; Electromagnetism; FOS Physical sciences; Information and Knowledge Engineering; Neural Network; Semantics
|
|
URL: https://dx.doi.org/10.48448/4fa8-4846
https://underline.io/lecture/25805-don't-rule-out-monolingual-speakers-a-method-for-crowdsourcing-machine-translation-data
|
|
BASE
|
|
|
|
4 |
How to Adapt Your Pretrained Multilingual Model to 1600 Languages ...
|
|
|
|
BASE
|
|
|
|
5 |
Acquisition of Inflectional Morphology in Artificial Neural Networks With Prior Knowledge
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2020)
|
|
BASE
|
|
|
|
|
|