DE eng

Search in the Catalogues and Directories

Hits 1 – 19 of 19

1
AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples ...
BASE
Show details
2
Emergent Communication Pretraining for Few-Shot Machine Translation ...
Li, Yaoyiran; Ponti, Edoardo; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
3
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ...
Ponti, Edoardo; Glavaš, Goran; Majewska, Olga. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
4
Emergent Communication Pretraining for Few-Shot Machine Translation ...
BASE
Show details
5
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity ...
BASE
Show details
6
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity ...
Lauscher, Anne; Vulic, Ivan; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
7
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity ...
Vulic, Ivan; Baker, Simon; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
8
Probing Pretrained Language Models for Lexical Semantics ...
Vulic, Ivan; Ponti, Edoardo; Litschko, Robert. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
9
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity
Lauscher, Anne; Vulic, Ivan; Ponti, Edoardo. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.118, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
BASE
Show details
10
Probing Pretrained Language Models for Lexical Semantics
Vulic, Ivan; Ponti, Edoardo; Litschko, Robert. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 2020
BASE
Show details
11
Emergent Communication Pretraining for Few-Shot Machine Translation
Vulic, Ivan; Ponti, Edoardo; Korhonen, Anna; Li, Yaoyiran. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.416, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
Abstract: While state-of-the-art models that rely upon massively multilingual pretrained encoders achieve sample efficiency in downstream applications, they still require abundant amounts of unlabelled text. Nevertheless, most of the world’s languages lack such resources. Hence, we investigate a more radical form of unsupervised knowledge transfer in the absence of linguistic data. In particular, for the first time we pretrain neural networks via emergent communication from referential games. Our key assumption is that grounding communication on images—as a crude approximation of real-world environments—inductively biases the model towards learning natural languages. On the one hand, we show that this substantially benefits machine translation in few-shot settings. On the other hand, this also provides an extrinsic evaluation protocol to probe the properties of emergent languages ex vitro. Intuitively, the closer they are to natural languages, the higher the gains from pretraining on them should be. For instance, in this work we measure the influence of communication success and maximum sequence length on downstream performances. Finally, we introduce a customised adapter layer and annealing strategies for the regulariser of maximum-a-posteriori inference during fine-tuning. These turn out to be crucial to facilitate knowledge transfer and prevent catastrophic forgetting. Compared to a recurrent baseline, our method yields gains of 59.0%∼147.6% in BLEU score with only 500 NMT training instances and 65.1%∼196.7% with 1, 000 NMT training instances across four language pairs. These proof-of-concept results reveal the potential of emergent communication pretraining for both natural language processing tasks in resource-poor settings and extrinsic evaluation of artificial languages.
URL: https://doi.org/10.17863/CAM.62217
https://www.repository.cam.ac.uk/handle/1810/315110
BASE
Hide details
12
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Liu, Qianchu; Korhonen, Anna-Leena; Majewska, Olga. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 2020
BASE
Show details
13
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing ...
Ponti, Edoardo; O'Horan, Helen; Berzak, Yevgeni. - : Apollo - University of Cambridge Repository, 2019
BASE
Show details
14
Specialising Distributional Vectors of All Words for Lexical Entailment ...
Kamath, Aishwarya; Pfeiffer, Jonas; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2019
BASE
Show details
15
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing
Reichart, Roi; Shutova, Ekaterina; Korhonen, Anna-Leena. - : MIT Press - Journals, 2019. : COMPUTATIONAL LINGUISTICS, 2019
BASE
Show details
16
Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP ...
Ponti, Edoardo; Reichart, Roi; Korhonen, Anna-Leena. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
17
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction ...
Gerz, Daniela; Vulić, Ivan; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
18
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction
Gerz, Daniela; Vulić, Ivan; Ponti, Edoardo. - : MIT Press - Journals, 2018. : Transactions of the Association for Computational Linguistics, 2018
BASE
Show details
19
Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP
Vulic, Ivan; Ponti, Edoardo; Reichart, Roi. - : Association for Computational Linguistics, 2018. : Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), 2018
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
19
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern