DE eng

Search in the Catalogues and Directories

Hits 1 – 16 of 16

1
Delving Deeper into Cross-lingual Visual Question Answering ...
BASE
Show details
2
Semantic Data Set Construction from Human Clustering and Spatial Arrangement ...
Majewska, Olga; McCarthy, Diana; Van Den Bosch, Jasper JF. - : Apollo - University of Cambridge Repository, 2021
BASE
Show details
3
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
In: ISSN: 0891-2017 ; EISSN: 1530-9312 ; Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02975786 ; Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2020, 46 (4), pp.847-897 ; https://direct.mit.edu/coli/article/46/4/847/97326/Multi-SimLex-A-Large-Scale-Evaluation-of (2020)
BASE
Show details
4
Multidirectional Associative Optimization of Function-Specific Word Representations ...
Gerz, Daniela; Vulic, Ivan; Rei, Marek. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
5
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis ...
Majewska, Olga; Vulic, Ivan; McCarthy, Diana. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
6
Emergent Communication Pretraining for Few-Shot Machine Translation ...
Li, Yaoyiran; Ponti, Edoardo; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
7
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity ...
Lauscher, Anne; Vulic, Ivan; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
8
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction ...
Karan, Mladen; Vulic, Ivan; Korhonen, Anna. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
9
Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces ...
Vulic, Ivan; Korhonen, Anna; Glavas, Goran. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
10
Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces
Vulic, Ivan; Korhonen, Anna; Glavas, Goran. - : 5TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2020), 2020
BASE
Show details
11
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity
Lauscher, Anne; Vulic, Ivan; Ponti, Edoardo. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.118, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
BASE
Show details
12
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis
Majewska, Olga; Vulic, Ivan; McCarthy, Diana. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.423, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
BASE
Show details
13
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction
Karan, Mladen; Vulic, Ivan; Korhonen, Anna. - : Association for Computational Linguistics, 2020
BASE
Show details
14
Emergent Communication Pretraining for Few-Shot Machine Translation
Vulic, Ivan; Ponti, Edoardo; Korhonen, Anna; Li, Yaoyiran. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.416, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
Abstract: While state-of-the-art models that rely upon massively multilingual pretrained encoders achieve sample efficiency in downstream applications, they still require abundant amounts of unlabelled text. Nevertheless, most of the world’s languages lack such resources. Hence, we investigate a more radical form of unsupervised knowledge transfer in the absence of linguistic data. In particular, for the first time we pretrain neural networks via emergent communication from referential games. Our key assumption is that grounding communication on images—as a crude approximation of real-world environments—inductively biases the model towards learning natural languages. On the one hand, we show that this substantially benefits machine translation in few-shot settings. On the other hand, this also provides an extrinsic evaluation protocol to probe the properties of emergent languages ex vitro. Intuitively, the closer they are to natural languages, the higher the gains from pretraining on them should be. For instance, in this work we measure the influence of communication success and maximum sequence length on downstream performances. Finally, we introduce a customised adapter layer and annealing strategies for the regulariser of maximum-a-posteriori inference during fine-tuning. These turn out to be crucial to facilitate knowledge transfer and prevent catastrophic forgetting. Compared to a recurrent baseline, our method yields gains of 59.0%∼147.6% in BLEU score with only 500 NMT training instances and 65.1%∼196.7% with 1, 000 NMT training instances across four language pairs. These proof-of-concept results reveal the potential of emergent communication pretraining for both natural language processing tasks in resource-poor settings and extrinsic evaluation of artificial languages.
URL: https://doi.org/10.17863/CAM.62217
https://www.repository.cam.ac.uk/handle/1810/315110
BASE
Hide details
15
Multidirectional Associative Optimization of Function-Specific Word Representations
Gerz, Daniela; Vulic, Ivan; Rei, Marek. - : Association for Computational Linguistics, 2020. : 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020
BASE
Show details
16
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing
In: https://hal.archives-ouvertes.fr/hal-01856176 ; 2018 (2018)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
16
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern