DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4
Hits 41 – 60 of 71

41
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction ...
Gerz, Daniela; Vulić, Ivan; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
42
Injecting Lexical Contrast into Word Vectors by Guiding Vector Space Specialisation ...
Vulic, Ivan; Korhonen, Anna-Leena; Linguist, Assoc Computat. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
43
Investigating the cross-lingual translatability of VerbNet-style classification. ...
Majewska, Olga; Vulić, Ivan; McCarthy, Diana. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
44
Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources ...
Vulic, Ivan; Glavaš, Goran; Mrkšić, Nikola. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
45
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction
Gerz, Daniela; Vulić, Ivan; Ponti, Edoardo; Naradowsky, Jason; Reichart, Roi; Korhonen, Anna-Leena. - : MIT Press - Journals, 2018. : Transactions of the Association for Computational Linguistics, 2018
Abstract: Neural architectures are prominent in the construction of language models (LMs). However, word-level prediction is typically agnostic of subword-level information (characters and character sequences) and operates over a closed vocabulary, consisting of a limited word set. Indeed, while subword-aware models boost performance across a variety of NLP tasks, previous work did not evaluate the ability of these models to assist next-word prediction in language modeling tasks. Such subword-level informed models should be particularly effective for morphologically-rich languages (MRLs) that exhibit high type-to-token ratios. In this work, we present a large-scale LM study on 50 typologically diverse languages covering a wide variety of morphological systems, and offer new LM benchmarks to the community, while considering subword-level information. The main technical contribution of our work is a novel method for injecting subword-level information into semantic word vectors, integrated into the neural language modeling training, to facilitate word-level prediction. We conduct experiments in the LM setting where the number of infrequent words is large, and demonstrate strong perplexity gains across our 50 languages, especially for morphologically-rich languages. Our code and data sets are publicly available. ; This work is supported by the ERC Consolidator Grant LEXICAL (648909)
URL: https://www.repository.cam.ac.uk/handle/1810/279936
https://doi.org/10.17863/CAM.27304
BASE
Hide details
46
Investigating the cross-lingual translatability of VerbNet-style classification.
Majewska, Olga; Vulić, Ivan; McCarthy, Diana. - : Springer Science and Business Media LLC, 2018. : Lang Resour Eval, 2018
BASE
Show details
47
Injecting Lexical Contrast into Word Vectors by Guiding Vector Space Specialisation
Vulic, Ivan; Korhonen, Anna-Leena; Linguist, Assoc Computat. - : REPRESENTATION LEARNING FOR NLP, 2018
BASE
Show details
48
Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine.
Chiu, Billy; Pyysalo, Sampo; Vulić, Ivan. - : BioMed Central, 2018. : BMC bioinformatics, 2018
BASE
Show details
49
Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine.
Chiu, Billy; Pyysalo, Sampo; Vulić, Ivan. - : Springer Science and Business Media LLC, 2018. : BMC Bioinformatics, 2018
BASE
Show details
50
Dependency parsing of learner English
Huang, Yan; Murakami, Akira; Alexopoulou, Dora. - : John Benjamins Publishing Company, 2018. : International Journal of Corpus Linguistics, 2018
BASE
Show details
51
Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP
Vulic, Ivan; Ponti, Edoardo; Reichart, Roi. - : Association for Computational Linguistics, 2018. : Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), 2018
BASE
Show details
52
Bio-SimVerb
BASE
Show details
53
Acquiring verb classes through bottom-up semantic verb clustering
Majewska, Olga; McCarthy, D; Vulić, I. - : LREC 2018 - 11th International Conference on Language Resources and Evaluation, 2018
BASE
Show details
54
Automatic Selection of Context Configurations for Improved Class-Specific Word Representations ...
Vulić, Ivan; Schwartz, Roy; Rappoport, Ari. - : Apollo - University of Cambridge Repository, 2017
BASE
Show details
55
Decoding sentiment from distributed representations of sentences ...
Ponti, Edoardo; Vulić, I; Korhonen, Anna-Leena. - : Apollo - University of Cambridge Repository, 2017
BASE
Show details
56
HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment ...
Vulić, I; Gerz, D; Kiela, D. - : Apollo - University of Cambridge Repository, 2017
BASE
Show details
57
Morph-fitting: Fine-tuning word vector spaces with simple language-specific rules ...
Vulic, Ivan; Mrkšic, N; Reichart, R. - : Apollo - University of Cambridge Repository, 2017
BASE
Show details
58
Decoding sentiment from distributed representations of sentences
Ponti, Edoardo; Vulić, I; Korhonen, Anna-Leena. - : Association for Computational Linguistics, 2017. : *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings, 2017
BASE
Show details
59
HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment
Vulić, I; Gerz, D; Kiela, D. - : MIT Press, 2017. : Computational Linguistics, 2017
BASE
Show details
60
Event-Related Features in Feedforward Neural Networks Contribute to Identifying Causal Relations in Discourse
Ponti, Edoardo; Korhonen, Anna-Leena. - : LSDSem 2017 - 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-Level Semantics, Proceedings of the Workshop, 2017
BASE
Show details

Page: 1 2 3 4

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
71
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern