
Hits 1 – 9 of 9

1	RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Barikeri, Soumya; Glavaš, Goran. - : Underline Science Inc., 2021
	BASE

2	How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; ., Iryna; ., Sebastian. - : Underline Science Inc., 2021

3	Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; ., Nigel; Korhonen, Anna. - : Underline Science Inc., 2021

4	LexFit: Lexical Fine-Tuning of Pretrained Language Models ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Glavaš, Goran; Korhonen, Anna. - : Underline Science Inc., 2021

5	A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; ., Hinrich; Korhonen, Anna. - : Underline Science Inc., 2021

6	Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
	Vulic, Ivan; Baker, Simon; Ponti, Edoardo Maria...
	In: ISSN: 0891-2017 ; EISSN: 1530-9312 ; Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02975786 ; Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2020, 46 (4), pp.847-897 ; https://direct.mit.edu/coli/article/46/4/847/97326/Multi-SimLex-A-Large-Scale-Evaluation-of (2020)

7	A deep learning approach to bilingual lexicon induction in the biomedical domain. ...
	Heyman, Geert; Vulić, Ivan; Moens, Marie-Francine. - : Apollo - University of Cambridge Repository, 2018

8	A deep learning approach to bilingual lexicon induction in the biomedical domain.
	Heyman, Geert; Vulić, Ivan; Moens, Marie-Francine. - : Springer Science and Business Media LLC, 2018. : BMC Bioinformatics, 2018

9	Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine.
	Chiu, Billy; Pyysalo, Sampo; Vulić, Ivan; Korhonen, Anna-Leena. - : BioMed Central, 2018. : BMC bioinformatics, 2018
	Abstract: Background: Word representations support a variety of Natural Language Processing (NLP) tasks. The quality of these representations is typically assessed by comparing the distances in the induced vector spaces against human similarity judgements. Whereas comprehensive evaluation resources have recently been developed for the general domain, similar resources for biomedicine currently suffer from the lack of coverage, both in terms of word types included and with respect to the semantic distinctions. Notably, verbs have been excluded, although they are essential for the interpretation of biomedical language. Further, current resources do not discern between semantic similarity and semantic relatedness, although this has been proven as an important predictor of the usefulness of word representations and their performance in downstream applications. Results: We present two novel comprehensive resources targeting the evaluation of word representations in biomedicine. These resources, Bio-SimVerb and Bio-SimLex, address the previously mentioned problems, and can be used for evaluations of verb and noun representations respectively. In our experiments, we have computed the Pearson’s correlation between performances on intrinsic and extrinsic tasks using twelve popular state-of-the-art representation models (e.g. word2vec models). The intrinsic–extrinsic correlations using our datasets are notably higher than with previous intrinsic evaluation benchmarks such as UMNSRS and MayoSRS. In addition, when evaluating representation models for their abilities to capture verb and noun semantics individually, we show a considerable variation between performances across all models. Conclusion: Bio-SimVerb and Bio-SimLex enable intrinsic evaluation of word representations. This evaluation can serve as a predictor of performance on various downstream tasks in the biomedical domain. The results on Bio-SimVerb and Bio-SimLex using standard word representation models highlight the importance of developing dedicated evaluation resources for NLP in biomedicine for particular word classes (e.g. verbs). These are needed to identify the most accurate methods for learning class-specific representations. Bio-SimVerb and Bio-SimLex are publicly available.
	Keyword: Biomedical Technology; Databases as Topic; Humans; Language; Natural Language Processing; Semantics; Software
	URL: https://doi.org/10.17863/CAM.18170 https://www.repository.cam.ac.uk/handle/1810/276650
