
Search in the Catalogues and Directories

Hits 1 – 5 of 5

1
Verblexpor: A lexical resource with semantic roles for Portuguese
Villavicencio, A.; Finatto, M.J.B.; Zilio, L. - European Language Resources Association (ELRA), 2016
BASE
2
Multiword expressions in child language
Wilkens, R.; Idiart, M.; Villavicencio, A. - European Language Resources Association (ELRA), 2016
BASE
3
mwetoolkit+sem: Integrating word embeddings in the mwetoolkit for semantic MWE processing
Cordeiro, S.; Ramisch, C.; Villavicencio, A. - European Language Resources Association (ELRA), 2016
BASE
4
B2SG: a TOEFL-like task for Portuguese
Wilkens, R.; Zilio, L.; Ferreira, E.; Villavicencio, A. - European Language Resources Association (ELRA), 2016
Abstract: Resources such as WordNet are useful for NLP applications, but their manual construction consumes time and personnel and frequently results in low coverage. One alternative is the automatic construction of large resources from corpora, such as distributional thesauri, which contain semantically associated words. However, because such resources may contain noise, there is a strong need for automatic ways of evaluating their quality. This paper introduces a gold standard that can aid in this task. The BabelNet-Based Semantic Gold Standard (B2SG) was automatically constructed based on BabelNet and partly evaluated by human judges. It consists of sets of tests that each present one target word, one related word and three unrelated words. B2SG contains 2,875 validated relations: 800 for verbs and 2,075 for nouns; these relations are divided among synonymy, antonymy and hypernymy. They can be used as the basis for evaluating the accuracy of the similarity relations in distributional thesauri by comparing the proximity of the target word with the related and unrelated options and observing whether the related word has the highest similarity value among them. As a case study, two distributional thesauri were also developed: one using surface forms from a large (1.5 billion word) corpus and the other using lemmatized forms from a smaller (409 million word) corpus. Both distributional thesauri were then evaluated against B2SG, and the one using lemmatized forms performed slightly better.
URL: https://www.aclweb.org/anthology/L16-1580
http://eprints.whiterose.ac.uk/153568/
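
The evaluation scheme described in the abstract (one target word, one related word, three unrelated words, with a test counted as correct when the related word has the highest similarity to the target) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy three-dimensional vectors, the example words, and the use of cosine similarity are all assumptions made for illustration, and B2SG itself is not loaded here.

```python
import math

# Hypothetical toy vectors standing in for a distributional thesaurus.
# Words and values are invented for illustration; they are not B2SG data.
TOY_VECTORS = {
    "casa":   (1.0, 0.0, 0.0),
    "lar":    (0.9, 0.1, 0.0),
    "feliz":  (0.2, 0.9, 0.1),
    "alegre": (0.1, 0.3, 0.9),  # deliberately noisy neighbour of "feliz"
    "banana": (0.0, 1.0, 0.0),
    "nuvem":  (0.0, 0.0, 1.0),
    "livro":  (0.1, 0.7, 0.7),
}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def toy_similarity(word_a, word_b):
    """Similarity lookup over the toy vectors (assumed measure: cosine)."""
    return cosine(TOY_VECTORS[word_a], TOY_VECTORS[word_b])


def toefl_like_accuracy(tests, similarity):
    """Fraction of tests in which the related word outscores every unrelated word.

    Each test is a tuple (target, related, unrelated_words).
    """
    if not tests:
        return 0.0
    correct = 0
    for target, related, unrelated in tests:
        related_score = similarity(target, related)
        if all(related_score > similarity(target, w) for w in unrelated):
            correct += 1
    return correct / len(tests)


# Two hypothetical test items in the one-related, three-unrelated format.
TESTS = [
    ("casa", "lar", ("banana", "nuvem", "livro")),
    ("feliz", "alegre", ("banana", "nuvem", "livro")),
]

accuracy = toefl_like_accuracy(TESTS, toy_similarity)
print(accuracy)  # 0.5: the first test passes, the second fails
```

Passing the similarity function as a parameter means the same loop could score two different thesauri, such as one built from surface forms and one from lemmatized forms, on an identical test set.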
BASE
5
Comparing the quality of focused crawlers and of the translation resources obtained from them
Laranjeira, B.R.; Moreira, V.P.; Villavicencio, A. - European Language Resources Association (ELRA), 2014
BASE
