Catalogue search
Hits 1 – 3 of 3
1
Language related issues for machine translation between closely related South Slavic languages
Arcan, Mihael; Klubicka, Filip; Popovic, Maja. The COLING 2016 Organizing Committee, 2019
BASE
2
Synthetic, Yet Natural: Properties of WordNet Random Walk Corpora and the impact of rare words on embedding performance
Klubicka, Filip; Mahalunkar, Abhijit; Maldonado, Alfredo; Kelleher, John D.
In: Conference papers (2019)
Abstract:
Creating word embeddings that reflect semantic relationships encoded in lexical knowledge resources is an open challenge. One approach is to use a random walk over a knowledge graph to generate a pseudo-corpus and use this corpus to train embeddings. However, the effect of the shape of the knowledge graph on the generated pseudo-corpora, and on the resulting word embeddings, has not been studied. To explore this, we use English WordNet, constrained to the taxonomic (tree-like) portion of the graph, as a case study. We investigate the properties of the generated pseudo-corpora, and their impact on the resulting embeddings. We find that the distributions in the pseudo-corpora exhibit properties found in natural corpora, such as Zipf’s and Heaps’ laws, and also observe that the proportion of rare words in a pseudo-corpus affects the performance of its embeddings on word similarity.
Keyword:
Artificial Intelligence and Robotics; Computational Linguistics; corpus; evaluation; Numerical Analysis and Scientific Computing; random walk; representations; Software Engineering; taxonomy; word embeddings; word similarity; WordNet
URL:
https://arrow.tudublin.ie/scschcomcon/271
https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1283&context=scschcomcon
BASE
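Illustration:
The abstract of record 2 describes generating a pseudo-corpus by random walks over a taxonomy. The following is only a minimal sketch of that general idea, not the authors' implementation: the toy taxonomy, the function names, and the parameters (`num_sentences`, `walk_length`, `seed`) are all hypothetical, and the walk is a plain uniform walk over an undirected tree rather than anything specified in the paper.

```python
import random

# Hypothetical toy taxonomy (child -> parent); NOT taken from WordNet.
TOY_HYPERNYMS = {
    "dog": "canine",
    "wolf": "canine",
    "canine": "mammal",
    "cat": "feline",
    "feline": "mammal",
    "mammal": "animal",
}


def build_adjacency(hypernyms):
    """Turn child -> parent links into an undirected adjacency map."""
    adjacency = {}
    for child, parent in hypernyms.items():
        adjacency.setdefault(child, set()).add(parent)
        adjacency.setdefault(parent, set()).add(child)
    # Sorted lists keep the walk reproducible for a fixed seed.
    return {node: sorted(neighbours) for node, neighbours in adjacency.items()}


def random_walk_corpus(adjacency, num_sentences, walk_length, seed=0):
    """Generate pseudo-sentences; each one is a uniform random walk."""
    rng = random.Random(seed)
    nodes = sorted(adjacency)
    corpus = []
    for _ in range(num_sentences):
        current = rng.choice(nodes)  # assumed: uniform choice of start node
        sentence = [current]
        for _ in range(walk_length - 1):
            current = rng.choice(adjacency[current])  # step to a neighbour
            sentence.append(current)
        corpus.append(sentence)
    return corpus


adjacency = build_adjacency(TOY_HYPERNYMS)
corpus = random_walk_corpus(adjacency, num_sentences=5, walk_length=6)
for sentence in corpus:
    print(" ".join(sentence))
```

Each printed line is one pseudo-sentence. A corpus built this way could then be passed to any word-embedding trainer; the choice of trainer and of walk parameters is left open here.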
3
Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings
Maldonado, Alfredo; Klubicka, Filip; Kelleher, John D.
In: Articles (2019)
BASE
© 2013 - 2024 Lin|gu|is|tik