Home
Catalogue search
Refine your search:
Keyword
Creator / Publisher:
Kelleher, John D. (7)
Klubicka, Filip (5)
Maldonado, Alfredo (4)
SFI Research Centres Programme (3)
Klubička, Filip (2)
Mahalunkar, Abhijit (2)
ADAPT Centre for Dig- ital Content Technology (1)
ADAPT Centre for Digital Content Technology (1)
John D. Kelleher (1)
John Kelleher (1)
more
Year:
2022 (1)
2020 (2)
2019 (2)
2018 (2)
Medium
Type:
Article (5)
Miscellaneous (2)
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 7 of 7
1
Shapley Idioms: Analysing BERT Sentence Embeddings for General Idiom Token Identification
Nedumpozhimana, Vasudevan
;
Klubička, Filip
;
Kelleher, John D.
In: Front Artif Intell (2022)
Abstract:
This article examines the basis of Natural Language Understanding of transformer based language models, such as BERT. It does this through a case study on idiom token classification. We use idiom token identification as a basis for our analysis because of the variety of information types that have previously been explored in the literature for this task, including: topic, lexical, and syntactic features. This variety of relevant information types means that the task of idiom token identification enables us to explore the forms of linguistic information that a BERT language model captures and encodes in its representations. The core of this article presents three experiments. The first experiment analyzes the effectiveness of BERT sentence embeddings for creating a general idiom token identification model and the results indicate that the BERT sentence embeddings outperform Skip-Thought. In the second and third experiment we use the game theory concept of Shapley Values to rank the usefulness of individual idiomatic expressions for model training and use this ranking to analyse the type of information that the model finds useful. We find that a combination of idiom-intrinsic and topic-based properties contribute to an expression's usefulness in idiom token identification. Overall our results indicate that BERT efficiently encodes a variety of information from topic, through lexical and syntactic information. Based on these results we argue that notwithstanding recent criticisms of language model based semantics, the ability of BERT to efficiently encode a variety of linguistic information types does represent a significant step forward in natural language understanding.
Keyword:
Artificial Intelligence
URL:
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8964145/
https://doi.org/10.3389/frai.2022.813967
BASE
Hide details
2
Semantic Relatedness and Taxonomic Word Embeddings ...
Kacmajor, Magdalena
;
Kelleher, John D.
;
Klubicka, Filip
. - : arXiv, 2020
BASE
Show details
3
English WordNet Taxonomic Random Walk Pseudo-Corpora
Klubicka, Filip
;
Maldonado, Alfredo
;
Mahalunkar, Abhijit
...
In: Conference papers (2020)
BASE
Show details
4
Synthetic, Yet Natural: Properties of WordNet Random Walk Corpora and the impact of rare words on embedding performance
Klubicka, Filip
;
Mahalunkar, Abhijit
;
Maldonado, Alfredo
...
In: Conference papers (2019)
BASE
Show details
5
Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings
Maldonado, Alfredo
;
Klubicka, Filip
;
Kelleher, John D.
In: Articles (2019)
BASE
Show details
6
Is it worth it? Budget-related evaluation metrics for model selection ...
Klubička, Filip
;
Salton, Giancarlo D.
;
Kelleher, John D.
. - : arXiv, 2018
BASE
Show details
7
Is it worth it? Budget-related evaluation metrics for model selection
Klubicka, Filip
;
Salton, Giancarlo
;
Kelleher, John D.
In: Conference papers (2018)
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
7
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern