1 |
Islands and Bridges of Language: Bio-Inspired Structural Analysis of Language Embedding Data
|
|
|
|
BASE
|
|
Show details
|
|
2 |
An Ontology based Smart Management of Linguistic Knowledge
|
|
|
|
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03618012 ; Journal of Data Mining and Digital Humanities, Episciences.org, In press (2022)
|
|
BASE
|
|
Show details
|
|
3 |
PROTECT: A Pipeline for Propaganda Detection and Classification
|
|
|
|
In: CLiC-it 2021- Italian Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03417019 ; CLiC-it 2021- Italian Conference on Computational Linguistics, Jan 2022, Milan, Italy (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Le modèle Transformer: un « couteau suisse » pour le traitement automatique des langues
|
|
|
|
In: Techniques de l'Ingenieur ; https://hal.archives-ouvertes.fr/hal-03619077 ; Techniques de l'Ingenieur, Techniques de l'ingénieur, 2022, ⟨10.51257/a-v1-in195⟩ ; https://www.techniques-ingenieur.fr/base-documentaire/innovation-th10/innovations-en-electronique-et-tic-42257210/transformer-des-reseaux-de-neurones-pour-le-traitement-automatique-des-langues-in195/ (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Structured, flexible, and robust: comparing linguistic plans and explanations generated by humans and large language models ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Eine agentenbasierte Architektur für Programmierung mit gesprochener Sprache ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
VivesDebate: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
VivesDebate: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
VivesDebate: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Latin Lemmatization & POS Tagging. Issues, Resources, Tools ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
The First Gospel, the Gospel of the Poor: A New Reconstruction of Q and Resolution of the Synoptic Problem based on Marcion's Early Luke ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Latin Lemmatization & POS Tagging. Issues, Resources, Tools ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
The First Gospel, the Gospel of the Poor: A New Reconstruction of Q and Resolution of the Synoptic Problem based on Marcion's Early Luke ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Measuring and Comparing Social Bias in Static and Contextual Word Embeddings
|
|
|
|
In: Dissertations (2022)
|
|
Abstract:
Word embeddings have been considered one of the biggest breakthroughs of deep learning for natural language processing. They are learned numerical vector representations of words where similar words have similar representations. Contextual word embeddings are the promising second-generation of word embeddings assigning a representation to a word based on its context. This can result in different representations for the same word depending on the context (e.g. river bank and commercial bank). There is evidence of social bias (human-like implicit biases based on gender, race, and other social constructs) in word embeddings. While detecting bias in static (classical or non-contextual) word embeddings is a well-researched topic, there has been limited work in detecting bias in contextual word embeddings, mostly focussed on using the Word Embedding Association Test (WEAT). This paper explores measuring social bias (gender, ethnicity, and religion) in contextual word embeddings using a number of fairness metrics, including the Relative Norm Distance (RND), the Relative Negative Sentiment Bias (RNSB) and the already mentioned WEAT. It extends the Word Embeddings Fairness Evaluation (WEFE) framework to facilitate measuring social biases in contextual embeddings and compares these with biases in static word embeddings. The results show when ranking performance over a number of fairness metrics that contextual word embedding pre-trained models BERT and RoBERTa have more social bias than static word embedding pre-trained models GloVe and Word2Vec.
|
|
Keyword:
Computer Engineering; Computer Sciences; Contextual Word Embeddings; Fairness Evaluation; Natural Language Processing; Sentence Embeddings; Social Bias; Word Embeddings
|
|
URL: https://arrow.tudublin.ie/scschcomdis/250 https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1267&context=scschcomdis
|
|
BASE
|
|
Hide details
|
|
16 |
Data of the Shared Task on the Disambiguation of German Verbal Idioms at KONVENS 2021 ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Data of the Shared Task on the Disambiguation of German Verbal Idioms at KONVENS 2021 ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Brazilian Portuguese verbal databases ; Bases lexicais verbais do português brasileiro
|
|
|
|
In: Domínios de Lingu@gem; Ahead of Print ; 1980-5799 (2022)
|
|
BASE
|
|
Show details
|
|
19 |
Representation learning of natural language and its application to language understanding and generation
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Eine agentenbasierte Architektur für Programmierung mit gesprochener Sprache
|
|
|
|
BASE
|
|
Show details
|
|
|
|