Search in the Catalogues and Directories

Hits 1 – 20 of 28

1
How does the pre-training objective affect what large language models learn about linguistic properties? ...
BASE
2
Automatic Identification and Classification of Bragging in Social Media ...
BASE
3
Analyzing Online Political Advertisements ...
BASE
4
Modeling the Severity of Complaints in Social Media ...
Jin, Mali; Aletras, Nikolaos. - : arXiv, 2021
BASE
5
Translation Error Detection as Rationale Extraction ...
BASE
6
Knowledge Distillation for Quality Estimation ...
BASE
7
Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ...
Abstract: Masked language modeling (MLM), a self-supervised pretraining objective, is widely used in natural language processing for learning text representations. MLM trains a model to predict a random sample of input tokens that have been replaced by a [MASK] placeholder in a multi-class setting over the entire vocabulary. When pretraining, it is common to use other auxiliary objectives on the token or sequence level alongside MLM to improve downstream performance (e.g. next sentence prediction). However, no previous work has examined whether other simpler objectives, linguistically intuitive or not, can be used standalone as main pretraining objectives. In this paper, we explore five simple pretraining objectives based on token-level classification tasks as replacements for MLM. Empirical results on GLUE and SQuAD show that our proposed methods achieve performance comparable to or better than MLM using a BERT-BASE architecture. We further validate our methods using smaller models, showing that ... : Accepted at EMNLP 2021 ...
Keyword: Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
URL: https://arxiv.org/abs/2109.01819
https://dx.doi.org/10.48550/arxiv.2109.01819
BASE
8
Analyzing Online Political Advertisements ...
BASE
9
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification ...
BASE
10
Enjoy the Salience: Towards Better Transformer-based Faithful Explanations with Word Salience ...
BASE
11
Modeling the Severity of Complaints in Social Media ...
NAACL 2021; Aletras, Nikolaos; Jin, Mali. - : Underline Science Inc., 2021
BASE
12
Active Learning by Acquiring Contrastive Examples ...
BASE
13
In Factuality: Efficient Integration of Relevant Facts for Visual Question Answering ...
BASE
14
Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ...
BASE
15
Knowledge Distillation for Quality Estimation ...
BASE
16
Machine Extraction of Tax Laws from Legislative Texts
In: Proceedings of the Natural Legal Language Processing Workshop 2021 (2021)
BASE
17
Point-of-Interest Type Prediction using Text and Images ...
BASE
18
Point-of-Interest Type Prediction using Text and Images ...
BASE
19
An Empirical Study on Leveraging Position Embeddings for Target-oriented Opinion Words Extraction ...
BASE
20
Knowledge distillation for quality estimation
Gajbhiye, Amit; Fomicheva, Marina; Alva-Manchego, Fernando. - : Association for Computational Linguistics, 2021
BASE

© 2013 - 2024 Lin|gu|is|tik