Search in the Catalogues and Directories

Hits 81 – 100 of 1,259

81
On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation ...
BASE
82
How effective is BERT without word ordering? Implications for language understanding and data privacy ...
BASE
83
GEM: Natural Language Generation, Evaluation, and Metrics - Part 4 ...
BASE
84
The statistical advantage of automatic NLG metrics at the system level ...
BASE
85
Counter-Argument Generation by Attacking Weak Premises ...
BASE
86
Supporting Cognitive and Emotional Empathic Writing of Students ...
BASE
87
What's in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus ...
Read paper: https://www.aclanthology.org/2021.acl-short.24
Abstract: Whereas much of the success of the current generation of neural language models has been driven by increasingly large training corpora, relatively little research has been dedicated to analyzing these massive sources of textual data. In this exploratory analysis, we delve deeper into the Common Crawl, a colossal web corpus that is extensively used for training language models. We find that it contains a significant amount of undesirable content, including hate speech and sexually explicit content, even after filtering procedures. We discuss the potential impacts of this content on language models and conclude with future research directions and a more mindful approach to corpus collection and analysis. ...
Keyword: Computational Linguistics; Condensed Matter Physics; Deep Learning; Electromagnetism; FOS Physical sciences; Information and Knowledge Engineering; Neural Network; Semantics
URL: https://dx.doi.org/10.48448/ztc5-5r72
https://underline.io/lecture/25962-what's-in-the-boxquestion-an-analysis-of-undesirable-content-in-the-common-crawl-corpus
BASE
88
Are Pretrained Convolutions Better than Pretrained Transformers? ...
BASE
89
Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards? ...
BASE
90
Beyond Offline Mapping: Learning Cross-lingual Word Embeddings through Context Anchoring ...
BASE
91
Hate Speech Detection Based on Sentiment Knowledge Sharing ...
BASE
92
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation ...
BASE
93
Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction ...
BASE
94
WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation ...
BASE
95
An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization ...
BASE
96
How does Attention Affect the Model? ...
BASE
97
Improve Query Focused Abstractive Summarization by Incorporating Answer Relevance ...
BASE
98
Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities ...
BASE
99
Neural Machine Translation with Monolingual Translation Memory ...
BASE
100
Using Meta-Knowledge Mined from Identifiers to Improve Intent Recognition in Conversational Systems ...
BASE

© 2013 - 2024 Lin|gu|is|tik