Page: 1... 3 4 5 6 7 8 9 10 11... 104
121 |
Semi-automatic Annotation Proposal for Increasing a Fake News Dataset in Spanish
|
|
|
|
Abstract:
The digital era has become an ally of fake news, since it has increased the spread and amount of false information. Fake news is a global problem that causes disorder and generates fear. This phenomenon must be attacked in the same environment in which it is generated: in the digital environment. This paper presents the current state of my doctoral thesis which focuses on the linguistic modelling applied to the automatic detection of fake news through Natural Language Processing (NLP). In order to study the linguistic characteristics of fake news and to create computational models that automate its detection, labelled datasets are needed, but this is a costly task that requires time and expertise. A fake news dataset and an annotation guide were created ad hoc in a previous work to analyse all the parts and elements of a news item. However, after creating and training our system, we realised that the time spent was not proportional to the low annotated data obtained. The need of creating a larger corpus to train and test our hypothesis has led us to think about a way of increasing our corpus without spending so much time. For that purpose, a semi-automatic annotation is proposed for reducing time while increasing speed and quantity of the examples annotated. This proposal, besides allowing us to make progress in our research, may facilitate the creation of datasets, which are essential in NLP research. ; This research work has been partially funded by Generalitat Valenciana through project “SIIA: Tecnologías del lenguaje humano para una sociedad inclusiva, igualitaria, y accesible” with grant reference PROMETEU/2018/089, by the Spanish Government through the projects RTI2018-094653-BC22: “Modelang: Modeling the behavior of digital entities by Human Language Technologies” and RTI2018-094653-B-C21: “LIVING-LANG: Living Digital Entities by Human Language Technologies”, as well as being partially supported by a grant from the Fondo Europeo de Desarrollo Regional (FEDER).
|
|
Keyword:
Corpus annotation; Corpus creation; Fake news detection; Human Language Technologies; Lenguajes y Sistemas Informáticos; Natural Language Processing; Semi-automatic annotation
|
|
URL: http://hdl.handle.net/10045/120080
|
|
BASE
|
|
Hide details
|
|
122 |
Relaciones de sucesos, corpus digitales y extranjerismos ; News Pamphlets, Digital Corpora and Loanwords
|
|
|
|
BASE
|
|
Show details
|
|
124 |
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
|
|
|
|
BASE
|
|
Show details
|
|
125 |
A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal
|
|
|
|
BASE
|
|
Show details
|
|
126 |
Examining the State-of-the-Art in News Timeline Summarization
|
|
|
|
BASE
|
|
Show details
|
|
127 |
Fake News and Disinformation about Vaccines ; Fake news e desinformação sobre vacinas
|
|
|
|
In: Revista GTLex; v. 6 n. 2 (2021): Número atemático; 345-394 ; 2447-9551 (2021)
|
|
BASE
|
|
Show details
|
|
128 |
«Li deien el Pelusa, pels seus cabells»: anàlisi discursiva del Telenotícies i l’InfoK de TVC
|
|
|
|
BASE
|
|
Show details
|
|
129 |
The Mythologeme “Coronavirus” in the Modern Mass Media News in Europe and Asia ; Мифологема «коронавирус» в современных новостях масс-медиа Европы и Азии
|
|
|
|
BASE
|
|
Show details
|
|
130 |
Медийные ограничения периода пандемии: опыт COVID-цензуры ; Media Restrictions During the Pandemic: the COVID Censorship Experience
|
|
|
|
BASE
|
|
Show details
|
|
132 |
Das sogenannte generische Maskulinum in der „Zeit im Bild“-Berichterstattung des ORF
|
|
|
|
BASE
|
|
Show details
|
|
133 |
Exploring ideological messages in newspaper editorials and news reports on the first human gene-editing case
|
|
|
|
In: Lingue e Linguaggi; Volume 42 (2021) Special Issue; 101-122 (2021)
|
|
BASE
|
|
Show details
|
|
134 |
A corpus linguistic study of Australian and Chinese health news reporting on salt consumption
|
|
Zhao, Mengdan. - : The University of Sydney, 2021. : Department of Chinese Studies, 2021. : Faculty of Arts and Social Sciences, School of Languages and Cultures, 2021
|
|
BASE
|
|
Show details
|
|
135 |
Aboriginal and Torres Strait Islander people(s) in Australian print news: A corpus-based critical discourse analysis
|
|
Bray, Carly. - : Department of Linguistics, 2021. : Faculty of Arts and Social Sciences, School of Literature, Art and Media, 2021
|
|
BASE
|
|
Show details
|
|
137 |
On the Detection of False Information: From Rumors to Fake News
|
|
|
|
BASE
|
|
Show details
|
|
139 |
JOUR200A: Fundamentals of Editing and Reporting I — Teaching Lede Writing for Digital/Print, Broadcast and Social Media Simultaneously
|
|
|
|
In: UNL Faculty Course Portfolios (2021)
|
|
BASE
|
|
Show details
|
|
140 |
#Fakenews denunciada = eleições brasileiras de 2018 e comentários online ; HashtagFakenews
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1... 3 4 5 6 7 8 9 10 11... 104
|
|