5 |
A sentiment analysis approach to increase authorship identification
|
|
|
|
Abstract:
Writing style is considered the manner in which an author expresses his thoughts, influenced by language characteristics, period, school, or nation. Often, this writing style can identify the author. One of the most famous examples comes from 1914 in Portuguese literature. With Fernando Pessoa and his heteronyms Alberto Caeiro, alvaro de Campos, and Ricardo Reis, who had completely different writing styles, led people to believe that they were different individuals. Currently, the discussion of authorship identification is more relevant because of the considerable amount of widespread fake news in social media, in which it is hard to identify who authored a text and even a simple quote can impact the public image of an author, especially if these texts or quotes are from politicians. This paper presents a process to analyse the emotion contained in social media messages such as Facebook to identify the author's emotional profile and use it to improve the ability to predict the author of the message. Using preprocessing techniques, lexicon-based approaches, and machine learning, we achieved an authorship identification improvement of approximately 5% in the whole dataset and more than 50% in specific authors when considering the emotional profile on the writing style, thus increasing the ability to identify the author of a text by considering only the author's emotional profile, previously detected from prior texts. ; FCT has supported this work – Fundação para a Ciência e Tecnologia within the Project Scope: UID/CEC/00319/2019.
|
|
Keyword:
Ciências Naturais::Ciências da Computação e da Informação; Eletrónica e Informática; Engenharia e Tecnologia::Engenharia Eletrotécnica; machine learning; natural language processing; Science & Technology; sentiment analysis
|
|
URL: https://doi.org/10.1111/exsy.12469 http://hdl.handle.net/1822/68848
|
|
BASE
|
|
Hide details
|
|
6 |
C Tutor usage in relation to student achievement and progress: a study of introductory programming courses in Portugal and Serbia
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Urban Evolution of Fafe in the Last Two Centuries
|
|
Henriques, Pedro Rangel. - : Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2019. : OASIcs - OpenAccess Series in Informatics. 8th Symposium on Languages, Applications and Technologies (SLATE 2019), 2019
|
|
BASE
|
|
Show details
|
|
8 |
Scraping news sites and social networks for prejudice term analysis
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Predicting Performance Problems Through Emotional Analysis (Short Paper)
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Predicting Performance Problems Through Emotional Analysis (Short Paper) ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Increasing authorship identification through emotional analysis
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Hate speech classification in social media using emotional analysis
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Applying Attribute Grammars to Teach Linguistic Rules
|
|
Henriques, Pedro Rangel. - : Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2017. : OASIcs - OpenAccess Series in Informatics. 6th Symposium on Languages, Applications and Technologies (SLATE 2017), 2017
|
|
BASE
|
|
Show details
|
|
|
|