1 |
A Seed-Guided Latent Dirichlet Allocation Approach to Predict the Personality of Online Users Using the PEN Model
|
|
|
|
In: Algorithms; Volume 15; Issue 3; Pages: 87 (2022)
|
|
Abstract:
There is a growing interest in topic modeling to decipher the valuable information embedded in natural texts. However, there are no studies training an unsupervised model to automatically categorize the social networks (SN) messages according to personality traits. Most of the existing literature relied on the Big 5 framework and psychological reports to recognize the personality of users. Furthermore, collecting datasets for other personality themes is an inherent problem that requires unprecedented time and human efforts, and it is bounded with privacy constraints. Alternatively, this study hypothesized that a small set of seed words is enough to decipher the psycholinguistics states encoded in texts, and the auxiliary knowledge could synergize the unsupervised model to categorize the messages according to human traits. Therefore, this study devised a dataless model called Seed-guided Latent Dirichlet Allocation (SLDA) to categorize the SN messages according to the PEN model that comprised Psychoticism, Extraversion, and Neuroticism traits. The intrinsic evaluations were conducted to determine the performance and disclose the nature of texts generated by SLDA, especially in the context of Psychoticism. The extrinsic evaluations were conducted using several machine learning classifiers to posit how well the topic model has identified latent semantic structure that persists over time in the training documents. The findings have shown that SLDA outperformed other models by attaining a coherence score up to 0.78, whereas the machine learning classifiers can achieve precision up to 0.993. We also will be shared the corpus generated by SLDA for further empirical studies.
|
|
Keyword:
machine learning; PEN model; personality detection; topic modeling
|
|
URL: https://doi.org/10.3390/a15030087
|
|
BASE
|
|
Hide details
|
|
2 |
Explorando la ciberviolencia contra mujeres y niñas en Filipinas a través de Mining Online News
|
|
|
|
In: Comunicar: Revista científica iberoamericana de comunicación y educación, ISSN 1134-3478, Nº 70, 2022 (Ejemplar dedicado a: Nuevos retos del profesorado ante la enseñanza digital), pags. 125-138 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Topic models do not model topics: epistemological remarks and steps towards best practices
|
|
|
|
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03261599 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, 2021, ⟨10.46298/jdmdh.7595⟩ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Characterizing Topics in Social Media Using Dynamics of Conversation
|
|
|
|
In: Entropy; Volume 23; Issue 12; Pages: 1642 (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Digital Humanities - A Discipline in its Own Right? An Analysis of the Role and Position of DH in the Academic Landscape ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Digital Humanities - A Discipline in its Own Right? An Analysis of the Role and Position of DH in the Academic Landscape ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Exploring Online Depression Forums via Text Mining: A Comparison of Reddit and a Curated Online Forum ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Same same, but different ? On the Relation of Information Science and the Digital Humanities A Scientometric Comparison of Academic Journals Using LDA and Hierarchical Clustering ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
LDA-Based Topic Modeling Sentiment Analysis Using Topic/Document/Sentence (TDS) Model
|
|
|
|
In: Applied Sciences ; Volume 11 ; Issue 23 (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Real-Time Infoveillance of Moroccan Social Media Users’ Sentiments towards the COVID-19 Pandemic and Its Management
|
|
|
|
In: International Journal of Environmental Research and Public Health ; Volume 18 ; Issue 22 (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Schizophrenia Detection Using Machine Learning Approach from Social Media Content
|
|
|
|
In: Sensors ; Volume 21 ; Issue 17 (2021)
|
|
BASE
|
|
Show details
|
|
12 |
Exploring Latent Topics and Research Trends in Mathematics Teachers’ Knowledge Using Topic Modeling: A Systematic Review
|
|
|
|
In: Mathematics ; Volume 9 ; Issue 22 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Relationship Between Poetic Meter and Meaning in Accentual-Syllabic Verse (data and replication code) ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Relationship Between Poetic Meter and Meaning in Accentual-Syllabic Verse (data and replication code) ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Relationship Between Poetic Meter and Meaning in Accentual-Syllabic Verse (data and replication code) ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Jeu vidéo et linéarité romanesque ... : La structure Aventure/mésaventures dans Dragon Age : Origins ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Ensemble Topic Modeling using Weighted Term Co-associations
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Examining the Social Media Antecedents of Racial Justice: Evidence from Twitter
|
|
|
|
BASE
|
|
Show details
|
|
20 |
EXPLORING PSEUDO-TOPIC-MODELING FOR CREATING AUTOMATED DISTANT-ANNOTATION SYSTEMS
|
|
|
|
In: Theses (2021)
|
|
BASE
|
|
Show details
|
|
|
|