1 |
A Comparative Study of Text Summarization on E-mail Data Using Unsupervised Learning Approaches
|
|
|
|
In: Dissertations (2020)
|
|
Abstract:
Over the last few years, email has become enormously popular. People send and receive many messages every day to connect with colleagues and friends and to share files and information. Unfortunately, email overload has become a personal burden for users as well as a financial concern for businesses. Working through an ever-increasing number of lengthy emails has become a major concern for many users. Email text summarization is a promising approach to this challenge. Email messages are general-domain text, unstructured and not always syntactically well formed. These characteristics pose challenges for text processing, especially for the task of summarization. This research employs quantitative and inductive methodologies to implement unsupervised learning models that address the summarization task, to generate more precise summaries efficiently, and to determine which approach to unsupervised clustering performs best. The precision score from the ROUGE-N metrics is used as the evaluation metric. The research compares the precision of four text summarization approaches that combine a feature embedding technique (Word2Vec or BERT) with a clustering algorithm (hybrid or conventional). The results reveal that both hybrid approaches, which pair Word2Vec or BERT feature embeddings with the hybrid PHA-ClusteringGain k-means algorithm, achieved higher precision than the conventional k-means clustering model. Among the hybrid approaches, the one using Word2Vec as the feature embedding method attained the maximum precision of 55.73%.
|
|
Keywords:
Computer Engineering; Computer Sciences; Electronic mail; K-means clustering; PHA-Clustering Gain; ROUGE II; Text Summarization; Unsupervised Learning
|
|
URL: https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1220&context=scschcomdis ; https://arrow.tudublin.ie/scschcomdis/205
|
|
BASE
|
2 |
Health Information-seeking Behaviors and Preferences of a Diverse, Multilingual Urban Cohort.
|
|
|
|
In: Medical care, vol 57 Suppl 6 Suppl 2, iss 6 (2019)
|
|
BASE
|
3 |
Health Information-seeking Behaviors and Preferences of a Diverse, Multilingual Urban Cohort.
|
|
|
|
In: Medical care, vol 57 Suppl 6 Suppl 2, iss 6 (2019)
|
|
BASE
|
4 |
Detection of Conferences Attendies from Interactions Can We Know if Our Colleague Attend a Given Conference?
|
|
|
|
In: 2018 IEEE 4th International Forum on Research and Technology for Society and Industry (RTSI), Sep 2018, Palermo, Italy. pp.1-4, ⟨10.1109/RTSI.2018.8548462⟩ ; https://hal-utt.archives-ouvertes.fr/hal-02881475 (2018)
|
|
BASE
|
5 |
Detecção de spam em mensagens SMS utilizando aprendizagem de máquina ; Spam detection in sms messages using machine learning
|
|
Tibola, Rafael Henrique. - Universidade Tecnológica Federal do Paraná (UTFPR), Medianeira, Brasil : Ciência da Computação, 2018
|
|
BASE
|
6 |
CaRE: A refinement calculus for requirements engineering based on argumentation semantics
|
|
|
|
BASE
|
7 |
The Next Frontier in Communication and the ECLIPPSE Study: Bridging the Linguistic Divide in Secure Messaging.
|
|
|
|
BASE
|
8 |
The Next Frontier in Communication and the ECLIPPSE Study: Bridging the Linguistic Divide in Secure Messaging.
|
|
|
|
BASE
|
9 |
Gesture Mimicry in Expression of Laughter
|
|
|
|
In: 2015 International Conference on Affective Computing and Intelligent Interaction (ACII), pp. 677-683. IEEE: Xi'an, China (2015)
|
|
BASE
|
10 |
La “cyberpolitesse”: Formes de l’adresse, ouverture et clôture dans les courriers électroniques [“Cyberpoliteness”: forms of address, openings and closings in e-mail messages]
|
|
|
|
In: Quaderns de Filologia - Estudis Lingüístics; Vol. 12 (2007): PRAGMÁTICA, DISCURSO Y SOCIEDAD; 35-56 ; 2444-1449 ; 1135-416X (2014)
|
|
BASE
|
11 |
Access, interest, and attitudes toward electronic communication for health care among patients in the medical safety net.
|
|
|
|
In: Journal of general internal medicine, vol 28, iss 7 (2013)
|
|
BASE
|
12 |
Robust and Efficient Anti-Phishing Techniques
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
13 |
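'Vor Outlook sind wir alle gleich': Egalisierungs- und Hierarchisierungstendenzen im Zuge der E-Mail-Nutzung ['Before Outlook we are all equal': tendencies toward equalization and hierarchization in the course of e-mail use]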
|
|
|
|
In: kommunikation @ gesellschaft ; 3 ; 15 (2012)
|
|
BASE
|
14 |
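Rezension: Joachim R. Höflich, Julian Gebhardt (Hrsg.), 2003, Vermittlungskulturen im Wandel. Brief, E-Mail, SMS [Review: Joachim R. Höflich, Julian Gebhardt (eds.), 2003, Changing cultures of mediation. Letter, e-mail, SMS]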
|
|
|
|
In: kommunikation @ gesellschaft ; 4 ; 3 ; Höflich, Joachim R. ; Gebhardt, Julian ; 2003 ; Vermittlungskulturen im Wandel: Brief - E-Mail - SMS ; Frankfurt am Main ; P. Lang ; 3-631-39456-X (2012)
|
|
BASE
|
15 |
Authorship Attribution in the E-mail Domain: A Study of the Effect of Size of Author Corpus and Topic on Accuracy of Identification
|
|
|
|
In: DTIC (2011)
|
|
BASE
|
16 |
E-mail Management of Japanese Hotels in Comparison with South Korean Hotels
|
|
|
|
In: UNLV Theses, Dissertations, Professional Papers, and Capstones (2011)
|
|
BASE
|
17 |
Idiolekto požymiai elektroninių laiškų leksikoje. Manifestations of idiolect in the lexis of electronic mail
|
|
|
|
In: Kalbotyra, Vol 63, Iss 3, Pp 149-164 (2011)
|
|
BASE
|
18 |
A corpus study of email writing in a business setting and its practical application in teaching English as a second language
|
|
|
|
In: CardinalScholar 1.0 (2010)
|
|
BASE
|
19 |
A multi-perspective analysis of the request e-mail discourse of a team of education professionals in Hong Kong
|
|
|
|
BASE
|
20 |
A Study of Topic and Topic Change in Conversational Threads
|
|
|
|
In: DTIC (2009)
|
|
BASE
|