1 |
A Visualizable Evidence-Driven Approach for Authorship Attribution
|
|
|
|
In: https://dl.acm.org/citation.cfm?doid=2744298.2699910 (2015)
|
|
Abstract:
The Internet provides an ideal anonymous channel for concealing computer-mediated malicious activities, as the network-based origins of critical electronic textual evidence (e.g., emails, blogs, forum posts, chat logs, etc.) can be easily repudiated. Authorship attribution is the study of identifying the actual author of the given anonymous documents based on the text itself, and for decades, many linguistic stylometry and computational techniques have been extensively studied for this purpose. However, most of the previous research emphasizes promoting the authorship attribution accuracy, and few works have been done for the purpose of constructing and visualizing the evidential traits. In addition, these sophisticated techniques are difficult for cyber investigators or linguistic experts to interpret. In this article, based on the End-to-End Digital Investigation (EEDI) framework, we propose a visualizable evidence-driven approach, namely VEA, which aims at facilitating the work of cyber investigation. Our comprehensive controlled experiment and the stratified experiment on the real-life Enron email dataset demonstrate that our approach can achieve even higher accuracy than traditional methods; meanwhile, its output can be easily visualized and interpreted as evidential traits. In addition to identifying the most plausible author of a given text, our approach also estimates the confidence for the predicted result based on a given identification context and presents visualizable linguistic evidence for each candidate.
|
|
Keywords:
Algorithms; Design; Experimentation
|
|
URL: http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=139629 https://doi.org/10.1145/2699910
|
|
BASE
|
|
2 |
Ten Years of Rich Internet Applications: A Systematic Mapping Study, and Beyond
|
|
|
|
BASE
|
|
3 |
On the localness of software
|
|
|
|
In: http://macbeth.cs.ucdavis.edu/cache-model.pdf (2014)
|
|
BASE
|
|
4 |
Approximate Semantic Matching of Events for the Internet of Things
|
|
|
|
BASE
|
|
5 |
Q.: Scale based region growing for scene text detection
|
|
|
|
In: http://www.stat.ucla.edu/%7Ejunhua.mao/papers/Scale_based_region_growing_ACM_MM13.pdf (2013)
|
|
BASE
|
|
6 |
Gigatensor: scaling tensor analysis up by 100 times - algorithms and discoveries
|
|
|
|
In: http://www.cs.cmu.edu/~christos/PUBLICATIONS/kdd12-gigatensor.pdf (2012)
|
|
BASE
|
|
7 |
Distributional Semantics with Eyes: Using Image Analysis to Improve Computational Representations of Word Meaning
|
|
|
|
In: http://clic.cimec.unitn.it/marco/publications/bruni-etal-acmmm-2012.pdf (2012)
|
|
BASE
|
|
8 |
Integrating document clustering and . . .
|
|
|
|
In: http://users.cis.fiu.edu/~taoli/pub/a14-wang.pdf (2011)
|
|
BASE
|
|
9 |
Folks in folksonomies: social link prediction from shared metadata
|
|
|
|
In: http://hal.archives-ouvertes.fr/docs/00/42/98/86/PDF/wsdm141-schifanella.pdf (2010)
|
|
BASE
|
|
10 |
Semantic lexicon adaptation for use in query interpretation
|
|
|
|
In: http://www.ra.ethz.ch/cdstore/www2010/www/p1167.pdf (2010)
|
|
BASE
|
|
12 |
Classifying latent user attributes in twitter
|
|
|
|
In: https://csc-869-mlog.googlecode.com/files/p37-rao.pdf (2010)
|
|
BASE
|
|
13 |
Spatiotemporal mapping of Wikipedia concepts
|
|
|
|
In: http://comupedia.org/adrian/articles/jcdl75-popescu.pdf (2010)
|
|
BASE
|
|
14 |
An Information-extraction system for Urdu—a resource-poor language
|
|
|
|
In: http://www.cedar.buffalo.edu/~rohini/Papers/ACM-TALIP.pdf (2010)
|
|
BASE
|
|
15 |
Transliteration for resource-scarce languages
|
|
|
|
In: http://www.cse.iitb.ac.in/~damani/papers/TALIP10/transliterationTALIP10.pdf (2010)
|
|
BASE
|
|
16 |
MorphoNet: Exploring the Use of Community Structure for Unsupervised Morpheme Analysis
|
|
|
|
In: http://clef.isti.cnr.it/2009/working_notes/morpho-papers/bernhard-paperCLEF2009.pdf (2009)
|
|
BASE
|
|
17 |
OpinionMiner: a novel machine learning system for web opinion mining and extraction
|
|
|
|
In: http://www.cedar.buffalo.edu/~rohini/Papers/KDD_Jin.pdf (2009)
|
|
BASE
|
|
18 |
Word sense disambiguation: a survey
|
|
|
|
In: http://www.dsi.uniroma1.it/~navigli/pubs/ACM_Survey_2009_Navigli.pdf (2009)
|
|
BASE
|
|
19 |
Sentiment analysis of blogs by combining lexical knowledge with text classification
|
|
|
|
In: http://www.prem-melville.com/publications/pooling-multinomials-kdd09.pdf (2009)
|
|
BASE
|
|
20 |
A 2-poisson model for probabilistic coreference of named entities for improved text retrieval
|
|
|
|
In: http://www.comp.nus.edu.sg/~nght/pubs/sigir09.pdf (2009)
|
|
BASE
|
|