
Search in the Catalogues and Directories

Hits 1 – 20 of 49

1
Introduction to the Special Issue on Language in Social Media: Exploiting Discourse and Other Contextual Information
In: Computational Linguistics, Vol 44, Iss 4, Pp 663-681 (2018)
BASE
2
Author Style Analysis in Text Documents Based on Character and Word N-Grams
Abstract: We describe our research on text analytics methods for detecting differences and similarities in the style of authors of text documents. Automatic methods for analyzing the written style of authors have applications in forensics, plagiarism detection, security, and literary research. We present our method for the problem of authorship verification, that is, the problem of deciding whether a certain text was written by a specific person, given samples of their writing. Our proximity-based one-class classifier method is evaluated on a multilingual dataset of the Author Identification competition of the PAN 2013 shared tasks on digital text forensics. A version of our method submitted to the task was the winner in the competition’s secondary evaluation. We also propose a visual analytics tool, RNG-Sig, for investigating differences and similarities between text documents at the level of character n-grams, features that have been shown to be powerful for identifying authorship. The tool provides a visual interface for performing classification for authorship attribution (the task of deciding who among candidate authors wrote a considered text, based on samples of writing of the candidates) using the CNG classifier proposed by Keselj et al. RNG-Sig allows the user to visually interpret the inner workings of the classifier and to influence the classification process. Further, we systematically study authorship attribution in the situation when samples of writing of different candidates have different levels of topical similarity to the text being attributed. We investigate how such a condition influences the behaviour of two supervised classifiers on two sets of features commonly used for the task, and we show that supervised models are biased towards attributing a questioned document to a candidate whose writing samples are topically more similar to the document. We propose a method of character n-gram selection that alleviates this bias of classifiers.
Keyword: authorship analysis; Data mining; machine learning; text mining; Visual texture recognition
URL: http://hdl.handle.net/10222/72872
BASE
3
Learning to Classify Documents According to Formal and Informal Style
In: http://elanguage.net/journals/lilt/article/viewFile/2844/2822/ (2012)
BASE
4
Hierarchical approach to emotion recognition and classification in texts
In: http://www.site.uottawa.ca/~diana/publications/DimanCanadianAI2010.pdf (2010)
BASE
5
Identification and Disambiguation of Cognates, False Friends, and Partial Cognates Using Machine Learning Techniques
In: http://www.macrothink.org/journal/index.php/ijl/article/viewFile/309/193/ (2009)
BASE
6
Visual development process for automatic generation of digital games narrative content
In: http://www.site.uottawa.ca/~diana/publications/caropresoetallACLWshort.pdf (2009)
BASE
7
Semantic text similarity using corpus-based word similarity and string similarity
In: http://www.site.uottawa.ca/~mdislam/publications/tkdd.pdf (2008)
BASE
8
A statistical model for near-synonym choice
In: http://www.site.uottawa.ca/~diana/publications/ns_tslp.pdf (2007)
BASE
9
Semantic similarity knowledge and its applications
In: http://www.cs.ubbcluj.ro/~studia-i/2007-1/02-Inkpen.pdf (2007)
BASE
10
Machine Learning Experiments for Textual Entailment
In: http://u.cs.biu.ac.il/~nlp/RTE2/Proceedings/02.pdf (2006)
BASE
11
Building and Using a Lexical Knowledge-base of Near-Synonym Differences
In: http://www.site.uottawa.ca/~diana/publications/InkpenHirst_cl.pdf (2006)
BASE
12
Machine Learning Experiments for Textual Entailment
In: http://www.site.uottawa.ca/~diana/publications/entailment2006.pdf (2006)
BASE
13
Building and using a lexical knowledge base of near-synonym differences
In: http://ftp.cs.toronto.edu/pub/gh/Inkpen+Hirst-2006.pdf (2006)
BASE
14
Investigating cross-language speech retrieval for a spontaneous conversational speech collection
In: http://www.mt-archive.info/HLT-NAACL-2006-Inkpen.pdf (2006)
BASE
15
Second order co-occurrence PMI for determining the semantic similarity of words
In: http://www.site.uottawa.ca/~diana/publications/LREC_2006.pdf (2006)
BASE
16
Using various indexing schemes and multiple translations in the CL-SR task at CLEF 2005
In: http://www.site.uottawa.ca/~diana/publications/UO_LNCS_CLEF05.pdf (2006)
BASE
17
Semi-supervised learning of partial cognates using bilingual bootstrapping
In: http://www.site.uottawa.ca/~diana/publications/ACL2006_cognates.pdf (2006)
BASE
18
Semi-supervised learning of partial cognates using bilingual bootstrapping
In: http://acl.ldc.upenn.edu/P/P06/P06-1056.pdf (2006)
BASE
19
A Text Processing Tool for the Romanian Language
In: http://www.site.uottawa.ca/~diana/Cross_Language_Knowledge_Induction_Workshop/P9.pdf (2005)
BASE
20
Using various indexing schemes and multiple translations
In: http://www.site.uottawa.ca/~mdislam/publications/CLEF_uOttawa_05.pdf (2005)
BASE


Hits by source: Open access documents 49; Catalogues 0; Bibliographies 0; Linked Open Data catalogues 0; Online resources 0
© 2013 - 2024 Lin|gu|is|tik