
Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
Synthetic, Yet Natural: Properties of WordNet Random Walk Corpora and the impact of rare words on embedding performance
In: Conference papers (2019)
BASE
2
Phonetic acquisition in cortical dynamics, a computational approach
In: Computer Science: Faculty Publications and Other Works (2019)
BASE
3
Practical Natural Language Generation from Knowledge Graphs
In: Embargoed Honors Theses, University of Nebraska-Lincoln (2019)
BASE
4
Efficient Inference, Search and Evaluation for Latent Variable Models of Text with Applications to Information Retrieval and Machine Translation
In: Doctoral Dissertations (2016)
BASE
5
A Study on the Efficacy of Sentiment Analysis in Author Attribution
In: Electronic Theses and Dissertations (2015)
BASE
6
CSC Senior Project: NLPStats
In: Computer Science and Software Engineering (2013)
BASE
7
Multilingual Vandalism Detection Using Language-Independent & Ex Post Facto Evidence
In: Departmental Papers (CIS) (2011)
Abstract: There is much literature on Wikipedia vandalism detection. However, this writing addresses two facets given little treatment to date. First, prior efforts emphasize zero-delay detection, classifying edits the moment they are made. If classification can be delayed (e.g., compiling offline distributions), it is possible to leverage ex post facto evidence. This work describes/evaluates several features of this type, which we find to be overwhelmingly strong vandalism indicators. Second, English Wikipedia has been the primary test-bed for research. Yet, Wikipedia has 200+ language editions and use of localized features impairs portability. This work implements an extensive set of language-independent indicators and evaluates them using three corpora (German, English, Spanish). The work then extends to include language-specific signals. Quantifying their performance benefit, we find that such features can moderately increase classifier accuracy, but significant effort and language fluency are required to capture this utility. Aside from these novel aspects, this effort also broadly addresses the task, implementing 65 total features. Evaluation produces 0.840 PR-AUC on the zero-delay task and 0.906 PR-AUC with ex post facto evidence (averaging languages). Performance matches the state-of-the-art (English), sets novel baselines (German, Spanish), and is validated by a first-place finish over the 2011 PAN-CLEF test set.
Keyword: collaborative security; collaborative software; CPS Internet of Things; Databases and Information Systems; feature selection; machine learning; Numerical Analysis and Scientific Computing; Other Computer Sciences; social software misuse; vandalism; Wikipedia
URL: https://repository.upenn.edu/cgi/viewcontent.cgi?article=1515&context=cis_papers
https://repository.upenn.edu/cis_papers/479
BASE
8
Software Internationalization: A Framework Validated Against Industry Requirements for Computer Science and Software Engineering Programs
In: Master's Theses (2010)
BASE
