DE eng

Search in the Catalogues and Directories

Hits 1 – 4 of 4

1
Using paraphrases for improving first story detection in news and twitter
In: http://homepages.inf.ed.ac.uk/miles/papers/naacl12.pdf (2012)
Abstract: First story detection (FSD) involves identifying first stories about events from a continuous stream of documents. A major problem in this task is the high degree of lexical variation in documents which makes it very difficult to detect stories that talk about the same event but expressed using different words. We suggest using paraphrases to alleviate this problem, making this the first work to use paraphrases for FSD. We show a novel way of integrating paraphrases with locality sensitive hashing (LSH) in order to obtain an efficient FSD system that can scale to very large datasets. Our system achieves state-of-the-art results on the first story detection task, beating both the best supervised and unsupervised systems. To test our approach on large data, we construct a corpus of events for Twitter, consisting of 50 million documents, and show that paraphrasing is also beneficial in this domain. 1
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.381.4393
http://homepages.inf.ed.ac.uk/miles/papers/naacl12.pdf
BASE
Hide details
2
Opinion retrieval in twitter
In: http://homepages.inf.ed.ac.uk/miles/papers/icwsm12.pdf (2012)
BASE
Show details
3
Using paraphrases for improving first story detection in news and twitter
In: http://www.aclweb.org/anthology-new/N/N12/N12-1034.pdf (2012)
BASE
Show details
4
Constructing parallel corpora for six indian languages via crowdsourcing
In: http://www.aclweb.org/anthology/W12-3152/ (2012)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
4
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern