DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
Facet Classification of Blogs: Know-Center at the TREC 2009 Blog Distillation Task
In: DTIC (2009)
Abstract: In this paper, we outline our experiments carried out at the TREC 2009 Blog Distillation Task. Our system is based on a plain text index extracted from the XML feeds of the TREC Blogs08 dataset. This index was used to retrieve candidate blogs for the given topics. The resulting blogs were classified using a Support Vector Machine that was trained on a manually labelled subset of the TREC Blogs08 dataset. Our experiments included three runs on different features: firstly on nouns, secondly on stylometric properties, and thirdly on punctuation statistics. The facet identification based on our approach was successful, although a significant number of candidate blogs were not retrieved at all. ; Presented at the Text REtrieval Conference (TREC 2009, 18th) held in Gaithersburg, Maryland on 17-20 November 2009. Published in the Proceedings of the Text REtrieval Conference (TREC 2009, 18th), 2009. The conference was co-sponsored by the National Institute of Standards and Technology (NIST), the Defense Advanced Research Projects Agency (DARPA), and the Advanced Research and Development Activity (ARDA).
Keyword: *BLOGS(INTERNET); *INFORMATION RETRIEVAL; *INFORMATION SCIENCES; *INTERNET; *IR(INFORMATION RETRIEVAL); *SOCIAL COMMUNICATION; AUSTRIA; Computer Programming and Software; DISTILLATION; Equipment and Methods; FOREIGN REPORTS; Information Science; Linguistics; RELEVANCE(INFORMATION SCIENCES); REPRINTS; STYLOMETRICS; SVM(SUPPORT VECTOR MACHINES); SYMPOSIA; Test Facilities; WORDS(LANGUAGE)
URL: http://www.dtic.mil/docs/citations/ADA517854
http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA517854
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern