1 |
Mining acronym expansions and their meanings using query click log
|
|
|
|
In: http://www.mpi-inf.mpg.de/~btaneva/downloads/fp099-taneva.pdf (2013)
|
|
BASE
|
|
2 |
Extending an Open Source Enterprise Service Bus for SQL Statement Transformation to Enable Cloud Data Access
|
|
Xia, Simin. - : Stuttgart, Germany, Universität Stuttgart, 2013
|
|
In: ftp://ftp.informatik.uni-stuttgart.de/pub/library/medoc.ustuttgart_fi/MSTR-3506/MSTR-3506.pdf (2013)
|
|
BASE
|
|
3 |
Identifying task-based sessions in search engine query logs
|
|
|
|
In: http://pomino.isti.cnr.it/%7Esilvestr/wp-content/uploads/2011/02/wsdm2011.pdf (2011)
|
|
Abstract:
The research challenge addressed in this paper is to devise effective techniques for identifying task-based sessions, i.e. sets of possibly non-contiguous queries issued by the user of a Web Search Engine for carrying out a given task. In order to evaluate and compare different approaches, we built, by means of a manual labeling process, a ground-truth where the queries of a given query log have been grouped in tasks. Our analysis of this ground-truth shows that users tend to perform more than one task at the same time, since about 75% of the submitted queries involve a multi-tasking activity. We formally define the Task-based Session Discovery Problem (TSDP) as the problem of best approximating the manually annotated tasks, and we propose several variants of well-known clustering algorithms, as well as a novel efficient heuristic algorithm, specifically tuned for solving the TSDP. These algorithms also exploit the collaborative knowledge collected by Wiktionary and Wikipedia for detecting query pairs that are not similar from a lexical content point of view, but actually semantically related. The proposed algorithms have been evaluated on the above ground-truth, and are shown to perform better than state-of-the-art approaches, because they effectively take into account the multi-tasking behavior of users.
|
|
Keyword:
Categories and Subject Descriptors: H.2.8 [Database Management]: Database Applications - Data mining; H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval - Clustering, Query formulation, Search process. General Terms: Algorithms; Design; Experimentation. Keywords: Query log analysis; Query log session detection; Task-based session; Query clustering; User search intent
|
|
URL: http://pomino.isti.cnr.it/%7Esilvestr/wp-content/uploads/2011/02/wsdm2011.pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.680.6739
|
|
BASE
|
|
4 |
Identifying task-based sessions in search engine query logs
|
|
|
|
In: http://www.dsi.unive.it/~orlando/PUB/wsdm2011.pdf (2011)
|
|
BASE
|
|
5 |
Frontex real-time news event extraction framework
|
|
|
|
In: http://users.cis.fiu.edu/~lzhen001/activities/KDD2011Program/docs/p749.pdf (2011)
|
|
BASE
|
|
6 |
Closed World Data Exchange
|
|
|
|
In: http://www.informatik.hu-berlin.de/%7Ehernich/pub/tods11.pdf (2011)
|
|
BASE
|
|
7 |
Evaluation of Automated Business Process Optimization
|
|
|
|
In: ftp://ftp.informatik.uni-stuttgart.de/pub/library/medoc.ustuttgart_fi/DIP-3152/DIP-3152.pdf (2011)
|
|
BASE
|
|
8 |
A taxonomy of sequential pattern mining algorithms
|
|
|
|
In: http://www.cs.uwindsor.ca/~cezeife/acmsurvey_paper.pdf (2010)
|
|
BASE
|
|
9 |
Probabilistic models for topic learning from images and captions in online biomedical literatures
|
|
|
|
In: http://cci.drexel.edu/faculty/yan/publications/acm_cikm_09_xinchen-final.pdf (2009)
|
|
BASE
|
|
10 |
Context-aware query suggestion by mining click-through and session data
|
|
|
|
In: http://rp-www.it.usyd.edu.au/~josiah/lemma/cao_et_al_kdd_2008.pdf (2008)
|
|
BASE
|
|
11 |
Extracting key-substring-group features for text classification
|
|
|
|
In: http://www.comp.nus.edu.sg/~leews/publications/dellzhang_kdd2006.pdf (2006)
|
|
BASE
|
|
12 |
Associative text categorization exploiting negated words
|
|
|
|
In: https://mailserver.di.unipi.it/ricerca/proceedings/SAC06/PDFs/Papers/T13P03.pdf (2006)
|
|
BASE
|
|
13 |
Improved Robustness of Signature-Based Near-Replica Detection via Lexicon Randomization
|
|
|
|
In: http://ir.iit.edu/~alek/470-kolcz2.pdf (2004)
|
|
BASE
|
|
14 |
Improved Robustness of Signature-Based Near-Replica Detection via Lexicon Randomization
|
|
|
|
In: http://ir.iit.edu/~abdur/publications/470-kolcz.pdf (2004)
|
|
BASE
|
|
15 |
Finding Surprising Patterns in a Time Series Database in Linear Time and Space
|
|
|
|
In: http://www.cse.unsw.edu.au/~qzhang/papers/p21.pdf (2002)
|
|
BASE
|
|
16 |
NCCU-MIG at NTCIR-10: Using Lexical, Syntactic, and Semantic Features for the RITE Tasks
|
|
|
|
In: http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings10/pdf/NTCIR/RITE/07-NTCIR10-RITE2-HuangW.pdf
|
|
BASE
|
|
17 |
The Web is becoming the larges.
|
|
|
|
In: http://wkd.iis.sinica.edu.tw/LiveTrans/pub/ACMTOIS2003.pdf
|
|
BASE
|
|
18 |
Research Track Poster: Web Mining from Competitors’ Websites
|
|
|
|
In: http://www.cs.uiuc.edu/class/fa05/cs591han/kdd05/docs/p550.pdf
|
|
BASE
|
|
19 |
IT Innovation Centre
|
|
|
|
In: http://users.ecs.soton.ac.uk/sem/ieee-is2014.pdf
|
|
BASE
|
|
20 |
Mining Named Entities with Temporally Correlated Bursts from Multilingual Web News Streams
|
|
|
|
In: http://www.mathcs.emory.edu/~kotov/papers/kotov-wsdm11.pdf
|
|
BASE
|
|
|
|