1 |
gSpan: Graph-Based Substructure Pattern Mining
|
|
|
|
In: http://www-faculty.cs.uiuc.edu/~hanj/pdf/icdm02_gspan.pdf (2002)
|
|
BASE
|
|
Show details
|
|
2 |
on the paper ”OPINOSIS- A Graph-Based Approach to Abstractive Summarization of Highly Redundant Opinions” by
|
|
|
|
In: http://www.mpi-inf.mpg.de/departments/d5/teaching/ws10_11/hir/reports/CosminaCroitoru.pdf
|
|
BASE
|
|
Show details
|
|
3 |
� Problems
|
|
|
|
In: http://kavita-ganesan.com/sites/default/files/opinosis-presentation.ppt.pdf
|
|
BASE
|
|
Show details
|
|
4 |
Semantic Frame-Based Document Representation for Comparable Corpora
|
|
|
|
In: http://web.engr.illinois.edu/~hanj/pdf/icdm13_hkim.pdf
|
|
Abstract:
Abstract—Document representation is a fundamental prob-lem for text mining. Many efforts have been done to generate concise yet semantic representation, such as bag-of-words, phrase, sentence and topic-level descriptions. Nevertheless, most existing techniques counter difficulties in handling mono-lingual comparable corpus, which is a collection of mono-lingual documents conveying the same topic. In this paper, we propose the use of frame, a high-level semantic unit, and construct frame-based representations to semantically describe documents by bags of frames, using an information network approach. One major challenge in this representation is that semantically similar frames may be of different forms. For example, “radiation leaked ” in one news article can appear as “the level of radiation increased ” in another article. To tackle the problem, a text-based information network is constructed among frames and words, and a link-based similarity measure called SynRank is proposed to calculate similarity between frames. As a result, different variations of the semantically similar frames are merged into a single descriptive frame using clustering, and a document can then be represented as a bag of representative frames. It turns out that frame-based document representation not only is more interpretable, but also can facilitate other text analysis tasks such as event track-ing effectively. We conduct both qualitative and quantitative experiments on three comparable news corpora, to study the effectiveness of frame-based document representation and the similarity measure SynRank, respectively, and demonstrate that the superior performance of frame-based document represen-tation on different real-world applications. Keywords-document representation; bag of frames; text in-formation network; link-based clustering I.
|
|
URL: http://web.engr.illinois.edu/~hanj/pdf/icdm13_hkim.pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.459.4000
|
|
BASE
|
|
Hide details
|
|
5 |
iTopicModel: Information Network-Integrated Topic Modeling
|
|
|
|
In: http://www.cs.uiuc.edu/homes/hanj/pdf/icdm09_ysun.pdf
|
|
BASE
|
|
Show details
|
|
6 |
Content Coverage Maximization on Word Networks for Hierarchical Topic Summarization
|
|
|
|
In: http://www.cs.uiuc.edu/homes/hanj/pdf/cikm13_cwang.pdf
|
|
BASE
|
|
Show details
|
|
7 |
The Wisdom of Minority: Unsupervised Slot Filling Validation based on Multi-dimensional Truth-Finding
|
|
|
|
In: http://aclweb.org/anthology/C/C14/C14-1149.pdf
|
|
BASE
|
|
Show details
|
|
8 |
SCENE: Structural Conversation Evolution NEtwork
|
|
|
|
In: http://www.cs.uiuc.edu/%7Ehanj/pdf/asonam11_mdanilevsky.pdf
|
|
BASE
|
|
Show details
|
|
9 |
Exploiting Background Information Networks to Enhance Bilingual Event Extraction Through Topic Modeling
|
|
|
|
In: http://www.thinkmind.org/download.php?articleid=immm_2011_1_40_20203
|
|
BASE
|
|
Show details
|
|
|
|