DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
Genre identification and goal-focused summarization
In: http://www.cs.fiu.edu/%7Elli003/Sum/CIKM/2007/p889-goldstein.pdf (2007)
Abstract: In this paper, we present a novel technique of first performing document genre identification, then utilizing the genre for producing tailored summaries based on a user’s information seeking needs – genre oriented goal-focused summarization – such as a plot or opinion summary of a movie review. We create a test corpus to determine genre classification accuracy for 16 genres, and examine performance on various amounts of training data for machine learning algorithms- Random Forests, SVM light and Naïve Bayes. Results show that Random Forests outperforms SVM light and Naïve Bayes. The genre tag is used to inform a downstream summarization engine. We define types of summaries for 7 genres, create a ground truth corpus and analyze the results of genre oriented goal-focused summarization, showing that this type of user based summarization requires different algorithms than the leading sentence baseline which is known to perform well in the case of news articles.
Keyword: Algorithms; Data Mining; Design; Evaluation; Experimentation; Machine Learning; Metadata Extraction; Performance Keywords Genre Identification; Summarization; Task-based Information Retrieval; Text Categorization; Text Classification
URL: http://www.cs.fiu.edu/%7Elli003/Sum/CIKM/2007/p889-goldstein.pdf
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.222.8762
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern