Please use this identifier to cite or link to this item:
Title: Developing learning strategies for topic-based summarization
Authors: Ouyang, Y
Li, S
Li, W 
Keywords: Document summarization
Support vector regression
Issue Date: 2007
Source: CIKM '07 Proceedings of the sixteenth ACM conference on Conference on Information and Knowledge Management, Lisboa, Portugal, November 6-9, 2007, p. 79-86 How to cite?
Abstract: Most up-to-date well-behaved topic-based summarization systems are built upon the extractive framework. They score the sentences based on the associated features by manually assigning or experimentally tuning the weights of the features. In this paper, we discuss how to develop learning strategies in order to obtain the optimal feature weights automatically, which can be used for assigning a sound score to a sentence characterized with a set of features. The two fundamental issues are about training data and learning models. To save the costly manual annotation time and effort, we construct the training data by labeling the sentence with a "true" score calculated according to human summaries. The Support Vector Regression (SVR) model is then used to learn how to relate the "true" score of the sentence to its features. Once the relations have been mathematically modeled, SVR is able to predict the "estimated" score for any given sentence. The evaluations by ROUGE-2 criterion on DUC 2006 and DUC 2005 document sets demonstrate the competitiveness and the adaptability of the proposed approaches.
ISBN: 978-1-59593-803-9
DOI: 10.1145/1321440.1321454
Appears in Collections:Conference Paper

View full-text via PolyU eLinks SFX Query
Show full item record


Last Week
Last month
Citations as of Jul 29, 2018

Page view(s)

Last Week
Last month
Citations as of Aug 14, 2018

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.