Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/39816
Title: Developing learning strategies for topic-based summarization
Authors: Ouyang, Y
Li, S
Li, W 
Keywords: Document summarization
Support vector regression
Issue Date: 2007
Source: CIKM '07 Proceedings of the sixteenth ACM conference on Conference on Information and Knowledge Management, Lisboa, Portugal, November 6-9, 2007, p. 79-86
Abstract: Most current, well-performing topic-based summarization systems are built on the extractive framework. They score sentences from their associated features, with feature weights that are assigned manually or tuned experimentally. In this paper, we discuss how to develop learning strategies that obtain the optimal feature weights automatically, so that a sound score can be assigned to a sentence characterized by a set of features. The two fundamental issues are the training data and the learning model. To save the time and effort of costly manual annotation, we construct the training data by labeling each sentence with a "true" score calculated from human summaries. A Support Vector Regression (SVR) model is then used to learn how the "true" score of a sentence relates to its features. Once these relations have been modeled mathematically, SVR can predict an "estimated" score for any given sentence. Evaluations with the ROUGE-2 criterion on the DUC 2006 and DUC 2005 document sets demonstrate the competitiveness and adaptability of the proposed approaches.
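Illustration (not part of the original record): the abstract does not say how the "true" score is computed from human summaries, so the following is only a minimal sketch under an assumed rule. Here the score is the average fraction of a sentence's word bigrams that also appear in each human summary, a choice loosely motivated by the ROUGE-2 evaluation mentioned above. The function names `bigrams` and `true_score` and the sample texts are hypothetical.

```python
from collections import Counter


def bigrams(text):
    """Return the multiset of lowercased word bigrams in a text."""
    tokens = text.lower().split()
    return Counter(zip(tokens, tokens[1:]))


def true_score(sentence, human_summaries):
    """Assumed labeling rule (not taken from the paper): the mean, over
    human summaries, of the fraction of the sentence's bigrams that the
    summary also contains."""
    sent = bigrams(sentence)
    total = sum(sent.values())
    if total == 0 or not human_summaries:
        return 0.0
    fractions = []
    for summary in human_summaries:
        ref = bigrams(summary)
        # Clipped overlap: a bigram counts at most as often as it
        # occurs in the reference summary.
        overlap = sum(min(count, ref[bg]) for bg, count in sent.items())
        fractions.append(overlap / total)
    return sum(fractions) / len(fractions)


# Made-up example data, for illustration only.
summaries = [
    "the storm closed the airport",
    "flights were cancelled after the storm closed the airport",
]
label = true_score("the storm closed the airport on monday", summaries)
```

Scores produced this way could serve as regression targets, paired with each sentence's feature vector, for a support vector regression implementation such as scikit-learn's `sklearn.svm.SVR`; which features and which regression settings the authors used is not stated in this record.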
URI: http://hdl.handle.net/10397/39816
ISBN: 978-1-59593-803-9
DOI: 10.1145/1321440.1321454
Appears in Collections: Conference Paper

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.