Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/35942
Title: Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization
Authors: Cai, XY
Li, WJ 
Zhang, RX
Keywords: Document summarization
Information diversity
Information coverage
Subspace clustering
Issue Date: 2014
Publisher: Elsevier
Source: Information sciences, 2014, v. 279, p. 764-775
Journal: Information sciences 
Abstract: Sentence clustering has been successfully applied in document summarization to discover the topics conveyed in a collection of documents. However, existing clustering-based summarization approaches seldom target both the diversity and the coverage of summaries, which are believed to be the two key factors determining summary quality. This work explores a systematic approach that allows diversity and coverage to be tackled within an integrated clustering-based summarization framework. Because each topic can normally be described by a set of keywords, and the choice of keywords is topic-dependent, we take advantage of the recently emerged subspace clustering technique to allow flexible keyword selection and improve the quality of sentence clustering. On this basis, we develop two clustering-based optimization strategies, namely local optimization and global optimization, to pursue these targets. Experimental results on the DUC datasets demonstrate the effectiveness and robustness of the proposed approach.
URI: http://hdl.handle.net/10397/35942
ISSN: 0020-0255
EISSN: 1872-6291
DOI: 10.1016/j.ins.2014.04.028
Appears in Collections: Journal/Magazine Article
