Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/26688
Title: Ranking through clustering : an integrated approach to multi-document summarization
Authors: Cai, X
Li, W 
Keywords: Document summarization
sentence clustering
sentence ranking
Issue Date: 2013
Publisher: Institute of Electrical and Electronics Engineers
Source: IEEE transactions on audio, speech and language processing, 2013, v. 21, no. 7, 6480794, p. 1424-1433 How to cite?
Journal: IEEE transactions on audio, speech and language processing 
Abstract: Multi-document summarization aims to create a condensed summary while retaining the main characteristics of the original set of documents. Under such background, sentence ranking has hitherto been the issue of most concern. Since documents often cover a number of topic themes with each theme represented by a cluster of highly related sentences, sentence clustering has been explored in the literature in order to provide more informative summaries. For each topic theme, the rank of terms conditional on this topic theme should be very distinct, and quite different from the rank of terms in other topic themes. Existing cluster-based summarization approaches apply clustering and ranking in isolation, which leads to incomplete, or sometimes rather biased, analytical results. A newly emerged framework uses sentence clustering results to improve or refine the sentence ranking results. Under this framework, we propose a novel approach that directly generates clusters integrated with ranking in this paper. The basic idea of the approach is that ranking distribution of sentences in each cluster should be quite different from each other, which may serve as features of clusters and new clustering measures of sentences can be calculated accordingly. Meanwhile, better clustering results can achieve better ranking results. As a result, ranking and clustering by mutually and simultaneously updating each other so that the performance of both can be improved. The effectiveness of the proposed approach is demonstrated by both the cluster quality analysis and the summarization evaluation conducted on the DUC 2004-2007 datasets.
URI: http://hdl.handle.net/10397/26688
ISSN: 1558-7916
EISSN: 1558-7924
DOI: 10.1109/TASL.2013.2253098
Appears in Collections:Journal/Magazine Article

Access
View full-text via PolyU eLinks SFX Query
Show full item record

SCOPUSTM   
Citations

9
Last Week
0
Last month
0
Citations as of Nov 10, 2017

WEB OF SCIENCETM
Citations

5
Last Week
0
Last month
0
Citations as of Nov 16, 2017

Page view(s)

51
Last Week
1
Last month
Checked on Nov 20, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.