Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/20906
Title: Combining co-clustering with noise detection for theme-based summarization
Authors: Cai, X
Li, W 
Zhang, R
Keywords: Document analysis
Noise detection
Sentence and word co-clustering
Theme-based summarization
Issue Date: 2013
Source: ACM transactions on speech and language processing, 2013, v. 10, no. 4, 16
Journal: ACM Transactions on Speech and Language Processing 
Abstract: To overcome the fact that the length of sentences is short and their content is limited, we regard words as independent text objects rather than features of sentences in sentence clustering and develop two co-clustering frameworks, namely integrated clustering and interactive clustering, to cluster sentences and words simultaneously. Since real-world datasets always contain noise, we incorporate noise detection and removal to enhance clustering of sentences and words. Meanwhile, a semi-supervised approach is explored to incorporate the query information (and the sentence information in early document sets) in theme-based summarization. Thorough experimental studies are conducted. When evaluated on the DUC 2005-2007 datasets and TAC 2008-2009 datasets, the performance of the two noise-detecting co-clustering approaches is comparable with that of the top three systems. The results also demonstrate that the interactive algorithm with noise detection is more effective than the noise-detecting integrated algorithm.
URI: http://hdl.handle.net/10397/20906
ISSN: 1550-4875
DOI: 10.1145/2513563
Appears in Collections: Journal/Magazine Article

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.