Title: Deriving taxonomy from documents at sentence level
Authors: Liu, Y
Loh, HT
Lu, WF
Issue Date: 2008
Publisher: Information Science Reference
Source: In HAD Prado & E Ferneda (Eds.), Emerging technologies of text mining: techniques and applications, p. 99-118. Hershey, PA: Information Science Reference, 2008.
Abstract: This chapter introduces an approach to deriving a taxonomy from documents using a novel document profile model, which represents documents with semantic information systematically generated at the sentence level. A frequent word sequence method is proposed to search for the salient semantic information and is integrated into the document profile model. An experimental study of taxonomy generation using hierarchical agglomerative clustering shows a significant improvement in F-score with the document profile model. A close examination reveals that the integrated semantic information contributes clearly to this improvement compared with the classic bag-of-words approach. These results encourage further investigation of applying the document profile model to a wide range of text mining tasks.
ISBN: 9781599043739 (hbk.)
9781599043753 (ebook)
Appears in Collections: Book Chapter

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.