Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/29200
Title: An evolutionary approach for discovering effective composite features for text categorization
Authors: Wong, AKS
Lee, JWT
Keywords: Evolutionary computation
Feature extraction
Sampling methods
Text analysis
Issue Date: 2007
Publisher: IEEE
Source: IEEE International Conference on Systems, Man and Cybernetics, 2007 : ISIC, 7-10 October 2007, Montreal, Que., p. 3045-3050 How to cite?
Abstract: The study of text categorization has assumed special significance in the Internet era in helping us navigate the ocean of web pages and emails that continue to grow in an unrelenting pace. In many previous works on text classifications, it has been shown that composite features consisting of multiple word tokens like statistical phrases can contribute effectively to the classification task. However finding useful composite features through comprehensive search from the vast number of possibilities is often prohibitive in terms of computing resource requirements. In the past, to make the search feasible, we often limit the search space by imposing some parametric constraints like minimum frequency and/or number of words in the composite feature. In this paper we proposed a new evolutionary approach to find effective composite features for classification, an approach that combines probabilistic feature generation with error-biased sampling We demonstrate the effectiveness of our approach using the Reuters-21578 test collection.
URI: http://hdl.handle.net/10397/29200
ISBN: 978-1-4244-0990-7
978-1-4244-0991-4 (E-ISBN)
DOI: 10.1109/ICSMC.2007.4413981
Appears in Collections:Conference Paper

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

30
Last Week
0
Last month
Checked on Aug 20, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.