Title: Classification of heterogeneous gene expression data
Authors: Fung, BYM
Ng, VTY 
Keywords: Classification
Feature selection
Gene expression data
Significance analysis of microarrays
Issue Date: 2003
Publisher: ACM
Source: ACM SIGKDD Explorations newsletter, 2003, v. 5, no. 2, p. 69-78
Journal: ACM SIGKDD Explorations newsletter 
Abstract: Recent advanced technologies in DNA microarray analysis are intensively applied in disease classification, especially cancer classification. Most recently proposed gene expression classifiers can successfully classify testing samples obtained from the same microarray experiment as the training samples, under the assumption that the symmetric errors are constant among training and testing samples. However, classification performance degrades with heterogeneous testing samples obtained from different microarray experiments. In this paper, we propose "impact factors" (IFs) to measure the variations between individual classes in training samples and heterogeneous testing samples, and integrate the IFs into classifiers for classification of heterogeneous samples. Two publicly available lung adenocarcinoma gene expression data sets are used in our experiments to demonstrate the effectiveness of the IFs. The results show that integrating the IFs into the Golub and Slonim (GS) and k-nearest neighbors (kNN) classifiers further improves their classification accuracy on heterogeneous samples. Moreover, the classification accuracy of the integrated GS classifier is around 90%.
ISSN: 1931-0145
DOI: 10.1145/980972.980982
Appears in Collections: Journal/Magazine Article

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.