Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/14172
Title: Discovering high-order patterns of gene expression levels
Authors: Wong, AKC
Au, WH
Chan, KCC 
Keywords: Bioinformatics
Data mining
Gene expression
Microarray data
Pattern discovery
Issue Date: 2008
Publisher: Mary Ann Liebert Inc
Source: Journal of computational biology, 2008, v. 15, no. 6, p. 625-637 How to cite?
Journal: Journal of Computational Biology 
Abstract: This paper reports the discovery of statistically significant association patterns of gene expression levels from microarray data. By association patterns, we mean certain gene expression intensity intervals having statistically significant associations among themselves and with the tissue classes, such as cancerous and normal tissues. We describe how the significance of the associations among gene expression levels can be evaluated using a statistical measure in an objective manner. If an association is found to be significant based on the measure, we say that it is statistically significant. Given a gene expression data set, we first cluster the entire gene pool comprising all the genes into groups by optimizing the correlation (or more precisely, interdependence) among the gene expression levels within gene groups. From each group, we select one or several genes that are most correlated with other genes within that group to form a smaller gene pool. This gene pool then constitutes the most representative genes from the original pool. Our pattern discovery algorithm is then used, for the first time, to discover the significant association patterns of gene expression levels among the genes from the small pool. With our method, it is more effective to discover and express the associations in terms of their intensity intervals. Hence, we discretize each gene expression levels into intervals maximizing the interdependence between the gene expression and the tissue classes. From this data set of gene expression intervals, we discover the association patterns representing statistically significant associations, some positively and some negatively, with different tissue classes. We apply our pattern discovery methodology to the colon-cancer microarray gene expression data set. It consists of 2000 genes and 62 samples taken from colon cancer or normal subjects. The statistically significant combinations of gene expression levels that repress or activate colon cancer are revealed in the colon-cancer data set. The discovered association patterns are ranked according to their statistical significance and displayed for interpretation and further analysis.
URI: http://hdl.handle.net/10397/14172
DOI: 10.1089/cmb.2007.0147
Appears in Collections:Journal/Magazine Article

Access
View full-text via PolyU eLinks SFX Query
Show full item record

SCOPUSTM   
Citations

3
Last Week
0
Last month
0
Citations as of Nov 18, 2017

WEB OF SCIENCETM
Citations

2
Last Week
0
Last month
0
Citations as of Nov 15, 2017

Page view(s)

64
Last Week
1
Last month
Checked on Nov 19, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.