Title: Spectral approaches for DNA sequence classification
Authors: Cheng, KO; Law, NF; Siu, WC
Keywords: Coding region; DNA sequence; Fourier approach; Spectral analysis; Z-curve approach
Issue Date: 2007
Source: Fourth International Conference on Information Technology and Applications, ICITA 2007, 2007, p. 541-544
Abstract: Z-curve features are among the popular features used in DNA sequence classification. Here, we studied the Z-curve features from a signal processing point of view. In particular, the Z-curve features are re-interpreted through a spectral formulation. Our analysis showed that there are significant differences in the spectral interpretation between the Z-curve formulation and the FFT (Fast Fourier Transform) approach. From the spectral formulation of the Z-curve approach, we obtained three modified sequences that characterize different biological properties which are useful for coding region prediction. Spectral analysis on the modified sequences showed a much more prominent three-periodicity property in coding regions than using the FFT approach. Our experiments indicated that for long sequences, prominent peaks at 2π/3 are observed in coding regions. For short sequences, peaks can still be observed in coding regions. We also obtained good classification performance using the spectral features derived from the three modified sequences.
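Illustration (not from the paper): the three-periodicity peak at 2π/3 mentioned in the abstract can be sketched as follows. This is a minimal sketch that assumes the textbook Z-curve step sequences (purine/pyrimidine, amino/keto, weak/strong hydrogen bonding) and a direct Fourier sum evaluated at 2π/3. The paper's own three modified sequences are not given in this record, so the function names `zcurve_steps`, `power_at`, and `three_periodicity` are hypothetical and the code should not be read as the authors' formulation.

```python
import cmath
import math


def zcurve_steps(seq):
    """Return the three per-base Z-curve step sequences (assumed textbook form).

    Each entry is +1 or -1. Any symbol outside A, C, G, T maps to -1 in all
    three sequences, which is a simplification made for this sketch.
    """
    x = [1 if b in "AG" else -1 for b in seq]  # purine (A, G) vs pyrimidine (C, T)
    y = [1 if b in "AC" else -1 for b in seq]  # amino (A, C) vs keto (G, T)
    z = [1 if b in "AT" else -1 for b in seq]  # weak (A, T) vs strong (C, G) H-bond
    return x, y, z


def power_at(signal, omega):
    """Squared magnitude of the Fourier sum of `signal` at angular frequency omega."""
    total = sum(v * cmath.exp(-1j * omega * n) for n, v in enumerate(signal))
    return abs(total) ** 2


def three_periodicity(seq):
    """Sum of the spectral power at 2*pi/3 over the three step sequences."""
    omega = 2 * math.pi / 3
    return sum(power_at(s, omega) for s in zcurve_steps(seq.upper()))


if __name__ == "__main__":
    periodic = "ATG" * 30  # toy sequence with an exact period of three
    flat = "A" * 90        # toy sequence with no period-three structure
    print(three_periodicity(periodic), three_periodicity(flat))
```

On these toy inputs the period-three repeat gives a large value while the homopolymer gives a value close to zero, which is the kind of contrast the abstract describes between coding and non-coding regions. Real sequences would need windowing and normalization by length, which this sketch leaves out.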
Description: 4th International Conference on Information Technology and Applications, ICITA 2007, Harbin, 15-18 January 2007
ISBN: 0980326702
Appears in Collections: Conference Paper

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.