An Enhanced Method for Period-3 based Exon and Gene Prediction

Anshu Vishnoi, Dr. Neelam Rup Prakash

Abstract


Identifying gene locations in a DNA sequence is an important problem in genomics. Nucleotides in the exons of a DNA sequence exhibit a periodicity of three, which appears as a spectral peak at frequency f = 1/3. This period-3 property in the exons of eukaryotic gene sequences enables signal-processing-based time-domain and frequency-domain methods to predict these regions. Identifying the period-3 regions helps in predicting gene locations within the DNA sequences of eukaryotic cells, which can be billions of nucleotides long. This paper presents DNA symbolic-to-numeric representations and discusses the existing methods of gene prediction. Finally, it proposes an enhancement that combines the features of the two best existing computationally efficient methods, namely the AMDF (average magnitude difference function) and the optimized method. The proposed method improves upon the existing methods in terms of gene prediction accuracy.
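The period-3 idea can be illustrated with a minimal sketch. This is not the authors' implementation or the proposed enhanced method; it only shows, under simple assumptions, how a binary indicator sequence, a sliding-window DFT measure at f = 1/3, and an AMDF at lag 3 might be computed. The function names (`binary_indicators`, `period3_power`, `amdf`), the window length, and the step size are hypothetical choices made for illustration, and the AMDF here is normalized by the number of compared samples, which may differ from the normalization used in the cited work.

```python
import numpy as np

def binary_indicators(seq):
    """Map a DNA string to four 0/1 indicator sequences, one per base."""
    seq = seq.upper()
    return {b: np.array([1.0 if c == b else 0.0 for c in seq]) for b in "ACGT"}

def period3_power(seq, window=351, step=3):
    """Sliding-window spectral power at f = 1/3, summed over the four bases.

    window and step are illustrative assumptions, not values from the paper.
    """
    ind = binary_indicators(seq)
    n = np.arange(window)
    kernel = np.exp(-2j * np.pi * n / 3)  # complex exponential at f = 1/3
    starts = np.arange(0, len(seq) - window + 1, step)
    power = np.zeros(len(starts))
    for i, s in enumerate(starts):
        for x in ind.values():
            # Squared magnitude of the DFT coefficient at f = 1/3
            power[i] += abs(np.dot(x[s:s + window], kernel)) ** 2
    return starts, power

def amdf(x, lag=3):
    """Average magnitude difference of x at a positive lag.

    A signal that repeats with period `lag` gives a value near zero.
    """
    x = np.asarray(x, dtype=float)
    return float(np.mean(np.abs(x[lag:] - x[:-lag])))

if __name__ == "__main__":
    toy = "ATG" * 100  # artificial, perfectly period-3 sequence
    starts, power = period3_power(toy, window=30, step=3)
    print(power[:3])
    print(amdf(binary_indicators(toy)["A"], lag=3))
```

In a sketch like this, windows with high power at f = 1/3 (or a deep AMDF minimum at lag 3) would be candidate coding regions; real sequences are noisy, so a threshold or further filtering would be needed.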

Keywords


AMDF, Binary Indicator Sequence, Complex Indicator Sequence, DFT, DNA, EIIP Indicator Sequence, Gene Prediction, Paired Numeric, Period-3, Protein Coding, ROC.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.