DSpace Repository

A reduced computational load protein coding predictor using equivalent amino acid sequence of DNA string with period-3 based time and frequency domain analysis

Show simple item record

dc.contributor.author Meher, Jayakishan K.
dc.contributor.author Dash, Gananath N.
dc.contributor.author Meher, Pramod K.
dc.contributor.author Raval, Mukesh K.
dc.date.accessioned 2016-10-18T05:48:35Z
dc.date.available 2016-10-18T05:48:35Z
dc.date.issued 2011-07
dc.identifier.citation American Journal of Molecular Biology, 2011, 1, 79-86 en_US
dc.identifier.uri http://dx.doi.org/10.4236/ajmb.2011.12010
dc.identifier.uri http://hdl.handle.net/123456789/970
dc.description.abstract Development of efficient gene prediction algorithms is one of the fundamental efforts in gene prediction study in the area of genomics. In genomic signal processing the basic step of the identification of protein coding regions in DNA sequences is based on the period-3 property exhibited by nucleotides in exons. Several approaches based on signal processing tools and numerical representations have been applied to solve this problem, trying to achieve more accurate predictions. This paper presents a new indicator sequence based on amino acid sequence, called as aminoacid indicator sequence, derived from DNA string that uses the existing signal processing based time-domain and frequency domain methods to predict these regions within the billions long DNA sequence of eukaryotic cells which reduces the computational load by one-third. It is known that each triplet of bases, called as codon, instructs the cell machinery to synthesize an amino acid. The codon sequence therefore uniquely identifies an amino acid sequence which defines a protein. Thus the protein coding region is attributed by the codons in amino acid sequence. This property is used for detection of period-3 regions using amino acid sequence. Physico-chemical properties of amino acids are used for numerical representation. Various accuracy measures such as exonic peaks, discriminating factor, sensitivity, specificity, miss rate, wrong rate and approximate correlation are used to demonstrate the efficacy of the proposed predictor. The proposed method is validated on various organisms using the standard data-set HMR195, Burset and Guigo and KEGG. The simulation result shows that the proposed method is an effective approach for protein coding prediction. en_US
dc.language.iso en en_US
dc.publisher Scientific Research Publishing en_US
dc.subject Genomics en_US
dc.subject Bioinformatics en_US
dc.subject Codon en_US
dc.subject Coding region en_US
dc.subject Amino Acid Sequence en_US
dc.subject Fourier Transform en_US
dc.subject Antinotch Filter en_US
dc.subject Periodicity-3 en_US
dc.subject Indicator Sequence en_US
dc.title A reduced computational load protein coding predictor using equivalent amino acid sequence of DNA string with period-3 based time and frequency domain analysis en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account