|
|
||||||||
Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokahama Institute, Yokohama City, Kanagawa 230-0045; Genome Science Laboratory, RIKEN Tsukuba Institute, and Core Research of Evolutional Science and Technology, Japan Science and Technology Corporation, Tsukuba City, Ibaraki 305-0074, Japan
Here we present an amino acid translation program designed to suggest the position of experimental frameshift errors and predict amino acid sequences for full-length cDNA sequences having phred scores. Our program generates artificial insertions into artificial deletions from low-accuracy positions of the original sequence, thereby generating many candidate sequences. The validity of the most probable sequence (the likelihood that it represents the actual protein) is evaluated by using a score (Va) that is calculated in light of the Kozak consensus, preferred codon usage, and position of the initiation codon. To evaluate the software, we have used a database in which, out of 612 cDNA sequences, 524 (86%) carried 773 frameshift errors in the coding sequence. Our software detected and corrected 48% of the total frameshift errors in 62% of the total cDNA sequences with frameshift errors. The false positive rate of frameshift correction was 9%, and 91% of the suggested frameshifts were true.
phred score; Kozak consensus; codon usage; initiation codon; base-call error
This article has been cited by other articles:
![]() |
K. Mochida, T. Yoshida, T. Sakurai, Y. Ogihara, and K. Shinozaki TriFLDB: A Database of Clustered Full-Length Coding Sequences from Triticeae with Applications to Comparative Grass Genomics Plant Physiology, July 1, 2009; 150(3): 1135 - 1146. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Gouzy, S. Carrere, and T. Schiex FrameDP: sensitive peptide detection on noisy matured sequences Bioinformatics, March 1, 2009; 25(5): 670 - 671. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Papanicolaou, S. Gebauer-Jung, M. L. Blaxter, W. Owen McMillan, and C. D. Jiggins ButterflyBase: a platform for lepidopteran genomics Nucleic Acids Res., January 11, 2008; 36(suppl_1): D582 - D587. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. Nagaraj, R. B. Gasser, and S. Ranganathan A hitchhiker's guide to expressed sequence tag (EST) analysis Brief Bioinform, January 1, 2007; 8(1): 6 - 21. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. O. Fernandes, M. G. Mackenzie, G. Elgar, Y. Suzuki, S. Watabe, J. R. Kinghorn, and I. A. Johnston A genomic approach to reveal novel genes associated with myotube formation in the model teleost, Takifugu rubripes Physiol Genomics, August 11, 2005; 22(3): 327 - 338. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Furuno, T. Kasukawa, R. Saito, J. Adachi, H. Suzuki, R. Baldarelli, Y. Hayashizaki, and Y. Okazaki CDS Annotation in Full-Length cDNA Sequence Genome Res., June 1, 2003; 13(6): 1478 - 1487. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kochiwa, R. Suzuki, T. Washio, R. Saito, T. R. G. E. R. G. Phase II Team, H. Bono, P. Carninci, Y. Okazaki, R. Miki, Y. Hayashizaki, et al. Inferring Alternative Splicing Patterns in Mouse from a Full-Length cDNA Library and Microarray Data Genome Res., August 1, 2002; 12(8): 1286 - 1293. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kawaji, C. Schonbach, Y. Matsuo, J. Kawai, Y. Okazaki, Y. Hayashizaki, and H. Matsuda Exploration of Novel Motifs Derived from Mouse cDNA Sequences Genome Res., March 1, 2002; 12(3): 367 - 378. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |