|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Articles in PresS, published online ahead of print March 5, 2002
Physiol Genomics, 10.1152/physiolgenomics.00086.2001
Submitted on September 26, 2001
Accepted on November 9, 2001
1 Roswell Park Cancer Institute, Buffalo, NY, USA
2 Institut de Genetique et Microbiologie, Universite de Paris-Sud, Cedex, France
3 Marine Biological Laboratory, Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Woods Hole, MA, USA
* To whom correspondence should be addressed. E-mail: mriley{at}mbl.edu.
The well-researched Escherichia. coli genome offers the opportunity to explore the value of using protein families within a single organism to enrich functional annotation procedures and to study mechanisms of protein evolution. Having identified multi-modular proteins resulting from gene fusion, and treated each module as a separate protein, non-overlapping sequence-similar families in E. coli could be assembled. Of 3902 proteins of length 100 residues or more, 2415 clustered into 609 protein families. The relatedness of function among members of each family was dissected in detail. Data on paralogous protein families provides valuable information in attributing putative function to unknown genes, supplementing existing function annotation. Enzymes, transporters, and regulators represent the three major types of proteins in E. coli. They are shown to have distinctive patterns in gene duplication and divergence and gene fusion, suggesting that details of protein evolution have been different for genes in these categories. Data for the complete list of paralogous protein families and updated functional annotation for E. coli K-12 are accessible in GenProtEC (http://genprotec.mbl.edu).
This article has been cited by other articles:
![]() |
A. Sivakumar, C. Wilton, and L. Holm From sequences to a functional unit Physiol Genomics, March 13, 2006; 25(1): 1 - 8. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. H. Serres, S. Goswami, and M. Riley GenProtEC: an updated and improved analysis of functions of Escherichia coli K-12 proteins Nucleic Acids Res., January 1, 2004; 32(90001): D300 - 302. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Matte, J. Sivaraman, I. Ekiel, K. Gehring, Z. Jia, and M. Cygler Contribution of Structural Genomics to Understanding the Biology of Escherichia coli J. Bacteriol., July 15, 2003; 185(14): 3994 - 4002. [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |
| Visit Other APS Journals Online |