|
|
||||||||
1 Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts 02543
2 Institut de Génétique et Microbiologie, Centre National de la Recherche Scientifique UMR 8621, Bâtiment 409, Université de Paris-Sud, 91405 Orsay Cedex, France
The well-researched Escherichia coli genome offers the opportunity to explore the value of using protein families within a single organism to enrich functional annotation procedures and to study mechanisms of protein evolution. Having identified multimodular proteins resulting from gene fusion, and treated each module as a separate protein, nonoverlapping sequence-similar families in E. coli could be assembled. Of 3,902 proteins of length 100 residues or more, 2,415 clustered into 609 protein families. The relatedness of function among members of each family was dissected in detail. Data on paralogous protein families provides valuable information in attributing putative function to unknown genes, supplementing existing function annotation. Enzymes, transporters, and regulators represent the three major types of proteins in E. coli. They are shown to have distinctive patterns in gene duplication and divergence and gene fusion, suggesting that details of protein evolution have been different for genes in these categories. Data for the complete list of paralogous protein families and updated functional annotation for E. coli K-12 are accessible in GenProtEC (http://genprotec.mbl.edu).
module; sequence similarity; protein family; predicting protein function; annotation; evolution
This article has been cited by other articles:
![]() |
L. A. Nahum, S. Goswami, and M. H. Serres Protein families reflect the metabolic diversity of organisms and provide support for functional prediction Physiol Genomics, August 7, 2009; 38(3): 250 - 260. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Sivakumar, C. Wilton, and L. Holm From sequences to a functional unit Physiol Genomics, March 13, 2006; 25(1): 1 - 8. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. H. Serres, S. Goswami, and M. Riley GenProtEC: an updated and improved analysis of functions of Escherichia coli K-12 proteins Nucleic Acids Res., January 1, 2004; 32(90001): D300 - 302. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Matte, J. Sivaraman, I. Ekiel, K. Gehring, Z. Jia, and M. Cygler Contribution of Structural Genomics to Understanding the Biology of Escherichia coli J. Bacteriol., July 15, 2003; 185(14): 3994 - 4002. [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |