Physiol. Genomics  AJP: Regulatory, Integrative and Comparative Physiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
 QUICK SEARCH:   [advanced]


     


Physiol. Genomics (March 5, 2002). doi:10.1152/physiolgenomics.00086.2001
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
9/1/15    most recent
00086.2001v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Liang, P.
Right arrow Articles by Riley, M.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Liang, P.
Right arrow Articles by Riley, M.

Articles in PresS, published online ahead of print March 5, 2002
Physiol Genomics, 10.1152/physiolgenomics.00086.2001
Submitted on September 26, 2001
Accepted on November 9, 2001

Physiological genomics of E. coli protein families

Ping Liang1, Bernard Labedan2, and Monica Riley3*

1 Roswell Park Cancer Institute, Buffalo, NY, USA
2 Institut de Genetique et Microbiologie, Universite de Paris-Sud, Cedex, France
3 Marine Biological Laboratory, Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Woods Hole, MA, USA

* To whom correspondence should be addressed. E-mail: mriley{at}mbl.edu.

The well-researched Escherichia. coli genome offers the opportunity to explore the value of using protein families within a single organism to enrich functional annotation procedures and to study mechanisms of protein evolution. Having identified multi-modular proteins resulting from gene fusion, and treated each module as a separate protein, non-overlapping sequence-similar families in E. coli could be assembled. Of 3902 proteins of length 100 residues or more, 2415 clustered into 609 protein families. The relatedness of function among members of each family was dissected in detail. Data on paralogous protein families provides valuable information in attributing putative function to unknown genes, supplementing existing function annotation. Enzymes, transporters, and regulators represent the three major types of proteins in E. coli. They are shown to have distinctive patterns in gene duplication and divergence and gene fusion, suggesting that details of protein evolution have been different for genes in these categories. Data for the complete list of paralogous protein families and updated functional annotation for E. coli K-12 are accessible in GenProtEC (http://genprotec.mbl.edu).




This article has been cited by other articles:


Home page
Physiol. GenomicsHome page
A. Sivakumar, C. Wilton, and L. Holm
From sequences to a functional unit
Physiol Genomics, March 13, 2006; 25(1): 1 - 8.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. H. Serres, S. Goswami, and M. Riley
GenProtEC: an updated and improved analysis of functions of Escherichia coli K-12 proteins
Nucleic Acids Res., January 1, 2004; 32(90001): D300 - 302.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. Matte, J. Sivaraman, I. Ekiel, K. Gehring, Z. Jia, and M. Cygler
Contribution of Structural Genomics to Understanding the Biology of Escherichia coli
J. Bacteriol., July 15, 2003; 185(14): 3994 - 4002.
[Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Visit Other APS Journals Online
Copyright © 2002 by the American Physiological Society.