Physiol. Genomics 35: 55-64, 2008.
First published July 8, 2008; doi:10.1152/physiolgenomics.90247.2008
1094-8341/08 $8.00
Received 12 May 2008;
accepted in final form 7 July 2008.
Physiological Genomics 35:55-64 (2008)
1094-8341/08 $8.00 © 2008 American Physiological Society
Prioritization of candidate disease genes for metabolic syndrome by computational analysis of its defining phenotypes
Nicki Tiffin
1,
Ikechi Okpechi
2,
Carolina Perez-Iratxeta
3,
Miguel A. Andrade-Navarro
3,4 and
Rajkumar Ramesar
1
1 Division of Human Genetics, MRC Human Genetics Research Unit, Institute for Infectious Diseases and Molecular Medicine, Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa
2 Nephrology Division and Department of Medicine, Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa
3 Ontario Genomics Innovation Centre, Ottawa Health Research Institute, University of Ottawa, Ottawa, Ontario, Canada
4 Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
 |
ABSTRACT
|
|---|
There is a rapid increase in the world-wide burden of disease attributed to metabolic syndrome, as defined by co-occurrence of an array of phenotypes including abdominal obesity, dysglycemia, hypertriglyceridemia, low levels of high density lipoprotein cholesterol, and hypertension. Familial studies clearly indicate a genetic component to the disease and many linkage studies have identified a large number of linked loci. No disease-causing genes, however, have been conclusively identified, most likely because this is a multigenic disease for which effects of many causative genes may be small and combined with environmental effects. To assist empirical identification of metabolic syndrome associated genes, we present here a novel computational approach to prioritize candidate genes. We have used linkage studies and the clinical and population-specific presentation of the disease to select a final candidate gene list of 19 most likely disease-causing genes. These are predominantly involved in chylomicron processing, transmembrane receptor activity, and signal transduction pathways. We propose here that information about the clinical presentation of a complex trait can be used to effectively inform computational prioritization of disease-causing genes for that trait.
candidate genes; complex disease
 |
INTRODUCTION
|
|---|
METABOLIC SYNDROME IS AN INCREASINGLY common complex disease that is defined by co-occurrence of an array of metabolic phenotypes. These phenotypes have been classified in multiple ways (Ref. 2 and reviewed in Ref. 49) to provide a list of simple clinical markers that can diagnose individuals with the syndrome. The guidelines of the 2001 National Cholesterol Education Program-Adult Treatment Panel III are in common use and characterize metabolic syndrome as the co-occurrence of any three of the five following phenotypes: abdominal obesity, dysglycemia including insulin resistance/hyperinsulinemia, increased triglycerides, decreased high-density lipoprotein (HDL) cholesterol, hypertension (2, 17, 22, 44). Currently,
25% of North Americans are thought to be affected by metabolic syndrome (15). Obesity is the prevalent phenotype and is rising as a global epidemic (1) that is likely to result in a concurrent increase in metabolic syndrome incidence (42, 54) and related morbidity and mortality.
Familial studies clearly indicate a heritable component to metabolic syndrome (reviewed in Ref. 49); however, multiple linkage studies have generated a large number of metabolic syndrome-associated loci (Supplementary Data File S1)1
without subsequent identification of etiological genes. To date, existing approaches have in general failed to identify genes underlying complex diseases or traits such as metabolic syndrome, as they often present with a wide range of phenotypes and generally involve multiple etiological mechanisms and contributing genes (20). Also, the number of genes falling within the identified loci is generally far too large for further empirical analysis (47).
As a complementary approach, many computational methods have been developed to prioritize candidate genes by mining existing biological data (28). A selection of such methods that analyze gene and protein sequence data, ortholog data, expression data, Gene Ontology (GO) (23) and eVOC (29) annotation and biomedical literature has been previously used in concert in a study to select and/or prioritize most likely candidate disease genes from regions identified by linkage analyses in a case study of Type 2 diabetes and the related trait obesity (57). The rationale for using multiple computational methods is to select candidate genes that fulfill as many disease-relevant criteria as possible in a complementary approach. In prioritizing genes that have been selected by all methods we prioritize candidate genes that fulfill all criteria used to select most likely disease-causing genes across all the methods. In the current study, we build on this approach by additionally incorporating complex clinical presentation data with computational analyses of candidate genes, for the complex disease metabolic syndrome and its defining phenotypes. We include in our study the additional analysis of the individual phenotypes described in metabolic syndrome and the relative frequencies with which the different symptoms occur in several patient populations. Although data-mining of biological and disease-related data entails inherent bias to existing knowledge, this knowledge can be applied to effectively guide the prioritization of most likely disease candidates for further empirical analysis, particularly in the scenario where no other discriminating characteristics identify candidate genes. The phenotype complexity of metabolic syndrome makes it an excellent candidate for such computational analysis to identify most likely disease gene candidates for subsequent empirical validation.
The relationships between candidate genes in the final selected list and their involvement in common pathways and protein interactions are studied with a variety of online tools: the genes selected are mostly involved in lipid transport and processing, the chylomicron, and signal transduction at the plasma membrane. Such functions are appealing targets for the search for genetic causes of metabolic syndrome as each signal transduction pathway has multiple potential activating events and a variety of downstream targets and effects and could therefore be responsible for the diverse array of phenotypes seen in metabolic syndrome. Also, these functions are fundamental to maintaining an appropriate balance between the intracellular and extracellular environments, and disruption could lead to the homeostatic imbalances seen in metabolic syndrome.
In summary, we combine multiple computational approaches for candidate disease gene selection and give candidate genes a clinically relevant score that is based on the defining phenotypes of the disease and their frequency of occurrence in several populations. We use this score to select a short list of most likely candidate genes for metabolic syndrome and discuss their etiological relevance.
 |
METHODS
|
|---|
Overview.
An overview of the analysis is shown in Fig. 1. First, we use multiple computational methods (described below) to analyze the starting set of candidate genes. The methods access an array of biological databases and analyze the candidates according to different criteria. Candidate genes are scored and ranked according to the number of methods that select them (as described in Collation of gene lists and construction of a scoring matrix, below). We have selected the top-scoring genes to generate lists of the most likely candidates for metabolic syndrome and, in parallel, each of the phenotypes defining this syndrome (abdominal obesity, dysglycemia, hypertension, hypertriglyceridemia, low HDL cholesterol - see Refs. 2, 49 for full definitions of phenotypes). Next, we assigned a score to each gene in each phenotype candidate list, using a weighting system that depends on the frequency with which each phenotype appears in patients diagnosed with metabolic syndrome. Finally, every gene is given a total score depending on the score it received in each of the phenotype candidate lists. This total score is used to rank the genes. As the phenotype frequencies vary between populations we have repeated this analysis using weightings appropriate to the white population and to the black population as reported by Kraja et al. 2005 (32). Additionally, phenotype frequencies vary between males and females, and we have repeated the analysis using the corresponding frequencies reported by McCarthy et al. 2003 (39). We repeated the analysis for black/white/male/female population groups using the data generated by Ford et al. (2002) (17) and generated a slightly more stringent list containing matching candidates. We have used the more inclusive analysis in this study. Thus, the final candidate gene list consists of genes that are selected in the search for candidate genes for metabolic syndrome and are also independently selected in the parallel selection of candidate genes for the five phenotypes, depending on the frequencies with which the phenotypes occur in four population groups.

View larger version (20K):
[in this window]
[in a new window]
|
Fig. 1. Overview of analysis, showing selection of genes common to all methods, weighted selection of genes common to all phenotypes, and selection of genes common to metabolic syndrome and the phenotype analysis.
|
|
Generating the starting set of candidate genes.
Linkage studies for metabolic syndrome were used as a primary filter to select the starting set of candidate genes (see Supplementary Data File S1 for references and LOD scores). Cytogenetic loci were used to define linked regions from which candidates were selected, to ensure that potential candidate genes were not excluded due to changes or inconsistencies in marker annotation and identification. Genes were compiled from linked cytogenetic loci using the Ensembl database Ensembl_mart_40 (www.ensembl.org). The final candidate list comprised 13,882 genes and covered all cytogenetic loci documented to be linked to the syndrome (Supplementary Data File S2). Data for all genes were downloaded from the Ensembl database Ensembl_mart_41 into a local database for further analysis. Database versions differ due to ongoing database updates and differences occurring between gene sets in different databases. This is accommodated in part in this study by the scoring system that discriminates between genes that are rejected as poor candidates and those that are not analyzed at all by a method, avoiding biased negative scoring against those genes not measured by all methods because of database differences (see Collation of gene lists and construction of a scoring matrix).
Compilation of search terms and training sets of disease genes.
Many of the methods require various user-defined inputs, and these are shown in table 1. These include disease search terms for querying abstracts in PubMed and anatomy terms used for GeneSeeker queries. Also shown are the top ranking eVOC terms selected by text-mining of PubMed abstracts containing the disease name. Disease name synonyms were used as described in the Online Mendelian Inheritance in Man (OMIM) database (40). The MeSH terms used for each disease by G2D are as follows. For metabolic syndrome, we used all MeSH terms annotated in the literature references from OMIM entry 605552. For its phenotypes, we used "obesity," "hypertriglyceridemia," "hypertension," "diabetes mellitus," and "cholesterol, HDL." The SUSPECTS software (4) automatically retrieves genes implicated in a disorder from OMIM, the Human Gene Mutation Database (55), and Genetic Association Database (8). These training sets were used for each of the phenotypes, using the search terms outlined in Table 1, column 6. For "dysglycemia," no associated genes were found and a training set of "diabetes"-associated genes was used, as this is a very closely related condition. For "low HDL cholesterol" no associated genes were found, and "HDL cholesterol"-associated genes were used for the training set. There are no known disease genes for "metabolic syndrome" and very few genes that are currently considered strong candidate genes, so the training set was compiled from a set of genes that have been previously linked with "metabolic syndrome" in the manner described by Matsunaga and Muramatsu (38) and derived by a sophisticated knowledge-based analysis involving text-mining of PubMed abstracts.
Computational methods for disease gene prioritization.
Seven existing candidate disease gene prioritization methods were used to analyze the starting set of candidate genes: Disease Gene Prediction (DGP) (http://cgg.ebi.ac.uk/services/dgp/) (34) assigns probabilities to all genes (from Ensembl version 15.33.1), indicating their likelihood to mutate based on specific sequence property patterns; PROSPECTR (http://www.genetics.med.ed.ac.uk/prospectr/) (3) differentiates between genes likely to be involved in disease and those that are not, according to sequence-based features; the eVOC method (58) selects candidate genes by determining disease relevance of their expression profile; GeneSeeker (http://www.cmbi.ru.nl/geneseeker/) (60, 61) filters positional candidate disease genes based on expression and phenotypic data from human and mouse databases; SUSPECTS (http://www.genetics.med.ed.ac.uk/suspects/) (4) scores genes according to annotation data from GO, Interpro, and expression libraries, and sequence analysis by the method PROSPECTR; G2D (http://www.ogic.ca/projects/g2d_2/) (46–48) scores all terms in GO according to their relevance to each disease and selects top scoring genes; and finally Endeavour (http://homes.esat.kuleuven.be/
bioiuser/endeavour/endeavour.php) (5), which ranks candidate genes according to Entrez Gene data, GO data, InterPro and Bind protein-protein interaction data, KEGG pathway data, microarray and expressed sequence tag (EST)-based expression data, TFBS cis-regulatory modules, and sequence similarity by BLAST. More detailed descriptions of all methods, including validation and threshold values, are given in the Supplementary Data File S3.
Collation of gene lists and construction of a scoring matrix.
Gene lists were analyzed using Microsoft Excel. Visual Basic software was written to compile the data, assign scores to the candidates, generate final scores for candidates, and graphically present and rank them (as illustrated in Supplementary Data File S4). The computational methods used access a variety of data sources, and consequently not all candidate genes may be analyzed by all methods. This can create a bias in the scoring system, for example an excellent candidate may be selected by six of the seven methods and not analyzed by the seventh method, although the seventh method would have similarly selected it if it had been present in the database(s) analyzed by this method. This candidate should score more highly than a candidate that was analyzed by all seven methods and found to be a good candidate by six methods but found to be an unlikely candidate by the seventh method. To reflect this, our scoring system distinguishes between candidates that were not analyzed by a method and candidates that were analyzed by a method and found to be poor candidates: a likely candidate ("positive"), is assigned a score of 1, an unlikely candidate ("negative") is assigned a score of 0, and if the method does not analyze the gene ("null"), the score assigned is 0.5. Thus a gene that is not considered by a method has a marginally better ranking than a gene that is actively found to be "unlikely" by that method. For this analysis, top-scoring genes selected by all seven methods were selected for each of the phenotypes and for metabolic syndrome. Phenotype frequency of occurrence is used for each population group (black, white, male, and female), and the following scoring system is used: genes selected for each phenotype are given a value of the frequency of occurrence of that phenotype within the population .The frequency values are shown in Table 2. For each gene, a final score is calculated as the sum of all scores assigned to it for that population. Final gene selections for each population are those genes selected for at least three of the five phenotypes (which are in general the highest scoring genes) and also any additional genes that score higher than the lowest score generated by the genes that are common to three of the five phenotypes.
Automated analysis of final gene list.
Enrichment for specific regulatory pathways and functional annotation was investigated with the Database for Annotation, Visualization and Integrated Discovery (DAVID) [http://david.abcc.ncifcrf.gov/ (13)] and the reaction pathway database REACTOME [www.reactome.org (63)]. Additionally, GOstat was used [http://gostat.wehi.edu.au/ (9)] to measure overrepresentation of GO annotations [http://www.geneontology.org (23)] in the final candidate gene list. To identify known relationships between the candidates described in the biomedical literature, the CHILIBOT data-mining software was used [www.chilibot.net (12)], and the results obtained using CHILIBOT were manually curated for accuracy. Additionally the genes were investigated for known relationships to metabolic syndrome with CHILIBOT. Known or inferred protein-protein interactions between the candidate genes were investigated with STRING [Search Tool for Retrieval of Interacting Proteins; http://string.embl.de (64)]. The interactions identified from STRING were manually curated for sufficiency of evidence.
 |
RESULTS
|
|---|
The numbers of genes selected by each method for metabolic syndrome and its five defining phenotypes are shown in Table 3 (see METHODS). Also shown are the numbers of genes selected by all methods for each phenotype and those in common to both metabolic syndrome and all phenotypes, appropriately weighted according to frequency of occurrence of each phenotype (see METHODS for details). In brief, for "metabolic syndrome," 54 genes were selected by all methods used. For each of the phenotypes, the following numbers of genes were selected in common to all methods: abdominal obesity - 48; dysglycemia - 58; hypertension - 69; hypertriglyceridemia - 80; low HDL cholesterol - 14. Genes were then scored according to their selection for the individual phenotypes in a weighted system whereby genes for more common phenotypes received higher scores. Using the phenotype frequency weightings, we selected the following numbers of top-scoring genes for each population: white population - 34 (score >113), black population - 41 (score >103), males - 36 (score >128), females - 34 (score >144). There were 34 genes common to all these analyses, and of these 19 were also common to the set selected for metabolic syndrome. These 19 were investigated further as a final candidate gene list. They are briefly described in Supplementary Data File S5, and the loci in which they fall are detailed in Supplementary Data File S8. The genes selected using population-based weighting were compared with a selection made without weighting, i.e., all genes that appeared in at least three of the phenotype-specific sets were selected regardless of phenotype frequency. There were 34 genes that appeared in three or more phenotype sets, and these were the same 34 as for the white and female population-weighted data. However, six of these genes did not appear in the list selected using black population data. Seven additional unique genes appeared in the black population data set, and two additional unique genes were selected using the male population data (Supplementary Data File S4). This illustrates the extent to which population-specific frequency data affects the final set of selected candidate genes. Reanalysis using population-specific frequency data generated by Ford et al. (2002) (17) generated a list containing the same genes except for the exclusion of TIE1, AXL, and UTRN. We have analyzed the more inclusive list.
View this table:
[in this window]
[in a new window]
|
Table 3. Number of genes selected for metabolic syndrome, for each phenotype, and in common to the selected sets (starting set 13,882 genes)
|
|
Several existing utilities, described below, were used to analyze the selected set of candidate genes, to give an overview of the types of genes selected according to their functional annotation (DAVID and GOstat), pathways represented (REACTOME SkyPainter), existing relationships previously reported between the candidate genes (CHILIBOT), and protein-protein interactions and networks (STRING).
Clustering of genes by functional annotation by DAVID.
The DAVID software analyses similarity of functional annotation of candidate genes, indicating the gene functions most commonly represented in the final candidate group of 19 genes (13). When using high stringency, we found the most enriched functional cluster, with a score of 3.9, contained annotation terms relating to lipid metabolism and transportation. The terms, shown with number of genes and P values in parentheses, were "lipid transporter activity" (6; 1.3 x 10–8), "lipid metabolism" (5; 6.0 x 10–3), "transporter activity" (6; 2.6 x 10–2). The second cluster, with a score of 2.91, contained the terms, with number of genes and P values shown in parentheses, "signal" (10; 5.9 x 10–5), "signal peptide" (10; 3.8 x 10–3), and "disulphide bond" (9; 8.7 x 10–3). Using medium stringency generates a series of large, low-scoring term clusters that include terms involving lipid and chylomicron processing and transport, cell signaling, transmembrane/plasma membrane activity, and tyrosine kinase activity. The complete data set is available as Supplementary Data File S6.
Analysis of pathways involving the candidate genes by REACTOME.
The SkyPainter utility available through the Reactome website [www.reactome.org (63)] highlights the pathways found most commonly associated with the final set of candidate genes. Of the starting set of 19 candidate genes, 9 were found in the Reactome database. The pathway most commonly represented is (with P value in parentheses) "metabolism of lipids and lipoproteins" (6.2 x 10–4). Processing of the chylomicron/apoE/LDLR complex is the most overrepresented event, followed by LDLR/LDL processing. The complete data set is available as Supplementary Data File S7.
Analysis of overrepresented gene ontology annotation of candidate genes by GOstat.
The most representative GO terms for the final candidate gene list (P value cutoff = 0.1) include terms relating to lipid transport and processing, signal transduction, plasma membrane location, and development. The top-scoring 15 terms, and the candidate genes that are annotated with those terms, are shown in Table 4.
Automated literature analysis by CHILIBOT.
Automated analysis of PubMed abstracts using CHILIBOT [www.chilibot.net (12)] can detect existing associations reported between the disease, its phenotypes, and selected candidate genes, thus giving an indication of the degree of relevance and/or novelty of the selected candidates. The disease name "metabolic syndrome" was associated with 9/19 candidate gene names, and on inspection 8/19 were found to be real associations (APOB, APOE, FOXO1A, IGF1, INSR, LDLR, LPL, and PTPN11; the relationship indicated between "metabolic syndrome" and CAT was spurious). The nature of these associations is described in Table 5. Potential indirect relationships with metabolic syndrome through one or more of the other short-listed candidates were indicated for all genes except ADCY7. Analysis of literature associating the candidate genes with each of the phenotypes indicates that only one of the candidates has been previously associated with all five phenotypes (APOB), five of the candidates have been associated with all phenotypes except dysglycemia (IGF1, LDLR, APOE, INSR, LPL), and three of the candidates have not been independently associated with any of the phenotypes previously (ADCY7, GIPC1, PFKM).
View this table:
[in this window]
[in a new window]
|
Table 5. Genes previously associated with metabolic syndrome in the biomedical literature, as detected by Chilibot
|
|
Protein binding and protein network analysis by STRING.
STRING analysis (64) indicates two main groups of interacting proteins within the final candidate list (Fig. 2A). The first centers around PTPN11 (also called SHP2), and interactions between PTPN11 and FYN, TIE1, AXL, CAT, and INSR (which in turn interacts with IGF1) have been experimentally documented (see below). These interactions are also described in the biomedical literature (PubMed abstracts) and in additional protein interaction databases (KEGG), which further extend the group by indicating interaction between ITGB1 and UTRN via FYN. The second centers around LRP1, which interacts with GIPC1, LPL, and APOE (which in turn interacts with LDLR and APOB). These interactions are also described in the biomedical literature and pathway databases (see below). FOXO1A (also called FOXO1 and FKHR) lies between these two clusters, and we present below a possible connection between the two clusters facilitated by this transcription factor. Proteins PFKM and ADCY7 are not included in the interacting groups.

View larger version (19K):
[in this window]
[in a new window]
|
Fig. 2. A: curated results from STRING analysis (64) showing protein-protein interactions between the candidate genes (gene symbol shown at each node). Bold lines: experimental evidence; broken lines: evidence in pathway databases; gray dotted lines: evidence in the biomedical literature linking the two networks. B: potential binding interactions among candidate gene products, showing downstream signaling pathways activated. Candidate gene names are shown in bold, additional proteins required for protein complexes are shown in normal text. Dotted arrows indicate interactions that may not involve direct binding.
|
|
The potential interactions between the candidates' protein products are represented in Fig. 2B, illustrating the predominance of genes in the final candidate set that are involved in chylomicron processing, transmembrane receptor activity and signal transduction. These are described below.
Hepatic chylomicron processing is mediated by binding of an extracellular complex containing APOB, APOE, and LPL to the transmembrane receptors LDLR and LRP1. The APOE and APOB components interact with LDLR; and the APOE and LPL components bind to LRP1 to facilitate chylomicron remnant uptake (11, 24, 35, 59).
Three distinct groups of transmembrane proteins can be identified in the candidate set. LRP1 forms a complex with GIPC1 (21). UTRN, ITGB1, and FYN form a second (50, 67). Finally, INSR binds extracellular IGF1 as well as intracellular PTPN11 (also commonly called SHP2) (7, 30) to form the third complex. CAT (70) and its binding partner HSD17B4 (62), TIE1 (36), FYN (56), and AXL (18) are also implicated in this third complex through their interactions with PTPN11. Additionally, the ability of FYN to interact with ITGB1/UTRN as well as INSR/PTPN11 suggests it may link these two complexes.
Three distinct signaling pathways are likely to be activated by genes in the final candidate set: The LRP1 receptor can activate G protein signaling through its interaction with GIPC1 (21); UTRN/ITGB1 in complex with FYN activates the MAP/ERK pathway (50), and IGF1 binding to INSR activates the PI3K pathway (26). LPL has also been implicated in INSR signaling, although the mechanism of this interaction is unclear (31, 53).
FOXO1A, also commonly referred to as FKHR or FOXO1, is the only nuclear transcription factor in the final candidate list and is regulated via PTPN11 and IGF1/INSR binding and activation of the PI3K pathway (51, 68, 69). FOXO1A can in turn upregulate expression of LPL (27), an integral protein for chylomicron processing (24) and a proposed regulator of insulin signaling (31, 53), thus providing a possible direct regulatory connection between transmembrane receptor activation/signal transduction and the processes of chylomicron processing and insulin receptor signaling.
 |
DISCUSSION
|
|---|
In summary, we have used computational methods to select most likely candidate genes for metabolic syndrome, starting with a set of genes selected from all loci previously associated with the disease. We have used the five phenotypes that define metabolic syndrome and used phenotype and population-specific filters to refine the starting set of genes. We selected the top-scoring 19 genes from the final candidate list as most likely disease gene candidates. These candidates are found to most commonly represent pathways involving metabolism of lipids and lipoproteins, as well as transmembrane signaling and subsequent activation of signal transduction pathways.
Given the diversity of the five phenotypes associated with metabolic syndrome and the variety of the phenotype combinations that present clinically, it would seem unlikely that genes belonging to specific and restricted pathways are causative of this syndrome. Instead, disease candidates are more likely to belong to generic pathways with multiple downstream effects and thus be capable of generating a diverse array of phenotypic effects if disrupted. The predominance of receptor and signaling molecules in the candidate set fits this model: signal transduction pathways can have a large variety of initiating events and multiple downstream targets, regulated through a common signal transduction pathway. Additionally, because of the cascade effect in signal transduction events, even small changes in top-level signaling events can have large and diverse downstream effects, such as the variety of phenotypes seen in metabolic syndrome. Of interest, the only transcription factor candidate, FOXO1A, appears to connect chylomicron remnant uptake with insulin signaling and possibly functions in a feedback loop: FOXO1A is regulated by insulin signaling and upregulates LPL expression; and LPL in turn activates chylomicron processing and is involved in regulation of insulin signaling. Additional trans-regulatory and secondary downstream effects of this transcription factor may also lead to the diverse phenotypes seen in metabolic syndrome. Many of the genes selected are also annotated with terms relating to development (as seen in results from GOstat analysis, Table 4). This most likely reflects a role for lipids and insulin in regulating genes involved in development (for example, neural development as described in Ref. 21) rather than a role for developmental genes in metabolic syndrome. Additionally, however, adipose tissue is increasingly understood as an active endocrine and paracrine organ that secretes molecules involved in the inflammatory process, coagulation, insulin resistance, diabetes, and atherosclerosis (33, 41, 65). Differentiation of preadipocytes to mature fat cells is accompanied by release of many adipokines that appear to modulate both the insulin resistance of metabolic syndrome and endothelial dysfunction present in obesity, and many of these adipokines affect a variety of cellular processes that can lead to the array of abnormalities that characterize metabolic syndrome (6).
Individually, eight of the candidates have been previously implicated in metabolic syndrome, and notably, polymorphisms in LDLR have been associated with metabolic syndrome previously (39). Only one of the candidates (APOB) has been previously linked to all five phenotypes. Three of the candidates show no prior connection with these phenotypes, and one of these also has no direct or indirect association with metabolic syndrome (ADCY7) and is therefore a totally novel candidate. This selection of the top-ranking 19 most likely candidates therefore presents an array of genes including a gene containing known disease-associated single nucleotide polymorphisms (SNPs), some genes previously associated with metabolic syndrome and/or its phenotypes, and some entirely new candidates for further investigation. This demonstrates that novel predictions can be made from existing data using computational approaches and that this approach is truly synthetic and not simply sophisticated data-mining of existing knowledge.
Certain issues intrinsic to computational analysis do arise. These include differences between gene sets in different databases or versions of the same database, as described in METHODS. This is accommodated in part in this study by the scoring system that avoiding biased negative scoring against genes not measured by all methods. Also to be considered, genes with better annotation are generally more likely to be selected, which results in a bias toward selection of well-investigated genes (including disease-related genes) rather than novel predicted or uncharacterized genes. Several of the selected genes have previously been well characterized as disease-causing genes, including LDLR (OMIM #606945), ApoE (OMIM #107741), ApoB (OMIM #107730), and LPL (OMIM #609708) for a variety of phenotypes. Although selection of these genes may be biased due to their extensive annotation, it is equally possible that these genes are predisposed to mutation and/or are sufficiently crucial metabolic genes that their disruption is highly likely to result in a clinical phenotype. The two latter possibilities support the selection of these genes as good candidates, and, within the model of a multigenic complex disease, novel combinations of mutations in these genes could cause the metabolic syndrome phenotypes. To further investigate novel or poorly characterized gene candidates, however, it is possible to look exclusively at scoring by DGP (34) and PROSPECTR (3), analysis by G2D (46–48), which employs sequence similarity to analyze uncharacterized genes, and expression annotation by eVOC, which is based on mapping of EST sequences from cDNA libraries regardless of characterization and annotation of the gene (29, 58).
Another issue that arises is the use for some methods of suboptimal training sets of "known" or disease-associated genes. For example, for metabolic syndrome no known disease genes have been identified, and the training set of genes has been identified through a previous text-mining study as having been considered in the context of the disease (38) rather than being known disease-causing genes. Also, the use of training sets in selection limits the search to fit existing paradigms regarding the type of gene causing the disease and limits the possibility of finding novel disease mechanisms. It would, however, be possible to investigate candidates selected without the use of training sets by excluding the methods that require a training set of known disease genes. Finally, thresholds and cut-offs for selection of most likely candidates can be arbitrary or an "informed guess" for methods that rank genes (in comparison to straightforward inclusion/exclusion). Where possible, thresholds in this study have been based on results from test data sets, as described in the seminal reference for each method (see Supplementary Data File S3).
Metabolic syndrome, in conjunction with obesity, is becoming a serious health issue globally (1, 42, 54). Linkage studies have failed to date to identify causative genes underlying the syndrome, despite clear evidence for heritability (49), and this is most likely due to complex disease genetics whereby many etiological genes of low penetrance, rather than a single causative gene, result in the syndrome. With multiple contributing phenotypes as well as many permutations of those phenotypes, the syndrome lends itself to being deconstructed for systematic computational analysis. Here, we have addressed the identification of most likely disease genes for metabolic syndrome using computational rather than empirical methods. We have used the results of published linkage studies as a primary filter to select our starting set of candidate disease genes and then used multiple computational methods to select most likely candidate genes in a combinatorial approach informed by the clinical presentation of the syndrome. With the increasing use of genome-wide association studies to identify loci associated with specific phenotypes (45), however, our approach may be additionally adapted to pinpoint most likely candidate genes associated with assayed tagging SNPs in these studies. Computational selection of appropriate candidate genes associated with the SNPs assayed is likely to be highly successful because of the much smaller starting sets of candidate genes under analysis.
Whether the selected genes are truly associated with metabolic syndrome can only be shown empirically [as, for example, disease-associated polymorphisms have been previously identified in the selected candidate gene LDLR (39)], and it is not possible to validate our approach to selection of metabolic syndrome candidate disease genes without further empirical research. However, our study is instead a novel combination of independently validated methods combined with disease phenotype data, and we propose a novel set of candidate causative genes for metabolic syndrome as a starting point for further empirical research. All genes selected by any one of the methods already form a subset of likely candidate disease genes, as determined by these independently verified methods. We then take this subset and filter it further by selecting genes that match the most of the criteria tested across all methods and apply additional filters based on the array of phenotypes that define metabolic syndrome and the frequencies of those phenotypes in several patient populations.
Given the increasing urgency of understanding and managing metabolic syndrome effectively, we show here that computational analysis can offer new insights into genetic determinants and regulatory pathways that may be involved and presents a selection of genes that includes both novel and identified candidates for further empirical analysis. Exploring new avenues can also lead to a better understanding of environmental and genetic interactions underlying the syndrome.
We believe that accurately understanding disease phenotype and clinical presentation can lead to success in identifying genetic determinants for complex disease. A complete understanding of the complex clinical presentation of a disease can be used to ask intelligent questions of the existing genetic data and can subsequently be employed to make sense of the pathways and processes represented by the candidates selected. Because no disease-causing genes have been identified for metabolic syndrome to date, it is timely to utilize the phenotypic data in novel and informative ways to determine the genetics of the disease.
 |
GRANTS
|
|---|
I. Okpechi is funded by Wyeth South Africa (pty) Limited, funds are administered by the University of Cape Town. M. A. Andrade-Navarro and C. Perez-Iratxeta acknowledge funding from the Canadian Institutes of Health Research, Canadian Foundation for Innovation, and Ontario Innovation Trust. R. Ramesar is funded by the Medical Research Council of South Africa.
 |
ACKNOWLEDGMENTS
|
|---|
M. A. Andrade-Navarro is a Canada Research Chair in Bioinformatics. We thank A. J. Swift for Visual Basic programming.
 |
FOOTNOTES
|
|---|
Address for reprint requests and other correspondence: N. Tiffin, Div. of Human Genetics, MRC Human Genetics Research Unit, Inst. for Infectious Diseases and Molecular Medicine, Univ. of Cape Town, Cape Town 7925, South Africa (e-mail: nickitiffin{at}imaginet.co.za).
The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 The online version of this article contains supplemental material. 
 |
REFERENCES
|
|---|
- Obesity: preventing, and managing the global epidemic. Report of a WHO consultation. World Health Organ Tech Rep Ser 894: i–xii, 1–253, 2000.
- National Cholesterol Education Program (N.CEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults (Adult Treatment Panel III). Third Report of the National Cholesterol Education Program (N.CEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults (Adult Treatment Panel III) final report. Circulation 106: 3143–3421, 2002.[Free Full Text]
- AdieEA, Adams RR, Evans KL, Porteous DJ, Pickard BS. Speeding disease gene discovery by sequence based candidate prioritization. BMC Bioinformatics 6: 55, 2005.[CrossRef][Medline]
- AdieEA, Adams RR, Evans KL, Porteous DJ, Pickard BS. SUSPECTS: enabling fast and effective prioritization of positional candidates. Bioinformatics 22: 773–774, 2006.[Abstract/Free Full Text]
- AertsS, Lambrechts D, Maity S, Van Loo P, Coessens B, De Smet F, Tranchevent LC, De Moor B, Marynen P, Hassan B, Carmeliet P, Moreau Y. Gene prioritization through genomic data fusion. Nat Biotechnol 24: 537–544, 2006.[CrossRef][Web of Science][Medline]
- AhimaRS. Adipose tissue as an endocrine organ. Obesity (Silver Spring) 14, Suppl 5: 242S–249S, 2006.[CrossRef]
- Asante-AppiahE, Kennedy BP. Protein tyrosine phosphatases: the quest for negative regulators of insulin action. Am J Physiol Endocrinol Metab 284: E663–E670, 2003.[Abstract/Free Full Text]
- BeckerKG, Barnes KC, Bright TJ, Wang SA. The genetic association database. Nat Genet 36: 431–432, 2004.[CrossRef][Web of Science][Medline]
- BeissbarthT, Speed TP. GOstat: find statistically overrepresented Gene Ontologies within a group of genes. Bioinformatics 20: 1464–1465, 2004.[Abstract/Free Full Text]
- BruningJC, Michael MD, Winnay JN, Hayashi T, Horsch D, Accili D, Goodyear LJ, Kahn CR. A muscle-specific insulin receptor knockout exhibits features of the metabolic syndrome of NIDDM without altering glucose tolerance. Mol Cell 2: 559–569, 1998.[CrossRef][Web of Science][Medline]
- CarterCJ. Convergence of genes implicated in Alzheimer's disease on the cerebral cholesterol shuttle: APP, cholesterol, lipoproteins, and atherosclerosis. Neurochem Int 50: 12–38, 2007.[CrossRef][Web of Science][Medline]
- ChenH, Sharp BM. Content-rich biological network constructed by mining PubMed abstracts. BMC Bioinformatics 5: 147, 2004.[CrossRef][Medline]
- DennisG Jr, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA. DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol 4: P3, 2003.[CrossRef][Medline]
- EckelRH. Mechanisms of the components of the metabolic syndrome that predispose to diabetes and atherosclerotic CVD. Proc Nutr Soc 66: 82–95, 2007.[CrossRef][Web of Science][Medline]
- EckelRH, Grundy SM, Zimmet PZ. The metabolic syndrome. Lancet 365: 1415–1428, 2005.[CrossRef][Web of Science][Medline]
- EfstratiadisG, Tsiaousis G, Athyros VG, Karagianni D, Pavlitou-Tsiontsi A, Giannakou-Darda A, Manes C. Total serum insulin-like growth factor-1 and C-reactive protein in metabolic syndrome with or without diabetes. Angiology 57: 303–311, 2006.[Abstract/Free Full Text]
- FordES, Giles WH, Dietz WH. Prevalence of the metabolic syndrome among US adults: findings from the third National Health and Nutrition Examination Survey. JAMA 287: 356–359, 2002.[Abstract/Free Full Text]
- GallicchioM, Mitola S, Valdembri D, Fantozzi R, Varnum B, Avanzi GC, Bussolino F. Inhibition of vascular endothelial growth factor receptor 2-mediated endothelial cell activation by Axl tyrosine kinase receptor. Blood 105: 1970–1976, 2005.[Abstract/Free Full Text]
- GaoJ, Katagiri H, Ishigaki Y, Yamada T, Ogihara T, Imai J, Uno K, Hasegawa Y, Kanzaki M, Yamamoto TT, Ishibashi S, Oka Y. Involvement of apolipoprotein E in excess fat accumulation and insulin resistance. Diabetes 56: 24–33, 2007.[Abstract/Free Full Text]
- GlazierAM, Nadeau JH, Aitman TJ. Finding genes that underlie complex traits. Science 298: 2345–2349, 2002.[Abstract/Free Full Text]
- GotthardtM, Trommsdorff M, Nevitt MF, Shelton J, Richardson JA, Stockinger W, Nimpf J, Herz J. Interactions of the low density lipoprotein receptor gene family with cytosolic adaptor and scaffold proteins suggest diverse biological functions in cellular communication and signal transduction. J Biol Chem 275: 25616–25624, 2000.[Abstract/Free Full Text]
- GrundySM, Brewer HB Jr, Cleeman JI, Smith SC Jr, Lenfant C. Definition of metabolic syndrome: report of the National Heart, Lung, and Blood Institute/American Heart Association conference on scientific issues related to definition. Circulation 109: 433–438, 2004.[Free Full Text]
- HarrisMA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R, Eilbeck K, Lewis S, Marshall B, Mungall C, Richter J, Rubin GM, Blake JA, Bult C, Dolan M, Drabkin H, Eppig JT, Hill DP, Ni L, Ringwald M, Balakrishnan R, Cherry JM, Christie KR, Costanzo MC, Dwight SS, Engel S, Fisk DG, Hirschman JE, Hong EL, Nash RS, Sethuraman A, Theesfeld CL, Botstein D, Dolinski K, Feierbach B, Berardini T, Mundodi S, Rhee SY, Apweiler R, Barrell D, Camon E, Dimmer E, Lee V, Chisholm R, Gaudet P, Kibbe W, Kishore R, Schwarz EM, Sternberg P, Gwinn M, Hannick L, Wortman J, Berriman M, Wood V, de la Cruz N, Tonellato P, Jaiswal P, Seigfried T, White R. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res 32: D258–D261, 2004.[Abstract/Free Full Text]
- HeerenJ, Beisiegel U, Grewal T. Apolipoprotein E recycling: implications for dyslipidemia and atherosclerosis. Arterioscler Thromb Vasc Biol 26: 442–448, 2006.[Abstract/Free Full Text]
- HegeleRA. Monogenic forms of insulin resistance: apertures that expose the common metabolic syndrome. Trends Endocrinol Metab 14: 371–377, 2003.[CrossRef][Web of Science][Medline]
- KadowakiT, Tobe K, Honda-Yamamoto R, Tamemoto H, Kaburagi Y, Momomura K, Ueki K, Takahashi Y, Yamauchi T, Akanuma Y, Yazaki Y. Signal transduction mechanism of insulin and insulin-like growth factor-1. Endocr J 43, Suppl: S33–S41, 1996.[CrossRef][Medline]
- KameiY, Mizukami J, Miura S, Suzuki M, Takahashi N, Kawada T, Taniguchi T, Ezaki O. A forkhead transcription factor FKHR up-regulates lipoprotein lipase expression in skeletal muscle. FEBS Lett 536: 232–236, 2003.[CrossRef][Web of Science][Medline]
- KanehisaM, Bork P. Bioinformatics in the post-sequence era. Nat Genet 33 Suppl: 305–310, 2003.[CrossRef][Web of Science][Medline]
- KelsoJ, Visagie J, Theiler G, Christoffels A, Bardien S, Smedley D, Otgaar D, Greyling G, Jongeneel CV, McCarthy MI, Hide T, Hide W. eVOC: a controlled vocabulary for unifying gene expression data. Genome Res 13: 1222–1230, 2003.[Abstract/Free Full Text]
- KharitonenkovA, Schnekenburger J, Chen Z, Knyazev P, Ali S, Zwick E, White M, Ullrich A. Adapter function of protein-tyrosine phosphatase 1D in insulin receptor/insulin receptor substrate-1 interaction. J Biol Chem 270: 29189–29193, 1995.[Abstract/Free Full Text]
- KimJK, Fillmore JJ, Chen Y, Yu C, Moore IK, Pypaert M, Lutz EP, Kako Y, Velez-Carrasco W, Goldberg IJ, Breslow JL, Shulman GI. Tissue-specific overexpression of lipoprotein lipase causes tissue-specific insulin resistance. Proc Natl Acad Sci USA 98: 7522–7527, 2001.[Abstract/Free Full Text]
- KrajaAT, Hunt SC, Pankow JS, Myers RH, Heiss G, Lewis CE, Rao D, Province MA. An evaluation of the metabolic syndrome in the HyperGEN study. Nutr Metab (Lond) 2: 2, 2005.[CrossRef][Medline]
- LauDC, Schillabeer G, Li ZH, Wong KL, Varzaneh FE, Tough SC. Paracrine interactions in adipose tissue development and growth. Int J Obes Relat Metab Disord 20, Suppl 3: S16–S25, 1996.[Web of Science]
- Lopez-BigasN, Ouzounis CA. Genome-wide identification of genes likely to be involved in human genetic disease. Nucleic Acids Res 32: 3108–3114, 2004.[Abstract/Free Full Text]
- MahleyRW, Ji ZS. Remnant lipoprotein metabolism: key pathways involving cell-surface heparan sulfate proteoglycans and apolipoprotein E. J Lipid Res 40: 1–16, 1999.[Abstract/Free Full Text]
- MarronMB, Hughes DP, McCarthy MJ, Beaumont ER, Brindle NP. Tie-1 receptor tyrosine kinase endodomain interaction with SHP2: potential signalling mechanisms and roles in angiogenesis. Adv Exp Med Biol 476: 35–46, 2000.[Web of Science][Medline]
- MatsumotoM, Han S, Kitamura T, Accili D. Dual role of transcription factor FoxO1 in controlling hepatic insulin sensitivity and lipid metabolism. J Clin Invest 116: 2464–2472, 2006.[CrossRef][Web of Science][Medline]
- MatsunagaT, Muramatsu MA. Knowledge-based computational search for genes associated with the metabolic syndrome. Bioinformatics 21: 3146–3154, 2005.[Abstract/Free Full Text]
- McCarthyJJ, Meyer J, Moliterno DJ, Newby LK, Rogers WJ, Topol EJ. Evidence for substantial effect modification by gender in a large-scale genetic association study of the metabolic syndrome among coronary heart disease patients. Hum Genet 114: 87–98, 2003.[CrossRef][Web of Science][Medline]
- McKusickVA. Mendelian inheritance in man and its online version, OMIM. Am J Hum Genet 80: 588–604, 2007.[CrossRef][Web of Science][Medline]
- Mohamed-AliV, Pinkney JH, Coppack SW. Adipose tissue as an endocrine and paracrine organ. Int J Obes Relat Metab Disord 22: 1145–1158, 1998.[CrossRef][Web of Science][Medline]
- MokdadAH, Ford ES, Bowman BA, Dietz WH, Vinicor F, Bales VS, Marks JS. Prevalence of obesity, diabetes, and obesity-related health risk factors, 2001. JAMA 289: 76–79, 2003.[Abstract/Free Full Text]
- MoonJY, Park S, Rhee JH, Jee SH, Park CM, Choi DS, Park HY, Ko YG, Choi D, Jang Y, Chung N. The applicability of the Asian modified criteria of the metabolic syndrome in the Korean population. Int J Cardiol 114: 83–89, 2007.[CrossRef][Web of Science][Medline]
- ParkYW, Zhu S, Palaniappan L, Heshka S, Carnethon MR, Heymsfield SB. The metabolic syndrome: prevalence and associated risk factor findings in the US population from the Third National Health and Nutrition Examination Survey, 1988–1994. Arch Intern Med 163: 427–436, 2003.[Abstract/Free Full Text]
- PearsonTA, Manolio TA. How to interpret a genome-wide association study. JAMA 299: 1335–1344, 2008.[Abstract/Free Full Text]
- Perez-IratxetaC, Bork P, Andrade-Navarro MA. Update of the G2D tool for prioritization of gene candidates to inherited diseases. Nucleic Acids Res 35: W212–W216, 2007.[Abstract/Free Full Text]
- Perez-IratxetaC, Bork P, Andrade MA. Association of genes to genetically inherited diseases using data mining. Nat Genet 31: 316–319, 2002.[Web of Science][Medline]
- Perez-IratxetaC, Wjst M, Bork P, Andrade MA. G2D: a tool for mining genes associated with disease. BMC Genet 6: 45, 2005.[CrossRef][Medline]
- PollexRL, Hegele RA. Genetic determinants of the metabolic syndrome. Nat Clin Pract Cardiovasc Med 3: 482–489, 2006.[CrossRef][Web of Science][Medline]
- RivierF, Robert A, Hugon G, Mornet D. Different utrophin and dystrophin properties related to their vascular smooth muscle distributions. FEBS Lett 408: 94–98, 1997.[CrossRef][Web of Science][Medline]
- SchinnerS, Scherbaum WA, Bornstein SR, Barthel A. Molecular mechanisms of insulin resistance. Diabet Med 22: 674–682, 2005.[CrossRef][Web of Science][Medline]
- SeidelmannSB, De Luca C, Leibel RL, Breslow JL, Tall AR, Welch CL. Quantitative trait locus mapping of genetic modifiers of metabolic syndrome and atherosclerosis in low-density lipoprotein receptor-deficient mice: identification of a locus for metabolic syndrome and increased atherosclerosis on chromosome 4. Arterioscler Thromb Vasc Biol 25: 204–210, 2005.[Abstract/Free Full Text]
- ShibasakiM, Bujo H, Takahashi K, Murakami K, Unoki H, Saito Y. Catalytically inactive lipoprotein lipase overexpression increases insulin sensitivity in mice. Horm Metab Res 38: 491–496, 2006.[CrossRef][Web of Science][Medline]
- SorofJM, Lai D, Turner J, Poffenbarger T, Portman RJ. Overweight, ethnicity, and the prevalence of hypertension in school-aged children. Pediatrics 113: 475–482, 2004.[Abstract/Free Full Text]
- StensonPD, Ball EV, Mort M, Phillips AD, Shiel JA, Thomas NS, Abeysinghe S, Krawczak M, Cooper DN. Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat 21: 577–581, 2003.[CrossRef][Web of Science][Medline]
- TangH, Zhao ZJ, Huang XY, Landon EJ, Inagami T. Fyn kinase-directed activation of SH2 domain-containing protein-tyrosine phosphatase SHP-2 by Gi protein-coupled receptors in Madin-Darby canine kidney cells. J Biol Chem 274: 12401–12407, 1999.[Abstract/Free Full Text]
- TiffinN, Adie E, Turner F, Brunner HG, van Driel MA, Oti M, Lopez-Bigas N, Ouzounis C, Perez-Iratxeta C, Andrade-Navarro MA, Adeyemo A, Patti ME, Semple CA, Hide W. Computational disease gene identification: a concert of methods prioritizes type 2 diabetes and obesity candidate genes. Nucleic Acids Res 34: 3067–3081, 2006.[Abstract/Free Full Text]
- TiffinN, Kelso JF, Powell AR, Pan H, Bajic VB, Hide WA. Integration of text- and data-mining using ontologies successfully selects disease gene candidates. Nucleic Acids Res 33: 1544–1552, 2005.[Abstract/Free Full Text]
- Van der GeerP. Phosphorylation of LRP1: regulation of transport and signal transduction. Trends Cardiovasc Med 12: 160–165, 2002.[CrossRef][Web of Science][Medline]
- Van DrielMA, Cuelenaere K, Kemmeren PP, Leunissen JA, Brunner HG. A new web-based data mining tool for the identification of candidate genes for human genetic disorders. Eur J Hum Genet 11: 57–63, 2003.[CrossRef][Web of Science][Medline]
- Van DrielMA, Cuelenaere K, Kemmeren PP, Leunissen JA, Brunner HG, Vriend G. GeneSeeker: extraction and integration of human disease-related information from web-based genetic databases. Nucleic Acids Res 33: W758–W761, 2005.[Abstract/Free Full Text]
- Van GrunsvenEG, van Berkel E, Ijlst L, Vreken P, de Klerk JB, Adamski J, Lemonde H, Clayton PT, Cuebas DA, Wanders RJ. Peroxisomal D-hydroxyacyl-CoA dehydrogenase deficiency: resolution of the enzyme defect and its molecular basis in bifunctional protein deficiency. Proc Natl Acad Sci USA 95: 2128–2133, 1998.[Abstract/Free Full Text]
- VastrikI, D'Eustachio P, Schmidt E, Joshi-Tope G, Gopinath G, Croft D, de Bono B, Gillespie M, Jassal B, Lewis S, Matthews L, Wu G, Birney E, Stein L. Reactome: a knowledgebase of biological pathways and processes. Genome Biol 8: R39, 2007.[CrossRef][Medline]
- Von MeringC, Jensen LJ, Kuhn M, Chaffron S, Doerks T, Kruger B, Snel B, Bork P. STRING 7–recent developments in the integration and prediction of protein interactions. Nucleic Acids Res 35: D358–362, 2007.[Abstract/Free Full Text]
- WajchenbergBL. Subcutaneous and visceral adipose tissue: their relation to the metabolic syndrome. Endocr Rev 21: 697–738, 2000.[Abstract/Free Full Text]
- WakinoS, Hayashi K, Kanda T, Tatematsu S, Homma K, Yoshioka K, Takamatsu I, Saruta T. Peroxisome proliferator-activated receptor gamma ligands inhibit Rho/Rho kinase pathway by inducing protein tyrosine phosphatase SHP-2. Circ Res 95: e45–e55, 2004.[Abstract/Free Full Text]
- WaryKK, Mariotti A, Zurzolo C, Giancotti FG. A requirement for caveolin-1 and associated kinase Fyn in integrin signaling and anchorage-dependent cell growth. Cell 94: 625–634, 1998.[CrossRef][Web of Science][Medline]
- WoodsYL, Rena G. Effect of multiple phosphorylation events on the transcription factors FKHR, FKHRL1 and AFX. Biochem Soc Trans 30: 391–397, 2002.[CrossRef][Web of Science][Medline]
- XuJ, Liao K. Protein kinase B/AKT 1 plays a pivotal role in insulin-like growth factor-1 receptor signaling induced 3T3–L1 adipocyte differentiation. J Biol Chem 279: 35914–35922, 2004.[Abstract/Free Full Text]
- YanoS, Arroyo N, Yano N. SHP2 binds catalase and acquires a hydrogen peroxide-resistant phosphatase activity via integrin-signaling. FEBS Lett 577: 327–332, 2004.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:

|
 |

|
 |
 
Y. Wang, Y. Zheng, P. M Nishina, and J. K Naggert
A new mouse model of metabolic syndrome and associated complications
J. Endocrinol.,
July 1, 2009;
202(1):
17 - 28.
[Abstract]
[Full Text]
[PDF]
|
 |
|
Copyright © 2008 by the American Physiological Society.