|
|
||||||||
1 School of Pharmacy, University of Wisconsin, Madison, Wisconsin 53705
2 Molecular and Environmental Toxicology Center, University of Wisconsin, Madison, Wisconsin 53705
3 Waisman Center, University of Wisconsin, Madison, Wisconsin 53705
4 Center for Neuroscience, University of Wisconsin, Madison, Wisconsin 53705
| ABSTRACT |
|---|
|
|
|---|
antioxidant responsive element; human neuroblastoma cell; phase II detoxifying enzymes; oligonucleotide microarray analysis
| INTRODUCTION |
|---|
|
|
|---|
-glutamylcysteine ligase catalytic (GCLC) and regulatory (GCLR) subunits, as well as ferritin heavy and light chains, are also suspected to be upregulated through ARE activation (29, 31). AREs share a consensus motif, TGAC/TNNNGC, originally identified by mutational analysis of the rat GST-Ya ARE (10, 25). Thus identification of the ARE was an initial step leading to elucidation of the molecular mechanism for antioxidant response. tert-butylhydroquinone (tBHQ), a metabolite of the widely used food antioxidant butylated hydroxyanisole (BHA), is a known inducer of phase II enzyme systems including IMR-32 human neuroblastoma cell (18). Increasing evidence indicates that tBHQ induces these phase II detoxifying enzymes through the ARE (2, 23). In addition, the transcription factor, NF-E2-related factor 2 (Nrf2), is essential for ARE-mediated induction of these genes (12, 13, 29, 31). Our laboratory and others have already confirmed that exposure to tBHQ results in nuclear accumulation of Nrf2 (6, 17, 26). The binding of Nrf2 to the ARE leads to transcriptional activation of a score of genes such as NQO1, HO1, multiple forms of GST, glutathione reductase (GR), and thioredoxin reductase (TR) (1, 4, 8, 14, 15, 19, 21). Oligonucleotide microarray analysis allows us to monitor the expression level of thousands of genes in parallel and has the potential to surpass traditional approaches in terms of sensitivity and speed. Presently, molecular biologists and bioinformatic analysts are working together to establish different biostatistical models for interpretation of microarray data. Most of these methods, however, are time-consuming and hard to manage, leading to unfavorable reviews from the common users. In addition, researchers often complain that microarray data can be so confusing that it sometimes provides few clues for further investigation. A major reason for these types of interpretations is the inherent variability of microarray data that leads to false-positive signals concealing real changes. This is true especially for genes with low basal expression levels (28). Another principal contributing factor is the complexity of transcriptional regulatory networks responsible for the observed changes in gene expression. This makes it difficult to predict the signal transduction pathway(s) driving the changes based only on gene expression profiles. Clearly, the main goal of the analysis is to determine whether the microarray data make biological sense. In the present study, oligonucleotide microarrays were used to identify the time-dependent changes in ARE-driven genes induced by tBHQ. We also introduced a set of simple, useful methods comprising rank evaluation, cross comparisons, and reproducibility analysis to evaluate and minimize the variance of microarray datasets.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Cell culture and drug treatment.
IMR-32 human neuroblastoma cells were grown in DMEM supplemented with fetal calf serum (10%), 100 IU/ml penicillin, and 100 mg/ml streptomycin. Cultures were maintained at 37°C in a humidified 10% CO2 atmosphere. IMR-32 cells (60% confluent) were treated with tBHQ at a final concentration of 10 µM or vehicle (0.01% EtOH) for 4, 8, 24, and 48 h prior to harvesting.
Microarray analysis.
Details for the sample preparation and microarray processing are available from Affymetrix (Santa Clara, CA). Total RNA was prepared from cells by using a Qiagen RNeasy midi kit. Eight micrograms of total RNA was used to prepare double-stranded cDNA (Superscript choice kit; GIBCO-BRL) with a T7-(dT)24 primer containing a T7 RNA polymerase promoter site (Operon). The cRNA was prepared and biotin labeled by in vitro transcription (Enzo Biochem). Labeled cRNA was fragmented by incubation at 94°C for 35 min in the presence of 40 mM Tris acetate, pH 8.1, 100 mM potassium acetate, and 30 mM magnesium acetate. Fifteen micrograms of fragmented cRNA was hybridized 16 h at 45°C to an HG U95a array (Affymetrix, Santa Clara, CA). This array contains
12,000 probe sets corresponding to 9,670 full-length human cDNAs. Sixteen to twenty pairs of 25-mer oligonucleotides that span the coding region represent each gene. After hybridization, the GeneChips were automatically washed and stained with streptavidin-phycoerythrin by using a fluidics Station. Finally, probe arrays were scanned at 3-µm resolution using the GeneChip System confocal scanner made for Affymetrix by Aligent. Affymetrix Microarray Suite 4.1 was used to scan and analyze the relative abundance of each gene derived from the average difference of intensities. Analysis parameters used by the software were set to values corresponding to moderate stringency (SDT = 30, SRT = 1.5). Fluorescence intensity was measured for each probe array and normalized to the average fluorescence intensity for the entire probe array. Output from the GeneChip analysis was merged with the Unigene or GenBank descriptor and stored as an Excel data spreadsheet. Gene cluster analysis was performed using GENECLUSTER 1.0 (MIT, Cambridge, MA) (30). Gene categorization was based on NetAffx (http://www.netaffx.com/index2.jsp) and other sources.
RT-PCR.
Validation of upregulated expression was performed by RT-PCR for NQO1, GCLR, GCLC, and HO1. PCR primers which were specific for these genes were used for cDNA synthesis and amplification as follows: NQO1, forward 5'-CATTCTGAAAGGCTGGTTTGA-3' and reverse 5'-TTTCTTCCATCCTTCCAGGAT-3', resulting in a PCR product of 300 bp; GCLR, forward 5'-TTTGGTCAGGGAGTTTCCAG-3' and reverse 5'-ACACAGCAGGAGGCAAGATT-3', resulting in a PCR product of 400 bp; GCLC, forward 5'-TGAGATTTAAGCCCCCTCCT-3' and reverse 5'-TTGGGATCAGTCCAGGAAAC-3', resulting in a PCR product of 380 bp; HO1, forward 5'-ACATCTATGTGGCCCTGGAG-3' and reverse 5'-GCGGTAGAGCTGCTTGAACT-3', resulting in a PCR product of 380 bp. Total RNA, purified from cell pellets with Trizol Reagent (GIBCO-BRL), was subjected to RT-PCR with the Promega Transcription System (Madison, WI). The reaction mix (20 µl) contained 200 µM dNTP, 0.45 µM of each primer, 1 µg of total RNA, and AMV reverse transcriptase (15 U). RNA was reverse transcribed at 42°C for 30 min. DNA was amplified by an initial incubation at 94°C for 4 min followed by 25
35 cycles of 94°C for 0.5 min, 55
58°C for 0.6 min, 72°C for 0.5 min, and a final extension at 72°C for 7 min. the PCR products were then separated by electrophoresis in a 1.2% agarose gel and visualized by ethidium bromide staining. The number of cycles and melting temperature were adjusted depending on the genes amplified.
Western blot.
The whole cell extract was resolved by SDS-PAGE, transferred to polyvinylidene difluoride (PVDF) membrane, and blocked with 5% nonfat milk. The PVDF membranes were incubated with primary antibodies [mouse monoclonal antibody against NQO1 (1:1,000), rabbit polyclonal antibody against GCLC (1:2,000) or GCLR (1:2,000), and mouse monoclonal antibody against HO1 (1:250)] overnight at 4°C. Membranes were washed and incubated with anti-rabbit (1:5,000) or anti-mouse (1:5,000) IgG labeled with horseradish peroxidase for 2 h. The chemiluminescence emitted from luminal oxidization by horseradish peroxidase was detected by using the enhanced chemiluminescence Western blotting detection system (Amersham Pharmacia Biotech, Piscataway, NJ).
| RESULTS |
|---|
|
|
|---|
2) because of the marginal calls. For example, in a 2 x 2 matrix that generates four data sets, genes called "marginal increase(1)/decrease(-1)" would have to be called in each comparison to pass the criteria, n2 = 4/-4. Therefore, the cutoff in the 3 x 3 matrix would be n2 = 9/-9, and that in the 4 x 4 matrix would be n2 = 16/-16. Analysis of the data from the 8-h time point by a Latin square comparison is shown in Fig. 1B. A significant decrease in the number of genes passing the rank analysis was observed when comparing 2 x 2 with 3 x 3 for both increased and decreased genes. There also appeared to be a greater variability in the number of increased genes than that of decreased genes (see Fig. 3B). This variability in gene number, however, does not reflect a consistent change in the same gene, thereby accounting for the low number of decreased genes following tBHQ treatment. These data indicate that a minimum of three independent samples should be run to determine differential gene expression between control and treatment groups. Application of this 3 x 3 matrix analysis to the microarray data at each time point led to the identification of
200 genes whose expression was consistently increased (196 genes) or decreased (4 genes).
|
|
|
|
Increases were also observed in genes associated with neuronal growth and differentiation, such as neuronal olfactomedin-related ER localized protein, axin, ß2-syntrophin, secretogranin II, and musashi.
Nuclear transcription factors including c-Jun, c-Fos, Jun-B, Jun-D, Fra-1, Fra-2, ATF-3, ATF-4, NF-
B, and small Maf proteins (MafK and MafF), which have been reported to possibly influence the Nrf2-ARE interaction, did not change at the mRNA level after treatment with tBHQ. Of particular note, KIAA0132, the human homolog of mouse KEAP1, was increased. KEAP1 is a cytosolic chaperone of Nrf2. Exposure of cells to inducers disrupts the KEAP1-Nrf2 complex and allows Nrf2 to translocate into the nucleus, where it binds to the ARE and stimulates transcription (6). Other categories of genes induced by tBHQ are listed in the Supplemental Table.
SOM clustering.
The 101 genes mentioned above were included in the self-organizing maps (SOM) clustering. We used GENECLUSTER to analyze the data. Normalized hybridization intensities (ADC) for individual genes at time t were defined as (ADC of gene x at time t) - (mean ADC of gene x)/(standard deviation of ADC of gene x), to allow clustering to occur on the basis of the expression profiles rather than by absolute level. A range of two-dimensional matrices was examined with a rapid settling on a 4 x 3 SOM. Less than 12 nodes provided distinct patterns (wide variation between expression of individual genes within a cluster), and increasing beyond 12 nodes led to cluster duplication (nearly identical cluster patterns). Figure 4 shows the average normalized gene expression patterns for the genes contained within clusters 011. Most genes previously identified in the literature to be induced by tBHQ ("Source of Evidence" in Table 1) were grouped within different clusters, although some of them were grouped together as shown in Table 1. The expression profiles induced by a specific agent, therefore, were far more complicated than expected. In addition, these genes grouped into 12 distinct clusters with strikingly consistent patterns, yet no functional correlation existed among the clustered genes (see the Supplemental Table). For example, the genes grouped in cluster 7 are involved in detoxification and cellular antioxidant defense (NQO1, ferritin heavy chain, and HDD), cytoskeleton construction (dynein heavy polypeptide), immune response [HLA class-1 (HLA-A26) heavy chain], and energy metabolism (transketolase). These data suggest that multiple pathways and/or transcription factors must be involved in altering expression of each individual gene in response to tBHQ.
|
| DISCUSSION |
|---|
|
|
|---|
SOM, based on an unsupervised neural network algorithm, was applied to cluster and analyze gene expression patterns in this study. In contrast to the rigid structure of hierarchical clustering, the strong prior hypotheses used in Bayesian clustering, and the nonstructure of k-means clustering, SOMs are ideally suited to exploratory data analysis by allowing one to impose partial structure on the clusters, facilitating easy visualization and interpretation (3, 7). Most researchers believe that this analysis assigns genes to the single group or "cluster" that most closely shares a related expression pattern across specimens. They also believe that this approach has biological relevance, because coordinate regulation of groups of genes often signifies a role in common processes or pathways (9). In this study, SOM automatically and quickly extracted the gene expression profiles among the most prominent features of the data. The SOM results showed that some of the potential ARE-driven genes were clustered together based on their gene expression similarity. However, there are also examples of ARE-driven genes that did not co-cluster with other known ARE-related genes, suggesting that multiple pathway and/or transcription factors may be involved in controlling the expression level of each individual gene.
Oligonucleotide microarray data reflect the mRNA level of each individual clone directly, which may not only be influenced by transcriptional regulation, but also by the posttranscriptional regulation and mRNA stability. As we know, most protein kinases or transcription regulators, which play an important role in the signal transduction pathway, seldom are regulated at the mRNA level or the translational level. Their activation or inactivation is usually associated with protein-protein interaction often mediated by protein kinases and phosphatases. Changes in gene expression based on microarray analysis, therefore, are unable to tell us the signal transduction pathway(s) involved directly. However, we cannot deny the usefulness of microarray analysis, since researchers still gain some critical information by combining their microarray analysis with clustering and reference confirmation.
The advent of high-density oligonucleotide microarrays has greatly facilitated the ability to simultaneously examine the abundance of multiple mRNAs. Although oligonucleotide microarrays are powerful tools for profiling gene expression, the dynamic change and the large number of signals produced require efficient procedures for distinguishing false-positive results from changes in expression that are "real" (independently reproducible). There are two sources of randomly generated error associated with microarray analysis, namely, systematic and experimental error. The latter is a result of the experimental design. Most bioinformatic specialists ignore this type of error, which reflects, for example, the variations in treatment, cell density, and culture medium in cell lines; or tissue regions harvested (reproducibility of the dissection), genetic background, and diet in animal models. In contrast, the systematic error is generated during the analytical phase of microarray analysis. False-positive signals can be produced at multiple steps including probe array manufacture, preparation of cRNAs for microarray analysis, hybridization or washing steps, and global scaling and normalization of overall signal intensities between probe arrays. Therefore, the running of independent samples is necessary to establish a high degree of confidence in the data and will likely diminish both sources of error, whereas repeatedly running the same samples will decrease systematic error but amplify the experimental error. Our analysis does not ensure that all negative calls are truly negative. The percentage of false negatives is hard to quantify, but the confirmation rate of RT-PCR for selected positive calls after our analysis has been 100% (data not show).
Another issue of concern for the user is the several parameters presented by Affymetrix algorithms for detecting differential expression. Several previous studies have used arbitrary fold-change thresholds (typically 2- to 3-fold) to define significant expression change and stratify microarray results (5). This analysis maintains that an individual mRNA species has to equal or exceed a mandated level of fold-change between control and experimental RNA preparations before it is considered to have undergone a change that is likely to be significant. Although this approach is intuitively appealing, we could not find published reports in which its utility has been systematically evaluated. In the application of this technique, there are several limitations. Fold-change, as a ratio, is particularly vulnerable to artifacts produced by global scaling of probe array datasets, and assigning an arbitrary cutoff may mask biologically significant changes (20, 22). In this study, we give high emphasis to "difference call," which is generated based on the "decision matrix thresholds" Affymetrix provided: maximum [increase/total, decrease/total], increase/decrease ratio, log average ratio change, and Dpos-Dneg ratio (27). Each comparison metric is weighted and entered into a decision matrix to derive a difference call that indicates whether a transcript has increased (I), marginally increased (MI), decreased (D), marginally decreased (MD), or not changed (NC) in expression level. We have developed a ranking analysis of difference call as shown in the results section to determine the increased or decreased gene expression based on a rational cutoff value (±n2). These data make it absolutely clear that a minimum of three independent samples should be run to generate a reliable set of changed genes.
According to Affymetrix Suite 4.1, "average difference" (AD) serves as a relative indicator in the level of expression of a transcript. ADC is a parameter calculated from average difference of treatment (ADtreat) balanced by average difference of baseline (ADbase), i.e., ADC = ADtreat - ADbase. We used ADC for reproducibility analyses, because ADC represents the difference between control and treatment more accurately than fold change does in Affymetrix algorithms. The CV is a relative measure of the variation, since dividing by the mean directly accounts for the magnitude of the values. Large values of CV suggest that the data are quite variable and, therefore, inaccurate. Above all, we showed that our approach to noise filtration not only permits an experiment-wide assessment of overall data quality, but also allows the users to score and rank individual genes according to their likelihood of manifesting changes that are reproducible. These noise-filtering methods have also proven to be very useful in screening data gathered from primary cortical neuronal cultures, human neural stem cells, and dissected brain regions from transgenic mice overexpressing human amyloid precursor protein.
In conclusion, this study not only identified new genes induced by tBHQ, but also provided new insight into the complex pathways governing the regulation of antioxidant defense gene. Clearly, there is a significant variability in microarray data that may be contributed to the complexity in transcriptional regulation of gene expression. The noise-filtering methods presented here, however, make more efficient use of microarray data and increase the probability of generating a biological relevant dataset.
| ACKNOWLEDGMENTS |
|---|
This study was supported by National Institute of Environmental Health Sciences Grants ES-08089 (to J. A. J), ES-10042 (to J. A. Johnson), and ES-09090 and by the Burroughs Wellcome New Investigator in Toxicological Sciences Award (to J. A. Johnson).
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: J. A. Johnson, School of Pharmacy, Univ. of Wisconsin, 6125 Rennebohm Hall, 777 Highland Ave, Madison, WI 53706 (E-mail: jajohnson{at}pharmacy.wisc.edu).
10.1152/physiolgenomics.00003.2002.
1 Supplementary material to this article is available online at http://physiolgenomics.physiology.org/cgi/content/full/9/3/137/DC1. ![]()
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
N. Hattori, T. Suzuki, S. Jinno, H. Okeya, A. Ishikawa, C. Kondo, T. Hayashi, M. Ito, T. Kanamori, T. Kawai, et al. Methyl Methacrylate Activates the Gsta1 Promoter Journal of Dental Research, December 1, 2008; 87(12): 1117 - 1121. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Garg, S. Gupta, and G. B. Maru Dietary curcumin modulates transcriptional regulators of phase I and phase II enzymes in benzo[a]pyrene-treated mice: mechanism of its anti-initiating action Carcinogenesis, May 1, 2008; 29(5): 1022 - 1032. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. L. Hagemann, S. A. Gaeta, M. A. Smith, D. A. Johnson, J. A. Johnson, and A. Messing Gene expression analysis in mice with elevated glial fibrillary acidic protein and Rosenthal fibers reveals a stress response followed by glial activation and neuronal dysfunction Hum. Mol. Genet., August 15, 2005; 14(16): 2443 - 2458. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Li, M. L. Spletter, and J. A. Johnson Dissecting tBHQ induced ARE-driven gene expression through long and short oligonucleotide arrays Physiol Genomics, March 21, 2005; 21(1): 43 - 58. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. D. Stein, N. J. Anders, C. DeCarli, S. L. Chan, M. P. Mattson, and J. A. Johnson Neutralization of Transthyretin Reverses the Neuroprotective Effects of Secreted Amyloid Precursor Protein (APP) in APPSw Mice Resulting in Tau Phosphorylation and Loss of Hippocampal Neurons: Support for the Amyloid Hypothesis J. Neurosci., September 1, 2004; 24(35): 7707 - 7717. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wu, M. H. Noyan Ashraf, M. Facci, R. Wang, P. G. Paterson, A. Ferrie, and B. H. J. Juurlink Dietary approach to attenuate oxidative stress, hypertension, and inflammation in the cardiovascular system PNAS, May 4, 2004; 101(18): 7094 - 7099. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Kraft, D. A. Johnson, and J. A. Johnson Nuclear Factor E2-Related Factor 2-Dependent Antioxidant Response Element Activation by tert-Butylhydroquinone and Sulforaphane Occurring Preferentially in Astrocytes Conditions Neurons against Oxidative Insult J. Neurosci., February 4, 2004; 24(5): 1101 - 1112. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Y. Shih, D. A. Johnson, G. Wong, A. D. Kraft, L. Jiang, H. Erb, J. A. Johnson, and T. H. Murphy Coordinate Regulation of Glutathione Biosynthesis and Release by Nrf2-Expressing Glia Potently Protects Neurons from Oxidative Stress J. Neurosci., April 15, 2003; 23(8): 3394 - 3406. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-M. Lee, M. J. Calkins, K. Chan, Y. W. Kan, and J. A. Johnson Identification of the NF-E2-related Factor-2-dependent Genes Conferring Protection against Oxidative Stress in Primary Cortical Astrocytes Using Oligonucleotide Microarray Analysis J. Biol. Chem., March 28, 2003; 278(14): 12029 - 12038. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. M. Diffee, E. A. Seversen, T. D. Stein, and J. A. Johnson Microarray expression analysis of effects of exercise training: increase in atrial MLC-1 in rat ventricles Am J Physiol Heart Circ Physiol, March 1, 2003; 284(3): H830 - H837. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-K. Kwak, N. Wakabayashi, K. Itoh, H. Motohashi, M. Yamamoto, and T. W. Kensler Modulation of Gene Expression by Cancer Chemopreventive Dithiolethiones through the Keap1-Nrf2 Pathway. IDENTIFICATION OF NOVEL GENE CLUSTERS FOR CELL SURVIVAL J. Biol. Chem., February 28, 2003; 278(10): 8135 - 8145. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Li, M. Pankratz, and J. A. Johnson Differential Gene Expression Patterns Revealed by Oligonucleotide Versus Long cDNA Arrays Toxicol. Sci., October 1, 2002; 69(2): 383 - 390. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |