|
|
||||||||
1 Epithelial Research Group, Institute for Cell and Molecular Biosciences
2 International Centre for Life, Newcastle University, Newcastle upon Tyne, United Kingdom
| ABSTRACT |
|---|
|
|
|---|
natural antisense transcript; DNA array; RNA; antisense transcriptome
| INTRODUCTION |
|---|
|
|
|---|
The current lack of suitable tools to assess the expression of sense and antisense RNAs on a routine basis limits progress in antisense research (18). The methodologies that currently provide this information such as tiling arrays or large-scale sequencing strategies are not suitable to routinely address physiological questions. At present commercial arrays are designed to hybridize mainly to (protein encoding) sense transcripts and neglect most antisense transcripts. An experimental method to label both strands of an RNA preparation independently is conceivable; however, this strategy may prove unsuccessful due to the low level of antisense expression and the often limited coverage of the DNA arrays. Here we provide the proof of principle that both coverage and sensitivity of commercial DNA oligo arrays are sufficient for monitoring antisense RNA expression in total RNA on a genome-wide scale. In addition, we show that a significant proportion of randomly selected sense/antisense pairs show tissue-specific expression patterns, strongly suggesting that antisense transcripts may play a physiological role.
| MATERIALS AND METHODS |
|---|
|
|
|---|
DNA arrays.
Two generations of Affymetrix mouse genome arrays were used, the U74A and U74B chips that each contained
25% probes annotated in reverse orientation and the 430 array as a positive reference. The U74A array carries 12,588 probes, of which 2,611 were in reverse complement orientation; on the U74B chips of 12,570 probes, 3,041 were wrongly annotated. The U74C chip that contains 60% wrongly annotated probe sets was not available for this study. U74A and parts of U74B contained probes representing fully annotated genes and ESTs, whereas the remaining part of U74B and U74C contained only ESTs (likely the major source of the wrongly annotated sequence information).The mouse 430 array contains 45,102 probe sets. [The 430 array is supplied in two versions, one that contains the probe set on one chip (430.2, used for kidney samples); the other has the identical probe set spotted of two chips (430A and 430B, used for brain samples). Over all, the probe sets are identical and will be referred to as "mouse genome array 430"].
Isolation of total RNA, reverse transcription, second-strand DNA synthesis, and synthesis of biotin-labeled cRNA were performed with kits and reagents according to the manufacturer's protocol (Affymetrix). In brief, 15 µg of total RNA were primed with an oligo dT adaptor primer carrying a T7 polymerase promoter. Reverse transcription using SuperScriptII was performed at 42°C for 1 h. After second-strand cDNA synthesis and cDNA clean-up, biotin-labeled cRNA was synthesized using T7 RNA polymerase (note that cDNA that may be synthesized from contaminating traces of genomic DNA during second-strand cDNA synthesis will not be transcribed into cRNA due to the absence of the T7 promoter). The cRNA was then purified by removal of unincorporated NTPs and fragmented to 35200 base fragments by metal-induced hydrolysis. Hybridization and washing steps were performed according to the manufacturer's protocol (Affymetrix). The microarray analysis suite 5.0 (Affymetrix) was used to quantify the fluorescent signals and to determine probability values for expression [as: (P)resent, (M)arginal, (A)bsent]. A mask provided by Affymetrix identified the wrongly annotated probe sets on the U74A and U74B chips. The data from the U74 and 430 arrays were compared and annotated using the Affymetrix web portal Netaffx. The array data are available at GEO accession number GSE6106.
Real-time RT-PCR.
We reverse transcribed 0.5 µg of total RNA in a 20-µl reaction using oligo dT primers and reagents from Qiagen. Omniscript reverse transcriptase was used in the presence of 2 U/µl RNase inhibitor (RNasin, Promega). The reaction was performed at 37°C for 1 h. A reaction without enzyme was run as a negative control. After denaturation for 5 min at 95°C, aliquots of the reaction were either directly used for real-time PCR or stored at 20°C. We used 0.6 µl of cDNA for PCR in a 10-µl reaction containing 250 nM of forward and reverse primer, 4 mM MgCl2, and 1x ABsolute QPCR capillary SYBR green mix (without MgCl2) (Abgene). Reactions were performed in a Roche Light Cycler. Cycling initiated with a polymerase activation step of 15 min at 95°C followed by 50 cycles denaturation at 95°C (10 s), annealing at 55°C (10 s), and elongation 72°C (15 s). The reaction was terminated with the recording of the melting curve. Selected end point amplicons were analyzed by 2.5% Agarose gel electrophoresis. All PCR experiments were repeated at least once under identical conditions and at least four times with different batches of RNA or cDNA.
Primer design.
Primers were designed using the net platform http://biotools.umassmed.edu/bioapps/primer3_www.cgi. The melting temperature of the primers was 60°C. The generated amplicons of
120 base pairs were required to span an intron. The sequences of all primers as well as the cognate sequence accession numbers are given in Supplementary Table 1. The online version of this article contains supplementary material.
|
| RESULTS |
|---|
|
|
|---|
|
|
The expressed antisense transcripts were related to the cognate sense transcripts as monitored on the 430 arrays. The comparison of the two datasets was hampered by the fact that the probes on the 430 array derive from an updated genome version, thus carrying new identities, and were designed with a new algorithm. As a consequence, the comparison was done manually and revealed that only 268 of the 672 positive calls with kidney RNA on U74 could be assigned to probe sets on the 430 array. Of the 268 sense/antisense pairs 248 (92.5%) showed coexpression, 19 (7.1%) of the sense RNAs were not expressed. From the 573 positive calls in the brain, 212 could be related to probes on the 430 array. Of these 212 pairs, 158 (74.5%) were coexpressed and 49 (23.1%) sense RNAs were absent. We detected 129 of coexpressed transcript pairs in both kidney and brain. Most interestingly bidirectionally active genes appear to be preferentially coexpressed in kidney and brain (Table 1, bottom).
To corroborate the array results we selected 10 bidirectionally transcribed genes from the 129 coexpressed transcript pairs for validation by real-time RT-PCR. The selection depended on the accessibility of information concerning the genomic structure of potential sense/antisense pairs. We preferentially used the sense/antisense database of the PHANTOM3 consortium (http://fantom31p.gsc.riken.jp/s_as/) and compared the results to Ensembl annotations (http://www.ensembl.org/Mus_musculus/index.html). This process was hampered by the fact that we identified numerous antisense transcripts that were not represented in the database. We often found evidence for bidirectional transcription of the cognate loci in the form of close genes in tail-to-tail arrangement but no documented overlap that would enable primer design. Figure 3 gives an example of an antisense transcript identified on the U74 array, how the locus is represented in the FANTOM and the Ensembl database, and where the primers are localized. To exclude genomic DNA from contaminating the RT-PCR we designed primer pairs to locate on different exons. RNA was isolated from kidney, brain, heart, thymus, and liver and used for real-time RT-PCR following standard procedures. The 10 primer pairs for sense and antisense transcripts were used in parallel reactions. ß-Actin primers served as positive controls; reactions without reverse transcriptase and reactions without template served as negative controls. MgCl2 concentration and primer concentrations were adjusted to optimize the results. Three parameters were taken into account to evaluate the data: The cycle threshold (cT) values, the melting curves, and agarose gel electrophoresis of the end phase amplicon. Based on these criteria, five of the sense specific primer pairs and seven of the antisense specific primers produced specific amplicons (Table 2). The high cT values observed for a number of primer pairs (6 and 10 for sense, 2 for antisense) may reflect inefficient amplification rather than low abundance of the template, or both. A summary of the results is given in Table 2. The values were compared with actin and a cut-off of (cT actin + 10 cycles) was set to exclude the very inefficiently amplified fragments (sense, 6 and 10; antisense, 2 and 6). As shown in Fig. 4, sense and antisense transcripts were expressed in a tissue-specific manner. Deviations of up to 39-fold for both sense and antisense, within the assessed tissues, were observed. However, it has to be taken into account that the level of ß-actin may vary between the different tissues; therefore, the RT-PCR results are qualitative. Nevertheless, the results confirm that the expression of antisense transcripts is regulated in a tissue-specific manner. The widespread coexpression of sense and antisense transcripts suggests that the expression patterns may be related. Previous reports on the tissue-specific expression of sense and antisense transcripts corroborate our findings thus validating the DNA array approach (5, 15, 16, 31).
|
|
|
| DISCUSSION |
|---|
|
|
|---|
Antisense transcripts are increasingly recognized to have gene-regulating potential. Their expression pattern is time and tissue dependent, a finding that is corroborated by few specific examples of antisense transcripts implicated in embryonic development and cancer progression (25). To elucidate the largely unknown impact of NATs on biological processes would require monitoring the antisense transcriptome under physiological and pathological circumstances. The reversely annotated probes on the U74 chip series offer the opportunity to deliver the proof of principle for this kind of experiment. The probe sets only cover
25% of each sequence. This raised the question of whether a significant proportion of polyadenylated antisense transcripts overlap with and bind to these probes. In addition, antisense RNAs are usually expressed at a low level, which again may be critical for detection. We obtained
10% positive calls in both kidney and brain, which is lower than recent predictions (15, 17). The most relevant figure for this study comes from the FANTOM consortium, estimating that 72% of all transcriptional units in mouse show evidence of overlapping RNA (15). To explain this discrepancy we have to take the limited coverage of the array into account and to acknowledge that the reversely annotated oligonucleotides of the U74 chips may not provide a fully representative selection. This bias will become irrelevant with the use of exon arrays (10).
Recent reports by Kiyosawa et al. (16) and Cheng et al. (7) found a substantial number of nonpolyadenylated transcripts compared with polyadenylated (overlapping) RNAs. The intimate relation between transcription and RNA processing, in this case polyadenylation, implies divergent synthesis pathways for the two groups of RNA and possibly different cellular fates (24). Common labeling procedures, including our own, select for polyadenylated transcripts. This bias may prove advantageous to distinguish between transcripts of different biological function. Fully processed RNAs are more stable and thus less likely to be byproducts of transcription (transcriptional noise) (8, 29).
The validation of the array results was performed by real-time PCR. Primer design was hampered by two major constraints. First, records of both sense and antisense transcripts (ESTs, cDNAs) were required. Our findings comply with others that antisense transcripts are often unannotated. Second, RNA preparations contain varying traces of genomic DNA that may interfere with the amplification. To avoid this pitfall we designed all primers to contain the upstream intron adjacent to the probe set. This requirement may have had impact on the quality of the primers, and as a consequence high cT values were observed in a number of samples. The input from reverse transcription is limited to 10% of the PCR volume; therefore, we ran 50 cycles to monitor expression of the transcript pairs. This resulted in a number of borderline values (cT >40, Table 2) that would require either redesign of the primers or further optimization of the cognate reactions. These values were not included into the assessment of tissue-specific expression of sense/antisense pairs (Fig. 4). Nevertheless, we could confirm a significant number of transcripts identified on the chip. To conclude, we show that commercial DNA arrays offer a tool to monitor antisense expression. A protocol that allows the independent labeling of sense and antisense strand of an RNA sample in combination with exon arrays will bring routine examinations of the antisense transcriptome within reach.
We have found extensive coexpression of sense and antisense transcripts in the RNA preparations from the kidney and brain. In general terms, similar observations have been made when the expression pattern of antisense transcripts in mammalian systems was evaluated by tiling arrays and large-scale sequencing approaches (1, 5, 15, 16, 23). Also our experimental validation of a few antisense transcripts showed tissue-specific expression patterns that reflected a developmental and hormonal state (15, 23). Note that our findings confirm that expression of sense and antisense does not follow a general pattern of co- or reciprocal regulation (15); interestingly, however, the data suggest that this pattern may vary for a specific transcript pair between different organs. Another striking feature represents the strong bias of coexpressed sense/antisense pairs toward expression in both kidney and brain. This could indicate that coexpression of sense and antisense transcripts provides a very general positive impact on mRNA and/or protein synthesis. The two transcripts could theoretically derive from different cell populations; however, the extent of coexpression argues against a strict local separation of sense and antisense transcription. We therefore assume that a significant number of NATs are coexpressed with their cognate sense RNAs in the same cell population. Together with the preferred conservation of NAT pairs that overlap in exonic regions, the coexpression would suggest a putative regulatory mechanism that involves hybrid formation of fully processed sense and antisense transcripts.
Our results are based on the reversely annotated probe set on the Affymetrix U74A and B arrays. This resulted in a serendipitous probe "selection" that excludes any bias toward the genes screened for antisense transcription. We found more antisense transcripts in the kidney compared with the brain and sense/antisense coexpression also seemed higher in the kidney than in the brain. The lower general transcriptional activity in brain might account for this observation. Sun et al. (31) reported higher antisense expression in mammalian brain compared with other tissues. It is tempting to speculate about biological explanations for these findings especially since regulatory RNAs are potential molecular mediators of higher eukaryote brain complexity (11, 21). We suggest that the reason for the discrepancies between our data and published reports is the biological background of the different datasets. Our data derive from mice with defined genetic background and age, whereas in silico compiled datasets usually do not discriminate strains and developmental stages. Considering the regulated and potentially dynamic expression of the antisense transcriptome it is conceivable that the snapshot presented here may differ from the bioinformatic average. In addition, experimental variations may add to the observed differences. For example, the dataset is relatively small, limited by unique chip design, and thus more susceptible to experimental variations. Several reports have highlighted the fact that cDNA preparation gives rise to a small proportion of reverse complement sequences (7). These traces, however, should not account for the variations. The probe synthesized for chip hybridization strictly derives from primed RNA since contaminations from genomic DNA will not include the T7 RNA polymerase promoter. The RT-PCR results are again unlikely to derive from unspecific priming because the all amplicons span an intron.
Natural antisense RNAs help to organize fundamental biological processes such as embryonic development and cancer formation (4, 12, 22). Consequently, the expression of NATs is tightly regulated with respect to tissue and age (3). We provide proof of principle that commercial DNA arrays are suitable tools to map the antisense transcriptome. This will eventually help to unravel the putative role of NATs in eukaryotic gene regulation.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Article published online before print. See web site for date of publication (http://physiolgenomics.physiology.org).
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Carlile, D. Swan, K. Jackson, K. Preston-Fayers, B. Ballester, P. Flicek, and A. Werner Strand selective generation of endo-siRNAs from the Na/phosphate transporter gene Slc34a1 in murine tissues Nucleic Acids Res., April 1, 2009; 37(7): 2274 - 2282. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. C. 't Hoen, Y. Ariyurek, H. H. Thygesen, E. Vreugdenhil, R. H. A. M. Vossen, R. X. de Menezes, J. M. Boer, G.-J. B. van Ommen, and J. T. den Dunnen Deep sequencing-based expression analysis shows major advances in robustness, resolution and inter-lab portability over five microarray platforms Nucleic Acids Res., December 1, 2008; 36(21): e141 - e141. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Matsui, J. Ishida, T. Morosawa, Y. Mochizuki, E. Kaminuma, T. A. Endo, M. Okamoto, E. Nambara, M. Nakajima, M. Kawashima, et al. Arabidopsis Transcriptome Analysis under Drought, Cold, High-Salinity and ABA Treatment Conditions using a Tiling Array Plant Cell Physiol., August 1, 2008; 49(8): 1135 - 1149. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Carlile, P. Nalbant, K. Preston-Fayers, G. S. McHaffie, and A. Werner Processing of naturally occurring sense/antisense transcripts of the vertebrate Slc34a gene into short RNAs Physiol Genomics, June 1, 2008; 34(1): 95 - 100. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |