1 United States Department of Agriculture, Agricultural Research Center, Beltsville Area Research Center, Beltsville, Maryland
2 University of Sao Paulo-ESALQ, Piracicaba, SP, Brazil
3 Bioinformatics and Computational Biology, George Mason University, Manassas, Virginia
4 United States Department of Agriculture, Agricultural Research Center, U.S. Meat Animal Research Center, Clay Center, Nebraska
| ABSTRACT |
|---|
MicroRNAs are small, ~22 nucleotide-long noncoding RNAs capable of controlling gene expression by inhibiting translation. Alignment of human microRNA stem-loop sequences (mir) against a recent draft sequence assembly of the bovine genome resulted in identification of 334 predicted bovine mir. We sequenced five tissue-specific cDNA libraries derived from the small RNA fractions of bovine embryo, thymus, small intestine, and lymph node to validate these predictions and identify new mir. This strategy combined with comparative sequence analysis identified 129 sequences that corresponded to mature microRNAs (miR). A total of 107 sequences aligned to known human mir, and 100 of these matched expressed miR. The other seven sequences represented novel miR expressed from the complementary strand of previously characterized human mir. The 22 sequences without matches displayed characteristic mir secondary structures when folded in silico, and 10 of these retained sequence conservation with other vertebrate species. Expression analysis based on sequence identity counts revealed that some miR were preferentially expressed in certain tissues, while bta-miR-26a and bta-miR-103 were prevalent in all tissues examined. These results support the premise that species differences in regulation of gene expression by miR occur primarily at the level of expression and processing.
Keywords: small RNA; microRNA; embryo; immune
| INTRODUCTION |
|---|
MicroRNAs are small noncoding RNAs (~22 nucleotides in length) that influence the expression of hundreds of genes (24) and have a role in regulation of gene expression for numerous biological processes including brain morphogenesis (17, 24), cardiomyocyte proliferation and differentiation (41), insulin secretion (32), tumorigenesis, viral defense (23), and hematopoietic lineage differentiation (12). Mature microRNAs (miR) in animals interact mostly with the 3'-untranslated region (UTR) of targeted mRNA and modulate gene expression (3, 4, 7, 19, 20, 30, 31, 40). miR have been identified by sequence and expression analyses (5). Genome sequence analysis algorithms based on phylogenetic conservation (10) and RNA folding (27) have identified potential stem-loop structures containing microRNAs (mir) in many species with available genome sequence. However, validation of these predictions has required detection of miR transcripts by Northern blot or by sequencing of cDNA libraries derived from size-fractioned RNA (5). The latter approach has also identified miR not predicted by in silico methods.
There are currently 4,039 mir described for primates, rodents, birds, fish, worms, flies, plants, and viruses. A total of 462 mir have been described for humans (18) (release 8.2). Genomic scans and cloning results have indicated that the actual total number of human mir may be closer to 800 (9). Some mir maintain conservation across vertebrate species, while others have a more limited species distribution (9). Probably the most interesting observations pertain to variation in miR abundance and expression among different tissues (6, 15, 24, 25, 35). Elucidation of the differences in miR and target mRNA expression between species and tissues will continue to be valuable in understanding the gene expression regulatory networks underlying biological differences between organisms.
Cattle have tremendous importance not only for food production but as a mammalian model organism for comparative genomics and biological studies (16). Despite the recognized importance of miR in regulating gene expression during development and other biological processes, there has been little information about miR expression in cattle. Thus, the main objective in this study was to identify conserved and novel miR present in cattle and to evaluate specific expression patterns in embryo and tissues that are important for immune responses.
| MATERIALS AND METHODS |
|---|
miR cloning.
Day 30 (d30) bovine embryos (gestation period 280 days) were snap-frozen in liquid nitrogen immediately after removal from the reproductive tract of slaughtered cows. Immune and gut tissue samples were obtained from 8-mo-old Holstein steers raised in concrete stalls. Approximately 200–300 mg of tissue from mesenteric (MLN) and abomasal (ALN) lymph nodes, thymus (THY), small intestine (SI), and whole embryos (EMB) were processed with TRIzol (Invitrogen, Carlsbad, CA) for RNA extraction according to the manufacturer's instructions. Single insert cDNA libraries corresponding to expressed miR were constructed as described by Lu et al. (26) with the following modifications. In brief, RNA at each stage was separated by size on denaturing acrylamide gels, stained with SYBR Gold (Molecular Probes, Eugene, OR), and eluted by FlashPAGE electrophoresis (Ambion, Austin, TX). PCR-amplified cDNA was cloned using Topo TA cloning (Invitrogen). Individual clones from the transformation were transferred into 384-well plates and sequenced with DYEnamic ET terminator (Amersham, Piscataway, NJ) on an ABI 3730 instrument (Applied Biosystems, Foster City, CA). All animal care and protocols were reviewed and approved by the Beltsville Agricultural Research Center's Animal Care and Use Committee (protocol number 05-013).
Sequence, quantitative RT-PCR and statistical analyses.
Chromatograms were analyzed with Phred (14), and resulting sequences were screened for vector and linker sequences. Sequences were oriented based on the 3'- and 5'-end specific linker sequences and were used only if the inserts were 17–34 bases with a minimum base quality of 20 for all bases. Distinct clone sequences were clustered and assembled to obtain the longest sequence, along with member counts within the cluster. Some of the clusters were further collapsed by manual intervention that allowed for single base mismatches, especially toward the ends of a cloned sequence. The reduced set of sequences from the above analysis was annotated by matching against miRBase for known miR. Sequences not having matches to miRBase were screened against rRNA, tRNA (http://lowelab.ucsc.edu/GtRNAdb/Hsapi/Hsapi-summary.html) and snoRNAs (http://www.snorna.biotoul.fr/browse.php) to remove contaminating sequences that could interfere with identification of novel bovine miR. Bases flanking the bovine miR were obtained by BLAST analysis (1) against the bovine genome and were used to check for hairpin conformation using mfold (27). Human matches and conservation of bases across different species were determined using the UCSC genome browser (http://genome.ucsc.edu). Those sequences identified as mir were submitted to the miRBase registry web site for official annotation. Statistical analysis of miR expression was conducted using χ² analysis to compare global and individual miR abundance among the libraries. MicroRNA expression was defined as the number of sequences for each miR in a library, divided by the total number of sequences for that library (Supplemental Table S2). (The online version of this article contains supplemental material.) Quantitative RT-PCR was conducted using human TaqMan miR probes that had the exact same sequence as the bovine miR. Reactions were conducted following the manufacturer's recommendations (Applied Biosystems). Hierarchical clustering of miR expression was performed on data from miR sequenced at least four times, using the program GeneSpring version 7.2 (Agilent, Foster City, CA) and Pearson correlation.
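The expression measure and the χ² comparison described above can be illustrated with a minimal sketch. The text does not specify how the contingency tables were set up, so the 2×2 layout per miR (one miR versus all other sequences, in two libraries) is an assumption, as are the function names and the clone counts, which are invented for illustration.

```python
def relative_expression(counts):
    """Divide each miR count by the total number of sequences in the library."""
    total = sum(counts.values())
    return {mir: n / total for mir, n in counts.items()}


def chi_square_2x2(a, total_a, b, total_b):
    """Pearson chi-square statistic (no continuity correction) for one miR
    counted a times in library A and b times in library B."""
    observed = [[a, total_a - a], [b, total_b - b]]
    row_sums = [total_a, total_b]
    col_sums = [a + b, (total_a - a) + (total_b - b)]
    grand = total_a + total_b
    stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_sums[i] * col_sums[j] / grand
            stat += (observed[i][j] - expected) ** 2 / expected
    return stat


# Hypothetical clone counts for two libraries (not data from this study).
lib_a = {"miR-x": 40, "miR-y": 10, "other": 450}
lib_b = {"miR-x": 5, "miR-y": 12, "other": 483}

freq_a = relative_expression(lib_a)
stat = chi_square_2x2(lib_a["miR-x"], 500, lib_b["miR-x"], 500)
# 3.841 is the chi-square critical value for 1 degree of freedom at alpha = 0.05.
significant = stat > 3.841
```

Under this layout the statistic is undefined when a miR is absent from both libraries (an expected count of zero), so such rows would need to be skipped.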
| RESULTS AND DISCUSSION |
|---|
50%, comparison against miRBase appears to be the most conservative way to identify mir in an unannotated genome. A count of matches between predicted bovine mir and mir from other species was also determined (Supplemental Table S2). As in other species (8, 34), some mir were found clustered in specific genomic regions, suggesting potential coexpression or coregulation. A large cluster with 38 mir was observed from BLAST alignment in Chr21 (59.4- to 59.6-Mb region). A comprehensive list of clusters is presented in Supplemental Table S2. Full-scale genomic inferences about mir clusters in cattle are still incomplete, as the current genome sequence assembly has assigned 80% of the scaffolds to chromosomes, and some of these have not been oriented. Further comparison of mir conserved across different species will also be useful in understanding common regulation of gene expression among species.
Identification of expressed bovine miR.
To validate predicted mir and begin characterizing the miR portion of the transcriptome, five cDNA libraries were constructed from size-fractioned bovine RNA (15–30 bases). A library from early embryo (d30) RNA was constructed to capture the presumed diversity of miR expression during somite differentiation, and four libraries were constructed from tissues of the immune-gut axis. These latter libraries represent tissues important to current animal health and food safety studies.
A total of 3,209 clones were processed to yield 2,617 sequences. These sequences collapsed into 412 clusters that represented potential unique small RNAs and yielded an overall novelty index of 15.7% (Table 1). The 412 unique sequences were evaluated by several criteria (described below) to better determine if these sequences actually represented expression of bovine miR.
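The novelty index quoted above is consistent with the number of unique clusters divided by the number of usable sequences. The short check below assumes that definition, which the text does not state explicitly; the variable names are illustrative.

```python
clones_processed = 3209   # clones picked and sequenced
sequences = 2617          # sequences passing quality and length filters
clusters = 412            # unique sequence clusters

# Assumed definition: unique clusters per usable sequence.
novelty_index = clusters / sequences
# Fraction of processed clones that yielded a usable sequence.
usable_fraction = sequences / clones_processed

print(f"novelty index: {novelty_index:.1%}")
print(f"usable clones: {usable_fraction:.1%}")
```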
The remaining 312 sequence clusters were aligned against the bovine genome to detect potential cloning artifacts. A total of 177 sequence clusters (sequence count of 190) did not match, suggesting only a small percentage of the sequences from the libraries were potential artifacts of small RNA cloning (7%). Alternatively, some of these sequences may not yet be represented in the current draft genome assembly. For the 135 sequence clusters that did match (sequence count of 199), 40 matched tRNA and snoRNA. These sequences along with those not matching the genome sequence were discarded from subsequent analyses.
Secondary structures that incorporated flanking genomic sequence were generated from each of the remaining 95 potential miR sequences. These folded structures were examined for stem-loop motifs of an mir transcript capable of producing an miR (some examples in Fig. 1). A total of 28 sequence clusters could be identified as potentially novel miR (Table 2). The miR-derived sequence that generated structure 7 (Table 2) probably represents a true miR, because this sequence was observed in more than one tissue (Supplemental Table S2). The other 27 miR sequences were only observed in one tissue. These sequences were further characterized by analysis of miR sequence conservation among other species. Such a comparison would be expected to yield several types of results. Those miR sequences not residing within a region of conserved genome sequence would be suggestive of mir loci unique to cattle. This was the case for 12 of the potential 28 unique miR sequences (Table 2). In contrast, the entire stem-loop sequences for the other 16 putative mir were highly similar to sequences found in other animal genomes, supporting classification as probable bovine orthologs to mir not yet identified (Table 2). This class of sequences may also represent potential artifacts arising from cloning degradation products of longer cellular RNAs.
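The filtering steps described in the two preceding paragraphs can be tallied as a simple bookkeeping sketch. All counts are taken from the text; the variable names are illustrative, and the script only checks that the reported numbers are mutually consistent.

```python
total_sequences = 2617

clusters_remaining = 312        # clusters left for genome alignment
no_genome_match = 177           # clusters (190 sequences) not found in the assembly
no_match_sequence_count = 190

genome_match = clusters_remaining - no_genome_match      # clusters that did align
trna_snorna = 40                                         # aligned clusters matching tRNA/snoRNA
fold_candidates = genome_match - trna_snorna             # clusters folded with flanking sequence

novel_mir = 28                  # clusters with a plausible stem-loop
cattle_specific = 12            # not in conserved genome sequence
conserved = novel_mir - cattle_specific

# Share of all sequences that may be cloning artifacts (or assembly gaps).
artifact_fraction = no_match_sequence_count / total_sequences

print(genome_match, fold_candidates, conserved, f"{artifact_fraction:.0%}")
```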
Seven of the 16 novel bovine miR that aligned to regions of conserved genome actually matched stem-loop sequences of previously identified human mir but did not match the corresponding mature miR associated with that sequence (Fig. 2). Our sequences were located on either the complementary 5'- or the 3'-strand of the stem. An analysis of the abundance of these miR across libraries (Fig. 2) revealed that in some cases only the complementary strand to the known human miR was cloned (bta-miR-455-3p, bta-miR-545-5p, and bta-miR-22-5p), while in other cases both strands were observed (bta-miR-21-5p, bta-miR-21-3p, bta-miR-425-5p, bta-miR-425-3p, bta-miR-127-5p, bta-miR-127-3p, bta-miR-193-5p, and bta-miR-193-3p). Similar observations were made by Suh et al. (36) for miR expressed in human embryonic stem cells. The expression of the complementary miR sequences for miR-455, miR-127, miR-193, and miR-22 was intriguing, because the mature and stem-loop sequences were perfect matches to the corresponding human mir and the mature miR observed in cattle have yet to be detected from human samples. However, since the submission of this manuscript, miR-455-3p was cloned in mouse (28) and miR-425-5p was cloned in human (2). To our knowledge there are no studies explaining why different stems of the loop might be selected in different species, but it may be reasonable to predict that the resulting miR will not have the same range of target genes.
Overall expression profile of bovine miR.
A total of 308 small RNAs were observed only once, while 107 were present multiple times. A biologically significant level of miR expression was not determined, but the sequence identity count data suggested that the depth of sequencing was sufficient to support estimation of expression levels for the more highly expressed miR within a specific tissue. Deeper sequencing from each sample would be necessary to fully characterize less prevalent miR based on a recent report that indicated identification of all expressed miR was saturated at approximately 40,000 sequences (13). Because miR modulate expression by binding to target mRNA and appear to act stoichiometrically rather than catalytically, it might be argued that the subtle effects caused by miR expressed with few molecules per cell would be difficult to elucidate with current expression profiling platforms. In any case, the data obtained from these bovine libraries provide an initial survey for comparison of the most abundant miR transcripts in the five tissues sampled and provide the first analysis of miR expression in bovids.
The diversity of observed miR sequences varied between different tissues. The EMB and SI libraries had the highest number of unique sequences (131 and 187 respectively, Table 1), and the highest novelty rate (33.0% and 33.5% respectively, Table 1). In addition, the 10 most observed miR sequences in EMB and SI represented a smaller proportion of the total sequences collected (53% and 46% respectively, Supplemental Table S2) than the top 10 observed in ALN and MLN samples (75% and 85% respectively, Supplemental Table S2). THY had a novelty rate similar to ALN and MLN (17%, Table 1) and a top 10 proportion intermediate between the ALN/MLN and EMB/SI groups (67%, Supplemental Table S2). These results were consistent with the idea that embryos have a high relative diversity of miR expression, which might be explained in part by the varied emerging tissue types within a whole embryo. However, our results also suggested that the SI expresses a diverse repertoire of miR involved in posttranscriptional regulation. The latter observation was supported by a recent study of a colorectal sample (13) that indicated a high diversity of miR expression in alimentary tissue, perhaps indicative of multiple and divergent tissue types contained in the sample similar to those in a whole embryo sample.
To further characterize the abundance of some of the cloned bovine miR across different tissues, quantitative RT-PCR was employed (Table 3). The U6B small nuclear RNA (RNU6B) was used as an internal control. The constant expression level of RNU6B across all tissues examined indicated an equivalent loading of total RNA for all reactions. Thus, the calculated threshold values (Ct) for all miR are presented in Table 3 without correction for the internal control RNU6B. Efficiency of amplification was determined for all miR from serial dilutions of the specific miR cDNA, and values ranged from 1.79 to 1.94. The data obtained by quantitative RT-PCR had correlations between normalized miR counts (normalized across libraries to an equivalent number of clones sequenced per library) and Ct value that ranged from 0.59 to 0.88 (a negative correlation was expected, since the greater the abundance of the miR, the lower the Ct value). Despite the good agreement between quantitative RT-PCR results and normalized library counts, one must consider that the quantitative RT-PCR results reflect the abundance of each miR in relation to the total RNA for each tissue, while the normalized counts reflect the relative abundance of miR identified in each library. As a consequence, comparisons of library counts and quantitative RT-PCR results are qualitative and not quantitative in nature.
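As a rough illustration of how Ct values and amplification efficiency translate into relative abundance, the sketch below uses the standard relations E = 10^(-1/slope) for a dilution series and fold change = E^(ΔCt). The exact calculation used for Table 3 is not given in the text, so this is an assumed, generic formulation; the function names and Ct values are hypothetical.

```python
def efficiency_from_slope(slope):
    """Amplification efficiency per cycle from the slope of Ct versus
    log10(template amount) in a serial dilution (2.0 = perfect doubling)."""
    return 10 ** (-1.0 / slope)


def fold_change(ct_a, ct_b, efficiency=2.0):
    """Abundance in sample A relative to sample B, given raw Ct values
    (no internal-control correction, as for the Ct values in Table 3)."""
    return efficiency ** (ct_b - ct_a)


# Hypothetical Ct values: sample A crosses threshold 5 cycles before sample B.
ideal = fold_change(25.0, 30.0, efficiency=2.0)
low = fold_change(25.0, 30.0, efficiency=1.79)    # lowest reported efficiency
high = fold_change(25.0, 30.0, efficiency=1.94)   # highest reported efficiency

print(f"fold change at E=2.00: {ideal:.0f}")
print(f"fold change at E=1.79 to 1.94: {low:.1f} to {high:.1f}")
```

The spread between the last two values shows why the measured efficiency matters: the same Ct difference implies a noticeably smaller fold change when amplification falls short of perfect doubling.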
3% of unique tags in colorectal tissue (13). Together, these data suggest that significant differences exist in expression levels of miR between similar tissues of different species.
The only other highly expressed miR (Table 3 and Fig. 3) present in all tissues was miR-103. Human microarray and Northern blot data support widespread expression of miR-103 in adult tissues, but data from human lymph node, THY, and SI did not indicate that miR-103 was highly expressed (6, 8, 35). The contrast between the bovine data for miR-103 and human data further supports the notion of species-specific variability in level of expression of miR.
There were 17 miR expressed in all five libraries, disregarding overall expression level (including miR-26a and miR-103 discussed above), and these represented 14% of the combined set of 100 miR matching miRBase plus 28 putative new miR (listed in Supplemental Table S2). The remaining miR showed various patterns of expression (Supplemental Table S2). Of the 128 miR, 50 were represented by a single sequence from one library, making it difficult to assign a particular miR as tissue specific or enriched. All the libraries had approximately the same frequency of these singletons.
Comparison of expression profiles between tissues.
Tissue clustering based on miR expression showed that different tissues have diverse expression profiles, while similar tissues such as abomasal and mesenteric lymph nodes have very similar patterns of miR expression (Fig. 3). Based on function, this result was somewhat expected, even though the two types of lymph nodes reside in different positions along the gut axis. SI and THY were less similar to the lymph nodes, whereas embryonic tissues had a more distinct profile of miR expression. The clustering of THY and SI proximal to lymph nodes could reflect the presence of developing T-cells in the THY and the infiltration of immune cells into the SI. The tissue with the greatest diversity in miR expression was SI (74 distinct miR, Supplemental Table S2). Again, this observation could reflect cell type diversity of this tissue and the rapid turnover of intestinal epithelial cells. Our tissue clustering was consistent with results of a previous study based on the expression profile of several miR from 26 human tissues (35).
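The tissue comparison rests on Pearson correlation between expression profiles. The sketch below computes pairwise correlation distances (1 - r) in plain Python and reports the closest pair of tissues, which is the first merge an agglomerative clustering would make. It is not a reimplementation of the GeneSpring analysis, and the expression profiles are invented for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


# Hypothetical relative expression of four miR in three libraries.
profiles = {
    "ALN": [0.30, 0.10, 0.05, 0.00],
    "MLN": [0.28, 0.12, 0.04, 0.01],
    "EMB": [0.02, 0.05, 0.10, 0.20],
}

# Correlation distance for every pair of tissues.
distances = {
    (a, b): 1.0 - pearson(profiles[a], profiles[b])
    for a, b in combinations(sorted(profiles), 2)
}
closest = min(distances, key=distances.get)
print(closest, round(distances[closest], 3))
```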
Clustering of the miR by expression profile resulted in four major groups (Fig. 3). miR that were present in cluster 1 were preferentially expressed in SI and lymph nodes; cluster 2 were preferentially expressed in embryo, THY, and SI with lower expression in lymph nodes; cluster 3 were expressed in most tissues; and cluster 4 were preferentially expressed in embryo.
Examination of cluster 4 (Fig. 3 and Supplemental Table S2) uncovered two apparently embryo-specific miR, miR-122a and miR-199a*. The largest disparity was found for miR-122a, which occurred with 8% frequency in the EMB library sequences but was not observed in any other libraries. Quantitative RT-PCR results (Table 3) corroborate this observation, since expression of bta-miR-122a was at least 36 times higher in embryo than the other tissues examined. Low expression of miR-122a in adult THY, lymph node, and SI was consistent with previous microarray and Northern analysis of human RNA (8, 35), which showed very low levels of hybridization to miR-122a probes (although high expression was observed in adult liver). Expression in embryo was also consistent with detection in early mouse embryos (11).
The other EMB-specific miR (excluding singletons) was miR-199a*, present at 2% frequency in EMB and not observed (i.e., present at <0.2% frequency) in any of the other libraries. Detection of miR-199* in d30 cattle embryo was consistent with expression observed in zebrafish embryos (39). However, microarray and Northern blot studies suggested expression of miR-199* at low to moderate levels in human lymph node, THY, and SI (8, 35). The most straightforward interpretation of our data was that miR-199a* does not have the same expression pattern in cattle as in human THY, lymph node, and SI. We also observed six miR sequences that were not EMB specific but had higher expression in the embryo library, with relative expression ratios to the next highest expressing tissue of 7.7 (miR-10a), 9.4 (miR-124a), 14 (miR-127), 8.4 (miR-214), 21 (miR-218), and 9.0 (miR-487).
There were 17 nonsingleton miR for which expression was detected in two or more nonembryonic samples. The most dramatic example of this was bta-miR-29a, which was in high relative abundance (4–12%) in the adult tissue libraries, but not observed in EMB. Low-level bta-miR-29a expression in embryo was confirmed by RT-PCR results, where a 31-fold difference in expression was observed for this miR between embryonic and THY tissues. This was consistent with microarray results found for human samples that showed widespread, relatively high expression of miR-29a (6, 8). The data do not rule out a role for miR-29a in embryogenesis, because the EMB library only represents a narrow snapshot of development shortly after completion of somitogenesis. There were two other noteworthy instances of between-tissue variation in miR expression, involving miR-145 and miR-150. In our study, bta-miR-145 was found only in SI. This is consistent with previous expression studies in mice indicating miR-145 has a higher expression in SI than THY or lymph nodes (35). However, a previous study of zebrafish embryos (39) indicated expression of miR-145, which was not observed in our EMB library. This could mean that expression of miR-145 does not occur in d30 embryos or that the level of expression is <0.3%, the level at which we would expect to see at least one clone in the number of sequences obtained. Similarly, bta-miR-150 had higher expression in tissues implicated in immune response (THY and lymph nodes), which agrees with results in mice where miR-150 is highly expressed in lymph node and THY (35). This miR is involved in maturation and differentiation of T and B cells by being upregulated in T and B cells and repressed in Th1 and Th2 cells (29). Our quantitative RT-PCR results are in general agreement with library counts (61-fold higher expression in lymph nodes than in the embryo), but RT-PCR also revealed expression of bta-miR-150 in SI.
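The detection limit invoked above for miR-145 follows from library size: a miR present at frequency f is expected to yield f × N clones among N sequences, so the frequency giving one expected clone is 1/N. The sketch below also gives the binomial probability of seeing at least one clone. Per-library sequence counts are not stated in this section, so the library size used here is a hypothetical value chosen only to land near the quoted 0.3% level.

```python
def detection_threshold(n_sequences):
    """Frequency at which one clone is expected among n_sequences."""
    return 1.0 / n_sequences


def prob_at_least_one(frequency, n_sequences):
    """Probability of sampling at least one clone of a miR present at the
    given frequency, treating clones as independent draws."""
    return 1.0 - (1.0 - frequency) ** n_sequences


# Hypothetical library size (not reported in this section).
n_embryo = 350

threshold = detection_threshold(n_embryo)
p_seen = prob_at_least_one(0.003, n_embryo)

print(f"one expected clone at {threshold:.2%}")
print(f"P(at least one clone | f = 0.3%) = {p_seen:.2f}")
```

Even at the threshold frequency, a miR would be missed in roughly one library out of three, which supports the cautious reading that absence from a library of this size does not demonstrate absence of expression.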
The construction of size-fractioned RNA libraries from bovine tissues allowed discovery and expression profile characterization of over 100 bovine miR. This represents about one-third of the 334 mir predicted with the miRBase data set, which was somewhat surprising, considering that some miR are cell type specific and only a few bovine tissues were sampled. The identification of 28 potential new miR indicates the importance of animal model systems, since our study resulted in the identification of miR not previously identified in the human genome. However, it was evident that a thorough identification of bovine miR will require a broader sampling of tissues and more in-depth sequencing. Certainly, the simple sampling approaches used in this study must be complemented in future studies aimed at determining miR function in bovids, thereby establishing which bovine mir loci are functional as well as structural orthologs of their counterparts in other mammalian species.
| FOOTNOTES |
|---|
Article published online before print. See web site for date of publication (http://physiolgenomics.physiology.org).
* L. L. Coutinho and L. K. Matukumalli contributed equally to this work.