Rna22 is a pattern-based algorithm for the discovery of microRNA target sites and the corresponding heteroduplexes. [1]
The algorithm is conceptually distinct from other methods for predicting microRNA:mRNA heteroduplexes in that it does not use experimentally validated heteroduplexes for training; instead, it relies only on the sequences of known mature miRNAs found in the public databases. The key idea of rna22 is that the reverse complement of any salient sequence features identified in mature microRNA sequences (using pattern discovery techniques) should allow one to identify candidate microRNA target sites in a sequence of interest. Rna22 uses the Teiresias algorithm to discover such salient features. Once a candidate microRNA target site has been located, the targeting microRNA can be identified with the help of any of several algorithms able to compute RNA:RNA heteroduplexes. A new version (v2.0) of the algorithm is available. The v2.0-beta release adds a probability estimate to each prediction, lets users choose the sensitivity/specificity settings on the fly, is significantly faster than the original, and can be accessed through http://cm.jefferson.edu/rna22/Interactive/.
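The reverse-complement idea described above can be illustrated with a small sketch. This is not the rna22 implementation and does not use Teiresias: the patterns are assumed to have been discovered already, `.` is assumed to stand for a wildcard position, and the function names and the example pattern and target are hypothetical, chosen only for illustration.

```python
import re

# Watson-Crick complements for RNA bases.
COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence (A, C, G, U only)."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def reverse_complement_pattern(pattern: str) -> str:
    """Reverse-complement a pattern; '.' (assumed wildcard) stays a wildcard."""
    return "".join("." if ch == "." else COMPLEMENT[ch] for ch in reversed(pattern))


def find_candidate_sites(patterns, target):
    """Scan `target` for the reverse complement of each miRNA-derived pattern.

    Returns sorted (start, matched_text, source_pattern) tuples. A lookahead
    is used so that overlapping matches are also reported.
    """
    hits = []
    for pattern in patterns:
        rc = reverse_complement_pattern(pattern)
        for match in re.finditer(f"(?=({rc}))", target):
            hits.append((match.start(), match.group(1), pattern))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical pattern and target sequence, for illustration only.
    for hit in find_candidate_sites(["GAGGUAG"], "AAACUACCUCAAA"):
        print(hit)
```

Note that the scan needs only the patterns, not the identity of any particular miRNA; in this sketch, pairing a candidate site with a specific miRNA would be a separate step.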
Rna22 neither relies on nor imposes any cross-organism conservation constraints to filter out unlikely candidates; this gives it the ability to discover microRNA binding sites that may not be conserved in phylogenetically proximal organisms. Also, as mentioned above, rna22 can identify putative microRNA binding sites without needing to know the identity of the targeting microRNA. A notable property of rna22 is that it does not require the exact reverse complement of a microRNA's seed to be present in a putative target; it permits bulges and G:U wobbles in the seed region of the heteroduplex. Lastly, the algorithm has been shown to achieve a high signal-to-noise ratio. [2]
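To make the distinction between an exact seed match and a seed match with G:U wobbles concrete, here is a minimal sketch. It is not how rna22 scores heteroduplexes: it handles only ungapped pairing (no bulges), and the function names, thresholds, and example sequences are assumptions made for illustration.

```python
# Base pairs written as (miRNA base, target base).
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def classify_seed_duplex(seed: str, site: str) -> dict:
    """Count pair types between a miRNA seed and an equal-length target site.

    Both sequences are written 5'->3'. The duplex is antiparallel, so the
    first seed base pairs with the last base of the site.
    """
    if len(seed) != len(site):
        raise ValueError("seed and site must have the same length")
    counts = {"watson_crick": 0, "wobble": 0, "mismatch": 0}
    for pair in zip(seed, reversed(site)):
        if pair in WATSON_CRICK:
            counts["watson_crick"] += 1
        elif pair in WOBBLE:
            counts["wobble"] += 1
        else:
            counts["mismatch"] += 1
    return counts


def scan_for_seed_sites(seed, target, max_wobbles=1, max_mismatches=0):
    """Slide over `target` and keep windows within the wobble/mismatch limits.

    Returns (start, window, wobble_count) tuples. The limits are illustrative
    defaults, not values taken from rna22.
    """
    n = len(seed)
    hits = []
    for start in range(len(target) - n + 1):
        window = target[start:start + n]
        counts = classify_seed_duplex(seed, window)
        if counts["wobble"] <= max_wobbles and counts["mismatch"] <= max_mismatches:
            hits.append((start, window, counts["wobble"]))
    return hits


if __name__ == "__main__":
    # Hypothetical seed and target; the target site carries one G:U wobble.
    print(scan_for_seed_sites("GAGGUAG", "AAACUACUUCAAA", max_wobbles=1))
    print(scan_for_seed_sites("GAGGUAG", "AAACUACUUCAAA", max_wobbles=0))
```

With `max_wobbles=0` the scan behaves like a strict seed-match search and misses the wobble-containing site, which is the kind of target a method that requires the exact reverse complement of the seed would overlook.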
Use of rna22 led to the discovery of "non-canonical" microRNA targets in the coding regions of mouse Nanog, Oct4 and Sox2. [3] Most of these targets are not conserved in the human orthologues of these three transcription factors, even though they reside in the coding regions of the corresponding mRNAs. Moreover, most of these targets contain G:U wobbles, one or more bulges, or both, in the seed region of the heteroduplex. In addition to coding regions, rna22 has helped discover non-canonical targets in 3'UTRs. [4]
A recent study [5] examined the problem of non-canonical miRNA targets using molecular dynamics simulations of the crystal structure of the Argonaute-miRNA:mRNA ternary complex. The study found that several kinds of modifications, including combinations of multiple G:U wobbles and mismatches in the seed region, are admissible and result in only minor structural fluctuations that do not affect the stability of the ternary complex. The study also showed that the findings of the molecular dynamics simulations are supported by HITS-CLIP (CLIP-seq) data. These results suggest that bona fide miRNA targets transcend the canonical seed model, which in turn makes target prediction tools like rna22 an ideal choice for exploring the newly augmented spectrum of miRNA targets.
Name | Description | Type | Link | References |
---|---|---|---|---|
RNA22 version 2.0 | The first link (interactive & dynamic) lets the user find, on the fly, putative miRNA binding sites for any sequence of interest (e.g. a protein-coding mRNA or a long non-coding RNA) and for any miRNA (publicly known or novel). The second link [6] (precomputed & static) provides access to RNA22 v2 predictions for all protein-coding transcripts in human, mouse, roundworm, and fruit fly. It allows the user to visualize the predictions within a cDNA map and to find transcripts targeted by multiple miRNAs of interest. | microRNA target predictions | interactive predictions, precomputed predictions | TBD |
RNA22 | The link [6] (precomputed & static) provides access to RNA22 predictions for all protein-coding transcripts in human, mouse, roundworm, and fruit fly. It allows the user to visualize the predictions within a cDNA map and to find transcripts targeted by multiple miRNAs of interest. | microRNA target predictions | precomputed predictions | |
In molecular genetics, the three prime untranslated region (3′-UTR) is the section of messenger RNA (mRNA) that immediately follows the translation termination codon. The 3′-UTR often contains regulatory regions that post-transcriptionally influence gene expression.
Transfer RNA is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length. In a cell, it provides the physical link between the genetic code in messenger RNA (mRNA) and the amino acid sequence of proteins, carrying the correct sequence of amino acids to be combined by the protein-synthesizing machinery, the ribosome. Each three-nucleotide codon in mRNA is complemented by a three-nucleotide anticodon in tRNA. As such, tRNAs are a necessary component of translation, the biological synthesis of new proteins in accordance with the genetic code.
Oct-4, also known as POU5F1, is a protein that in humans is encoded by the POU5F1 gene. Oct-4 is a homeodomain transcription factor of the POU family. It is critically involved in the self-renewal of undifferentiated embryonic stem cells. As such, it is frequently used as a marker for undifferentiated cells. Oct-4 expression must be closely regulated; too much or too little will cause differentiation of the cells.
In biology, reprogramming refers to erasure and remodeling of epigenetic marks, such as DNA methylation, during mammalian development or in cell culture. Such control is also often associated with alternative covalent modifications of histones.
Artificial transcription factors (ATFs) are engineered individual or multi-molecule transcription factors that either activate or repress gene transcription.
RNA silencing or RNA interference refers to a family of gene silencing effects by which gene expression is negatively regulated by non-coding RNAs such as microRNAs. RNA silencing may also be defined as sequence-specific regulation of gene expression triggered by double-stranded RNA (dsRNA). RNA silencing mechanisms are conserved among most eukaryotes. The most common and well-studied example is RNA interference (RNAi), in which endogenously expressed microRNA (miRNA) or exogenously derived small interfering RNA (siRNA) induces the degradation of complementary messenger RNA. Other classes of small RNA have been identified, including piwi-interacting RNA (piRNA) and its subspecies repeat associated small interfering RNA (rasiRNA).
In molecular biology, lin-4 is a microRNA (miRNA) that was identified from a study of developmental timing in the nematode Caenorhabditis elegans. It was the first miRNA to be discovered; miRNAs are a class of non-coding RNAs involved in gene regulation. miRNAs are transcribed as ~70 nucleotide precursors and subsequently processed by the Dicer enzyme to give a 21 nucleotide product. The extents of the hairpin precursors are not generally known and are estimated based on hairpin prediction. The products are thought to have regulatory roles through complete or partial complementarity to mRNA. The lin-4 gene has been found to lie within a 4.11 kb intron of a separate host gene.
The miR-103 microRNA precursor is a short non-coding RNA gene involved in gene regulation. miR-103 and miR-107 have now been predicted or experimentally confirmed in human.
The miR-16 microRNA precursor family is a group of related small non-coding RNA genes that regulate gene expression. miR-16, miR-15, miR-195 and miR-497 are related microRNA precursor sequences from the mir-15 gene family. This microRNA family appears to be vertebrate-specific, and its members have been predicted or experimentally validated in a wide range of vertebrate species.
The miR-24 microRNA precursor is a small non-coding RNA molecule that regulates gene expression. microRNAs are transcribed as ~70 nucleotide precursors and subsequently processed by the Dicer enzyme to give a mature ~22 nucleotide product. In this case the mature sequence comes from the 3' arm of the precursor. The mature products are thought to have regulatory roles through complementarity to mRNA. miR-24 is conserved in various species and is clustered with miR-23 and miR-27 on human chromosomes 9 and 19. Recently, miR-24 has been shown to suppress expression of two crucial cell cycle control genes, E2F2 and Myc, in hematopoietic differentiation and also to promote keratinocyte differentiation by repressing actin-cytoskeleton regulators PAK4, Tsk5 and ArhGAP19.
SRY-box 2, also known as SOX2, is a transcription factor that is essential for maintaining self-renewal, or pluripotency, of undifferentiated embryonic stem cells. Sox2 has a critical role in the maintenance of embryonic and neural stem cells.
This list of microRNA databases and microRNA target databases is a compilation of databases, web portals, and servers used for microRNAs and their targets. MicroRNAs (miRNAs) represent an important class of small non-coding RNAs (ncRNAs) that regulate gene expression by targeting messenger RNAs.
PAR-CLIP is a biochemical method for identifying the binding sites of cellular RNA-binding proteins (RBPs) and microRNA-containing ribonucleoprotein complexes (miRNPs). The method relies on the incorporation of ribonucleoside analogs that are photoreactive, such as 4-thiouridine (4-SU) and 6-thioguanosine (6-SG), into nascent RNA transcripts by living cells. Irradiation of the cells by ultraviolet light of 365 nm wavelength induces efficient crosslinking of photoreactive nucleoside–labeled cellular RNAs to interacting RBPs. Immunoprecipitation of the RBP of interest is followed by isolation of the crosslinked and coimmunoprecipitated RNA. The isolated RNA is converted into a cDNA library and is deep sequenced using next-generation sequencing technology.
High-throughput sequencing of RNA isolated by crosslinking immunoprecipitation (HITS-CLIP) is a variant of CLIP for genome-wide mapping of protein–RNA binding sites or RNA modification sites in vivo. HITS-CLIP was originally used to generate genome-wide protein-RNA interaction maps for the neuron-specific RNA-binding proteins and splicing factors NOVA1 and NOVA2; since then, a number of other maps have been generated, including those for the splicing factors PTB, RbFox2, SFRS1, and hnRNP C, and even for N6-methyladenosine (m6A) mRNA modifications.
miR-296 is a family of microRNA precursors found in mammals, including humans. The ~22 nucleotide mature miRNA sequence is excised from the precursor hairpin by the enzyme Dicer. This sequence then associates with RISC which effects RNA interference.
IsomiRs are miRNA sequences that have variations with respect to the reference sequence. The term was coined by Morin et al. in 2008. It has been found that isomiR expression profiles can also exhibit race, population, and sex dependencies.
In molecular biology, competing endogenous RNAs regulate other RNA transcripts by competing for shared microRNAs (miRNAs). Models for ceRNA regulation describe how changes in the expression of one or multiple miRNA targets alter the number of unbound miRNAs and lead to observable changes in miRNA activity - i.e., the abundance of other miRNA targets. Models of ceRNA regulation differ greatly. Some describe the kinetics of target-miRNA-target interactions, where changes in the expression of one target species sequester one miRNA species and lead to changes in the dysregulation of the other target species. Others attempt to model more realistic cellular scenarios, where multiple RNA targets are affecting multiple miRNAs and where each target pair is co-regulated by multiple miRNA species. Some models focus on mRNA 3' UTRs as targets, and others consider long non-coding RNA targets as well.
MicroRNA sequencing (miRNA-seq), a type of RNA-Seq, is the use of next-generation sequencing or massively parallel high-throughput DNA sequencing to sequence microRNAs, also called miRNAs. miRNA-seq differs from other forms of RNA-seq in that input material is often enriched for small RNAs. miRNA-seq allows researchers to examine tissue-specific expression patterns, disease associations, and isoforms of miRNAs, and to discover previously uncharacterized miRNAs. Evidence that dysregulated miRNAs play a role in diseases such as cancer has positioned miRNA-seq to potentially become an important tool in the future for diagnostics and prognostics as costs continue to decrease. Like other miRNA profiling technologies, miRNA-Seq has both advantages and disadvantages.
In bioinformatics, TargetScan is a web server that predicts biological targets of microRNAs (miRNAs) by searching for the presence of sites that match the seed region of each miRNA. For many species, other types of sites, known as 3'-compensatory sites are also identified. These miRNA target predictions are regularly updated and improved by the laboratory of David Bartel in conjunction with the Whitehead Institute Bioinformatics and Research Computing Group.