UTRdb

Last updated
UTRdb and UTRsite
Database.png
Content
Descriptionsequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs
Organisms Eukaryotes
Contact
LaboratoryIstituto Tecnologie Biomediche del Consiglio Nazionale delle Ricerche, via Amendola 122/D, 70126 Bari, Italy
AuthorsGiorgio Grillo
Primary citationGrillo & al. (2010) [1]
Access
Website http://utrdb.ba.itb.cnr.it/

UTRdb is a database of 5' and 3' untranslated sequences of eukaryotic mRNAs. [1]

Contents

See also

Related Research Articles

<span class="mw-page-title-main">Exon</span> A region of a transcribed gene present in the final functional mRNA molecule

An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term exon refers to both the DNA sequence within a gene and to the corresponding sequence in RNA transcripts. In RNA splicing, introns are removed and exons are covalently joined to one another as part of generating the mature RNA. Just as the entire set of genes for a species constitutes the genome, the entire set of exons constitutes the exome.

<span class="mw-page-title-main">Messenger RNA</span> RNA that is read by the ribosome to produce a protein

In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.

The coding region of a gene, also known as the coding sequence (CDS), is the portion of a gene's DNA or RNA that codes for a protein. Studying the length, composition, regulation, splicing, structures, and functions of coding regions compared to non-coding regions over different species and time periods can provide a significant amount of important information regarding gene organization and evolution of prokaryotes and eukaryotes. This can further assist in mapping the human genome and developing gene therapy.

<span class="mw-page-title-main">Three prime untranslated region</span> Sequence at the 3 end of messenger RNA that does not code for product

In molecular genetics, the three prime untranslated region (3′-UTR) is the section of messenger RNA (mRNA) that immediately follows the translation termination codon. The 3′-UTR often contains regulatory regions that post-transcriptionally influence gene expression.

The 5′ untranslated region is the region of a messenger RNA (mRNA) that is directly upstream from the initiation codon. This region is important for the regulation of translation of a transcript by differing mechanisms in viruses, prokaryotes and eukaryotes. While called untranslated, the 5′ UTR or a portion of it is sometimes translated into a protein product. This product can then regulate the translation of the main coding sequence of the mRNA. In many organisms, however, the 5′ UTR is completely untranslated, instead forming a complex secondary structure to regulate translation.

An internal ribosome entry site, abbreviated IRES, is an RNA element that allows for translation initiation in a cap-independent manner, as part of the greater process of protein synthesis. Initiation of eukaryotic translation nearly always occurs at and is dependent on the 5' cap of mRNA molecules, where the translation initiation complex forms and ribosomes engage the mRNA. IRES elements, however allow ribosomes to engage the mRNA and begin translation independently of the 5' cap.

<span class="mw-page-title-main">Silencer (genetics)</span> Type of DNA sequence

In genetics, a silencer is a DNA sequence capable of binding transcription regulation factors, called repressors. DNA contains genes and provides the template to produce messenger RNA (mRNA). That mRNA is then translated into proteins. When a repressor protein binds to the silencer region of DNA, RNA polymerase is prevented from transcribing the DNA sequence into RNA. With transcription blocked, the translation of RNA into proteins is impossible. Thus, silencers prevent genes from being expressed as proteins.

The Kozak consensus sequence is a nucleic acid motif that functions as the protein translation initiation site in most eukaryotic mRNA transcripts. Regarded as the optimum sequence for initiating translation in eukaryotes, the sequence is an integral aspect of protein regulation and overall cellular health as well as having implications in human disease. It ensures that a protein is correctly translated from the genetic message, mediating ribosome assembly and translation initiation. A wrong start site can result in non-functional proteins. As it has become more studied, expansions of the nucleotide sequence, bases of importance, and notable exceptions have arisen. The sequence was named after the scientist who discovered it, Marilyn Kozak. Kozak discovered the sequence through a detailed analysis of DNA genomic sequences.

Eukaryotic chromosome fine structure refers to the structure of sequences for eukaryotic chromosomes. Some fine sequences are included in more than one class, so the classification listed is not intended to be completely separate.

<span class="mw-page-title-main">Directionality (molecular biology)</span> End-to-end chemical orientation of a single strand of nucleic acid

Directionality, in molecular biology and biochemistry, is the end-to-end chemical orientation of a single strand of nucleic acid. In a single strand of DNA or RNA, the chemical convention of naming carbon atoms in the nucleotide pentose-sugar-ring means that there will be a 5′ end, which frequently contains a phosphate group attached to the 5′ carbon of the ribose ring, and a 3′ end, which typically is unmodified from the ribose -OH substituent. In a DNA double helix, the strands run in opposite directions to permit base pairing between them, which is essential for replication or transcription of the encoded information.

<span class="mw-page-title-main">Untranslated region</span> Non-coding regions on either end of mRNA

In molecular genetics, an untranslated region refers to either of two sections, one on each side of a coding sequence on a strand of mRNA. If it is found on the 5' side, it is called the 5' UTR, or if it is found on the 3' side, it is called the 3' UTR. mRNA is RNA that carries information from DNA to the ribosome, the site of protein synthesis (translation) within a cell. The mRNA is initially transcribed from the corresponding DNA sequence and then translated into protein. However, several regions of the mRNA are usually not translated into protein, including the 5' and 3' UTRs.

<span class="mw-page-title-main">Rotavirus translation</span>

Rotavirus translation, the process of translating mRNA into proteins, occurs in a different way in Rotaviruses. Unlike the vast majority of cellular proteins in other organisms, in Rotaviruses the proteins are translated from capped but nonpolyadenylated mRNAs. The viral nonstructural protein NSP3 specifically binds the 3'-end consensus sequence of viral mRNAs and interacts with the eukaryotic translation initiation factor eIF4G. The Rotavirus replication cycle occurs entirely in the cytoplasm. Upon virus entry, the viral transcriptase synthesizes capped but nonpolyadenylated mRNA The viral mRNAs bear 5' and 3' untranslated regions (UTR) of variable length and are flanked by two different sequences common to all genes.

Polymerase stuttering is the process by which a polymerase transcribes a nucleotide several times without progressing further on the mRNA chain. It is often used in addition of poly A tails or capping mRNA chains by less complex organisms such as viruses.

<span class="mw-page-title-main">5′ flanking region</span>

The 5′ flanking region is a region of DNA that is adjacent to the 5′ end of the gene. The 5′ flanking region contains the promoter, and may contain enhancers or other protein binding sites. It is the region of DNA that is not transcribed into RNA. Not to be confused with the 5′ untranslated region, this region is not transcribed into RNA or translated into a functional protein. These regions primarily function in the regulation of gene transcription. 5′ flanking regions are categorized between prokaryotes and eukaryotes.

The C-rich Not M, -ND stability element is a short part of a gene sequence important for the proper synthesis of globin and several other human proteins.

<span class="mw-page-title-main">Red clover necrotic mosaic virus translation enhancer elements</span>

Red clover necrotic mosaic virus (RCNMV) contains several structural elements present within the 3′ and 5′ untranslated regions (UTR) of the genome that enhance translation. In eukaryotes transcription is a prerequisite for translation. During transcription the pre-mRNA transcript is processes where a 5′ cap is attached onto mRNA and this 5′ cap allows for ribosome assembly onto the mRNA as it acts as a binding site for the eukaryotic initiation factor eIF4F. Once eIF4F is bound to the mRNA this protein complex interacts with the poly(A) binding protein which is present within the 3′ UTR and results in mRNA circularization. This multiprotein-mRNA complex then recruits the ribosome subunits and scans the mRNA until it reaches the start codon. Transcription of viral genomes differs from eukaryotes as viral genomes produce mRNA transcripts that lack a 5’ cap site. Despite lacking a cap site viral genes contain a structural element within the 5’ UTR known as an internal ribosome entry site (IRES). IRES is a structural element that recruits the 40s ribosome subunit to the mRNA within close proximity of the start codon.

<span class="mw-page-title-main">Barley yellow dwarf virus 5'UTR</span>

Barley yellow dwarf virus 5' UTR is a non-coding RNA element containing structural elements required for translation of the genome of the plant disease pathogen Barley yellow dwarf virus.

<span class="mw-page-title-main">Riboregulator</span>

In molecular biology, a riboregulator is a ribonucleic acid (RNA) that responds to a signal nucleic acid molecule by Watson-Crick base pairing. A riboregulator may respond to a signal molecule in any number of manners including, translation of the RNA into a protein, activation of a ribozyme, release of silencing RNA (siRNA), conformational change, and/or binding other nucleic acids. Riboregulators contain two canonical domains, a sensor domain and an effector domain. These domains are also found on riboswitches, but unlike riboswitches, the sensor domain only binds complementary RNA or DNA strands as opposed to small molecules. Because binding is based on base-pairing, a riboregulator can be tailored to differentiate and respond to individual genetic sequences and combinations thereof.

The Atlas of UTR Regulatory Activity (AURA), a biological database, now at its second version, is a manually curated and comprehensive catalog of human 5' and 3' untranslated sequences (UTR) and UTR regulatory annotations. It includes basic annotation, phylogenetic conservation, binding sites for RNA-binding proteins and miRNA, cis-elements, RNA methylation and editing data, and more, for human and mouse. Through its intuitive web interface, it furthermore provides full access to a wealth of information that integrates RNA sequence and structure data, variation sites, gene synteny, gene and protein expression and gene functional descriptions from scientific literature and specialized databases. Eventually, it provides several tool for batch analysis of gene lists, allowing the tracing of post-transcriptional regulatory networks.

In molecular biology, a cap-independent translation element is an RNA sequence found in the 3'UTR of many RNA plant viruses.

References

  1. 1 2 Grillo, Giorgio; Turi Antonio; Licciulli Flavio; Mignone Flavio; Liuni Sabino; Banfi Sandro; Gennarino Vincenzo Alessandro; Horner David S; Pavesi Giulio; Picardi Ernesto; Pesole Graziano (Jan 2010). "UTRdb and UTRsite (RELEASE 2010): a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs". Nucleic Acids Res. 38 (Suppl_1): D75-80. doi:10.1093/nar/gkp902. PMC   2808995 . PMID   19880380.