Vault RNA | |
---|---|
Identifiers | |
Symbol | Vault |
Rfam | RF00006 |
Other data | |
Domain(s) | Eukaryote |
PDB structures | PDBe |
Many eukaryotic cells contain large ribonucleoprotein particles in the cytoplasm known as vaults. [3] The vault complex comprises the major vault protein (MVP), two minor vault proteins (VPARP and TEP1), and a variety of small untranslated RNA molecules known as vault RNAs (vRNAs, vtRNAs) only found in higher eukaryotes. These molecules are transcribed by RNA polymerase III.
Given their association with the nuclear membrane and their location within the cell, vaults are thought to play roles in intracellular and nucleocytoplasmic transport processes. [4] A cryo-electron microscopy study determined that vtRNAs are found close to the end caps of vaults. This positioning suggests that the RNAs could interact with both the interior and the exterior of the vault particle. [5] Overall, the current belief is that vtRNAs do not have a structural role in the vault particle, but rather play some kind of functional role. [6] However, despite an expanding body of research on vtRNA, its exact function has not yet been established.
Vault RNA was first identified as part of the vault ribonucleoprotein complex in 1986. [7] Since the first discovery of non-coding RNA in the mid-1960s, there had been considerable interest in the field. This interest bore fruit in the 1980s during a string of non-coding RNA discoveries, such as ribosomal RNA, snoRNA, Xist, and vault RNA.
Early research in the 1990s looked into the specifics of vault RNA and focused on the conservation of the gene in animals. So far, vault RNAs have been isolated from humans, rodents, and bullfrogs. [8]
Vault proteins, but not vtRNA, have also been found in sea urchin, Dictyostelium discoideum, and Acanthamoeba. [9]
Vaults have been found to be highly expressed in "higher" eukaryotes, specifically mammals, amphibians, and avians, as well as "lower" eukaryotes, such as Dictyostelium discoideum. Given that both the structure and protein composition are highly conserved among these species, researchers posit that their function is crucial to eukaryotic cell function. [8]
vtRNA ranges in length from 86 to 141 bases, depending on the species. While the transcript length stays within this range from species to species, the number of distinct vtRNAs expressed varies. For example, rats and mice express a single vtRNA 141 bases long, while bullfrogs express two vtRNAs: one 89 bases long and the other 94. [8]
Research into human expression of vtRNA points to four related vtRNAs, of which three have so far been identified and described: hvg1 (98 bases), hvg2 (88 bases), and hvg3 (88 bases). The bulk of the vault-associated vtRNA is of the hvg1 type. [4]
Despite the inter-species differences in the vtRNA, the polymerase III promoter elements have been found to be highly conserved. In addition, all vtRNAs are predicted to fold into similar stem-loop structures. [8]
Vault RNAs are short, falling in the range between 80 and 150 nucleotides. Their secondary structures have conserved stem loops that connect the 5' and 3' ends of the molecule, giving a panhandle-like shape. [10] [failed verification] They contain two polymerase III promoter elements, box A and box B, of which box A takes part in conserving structural features whereas box B does not.
About 5% of all cellular vault RNA goes into the vault organelle, the rest remaining free-floating in the cell. [11]
Vault RNAs, in conjunction with the vault complex, have been associated with drug resistance. [12] Vault non-coding RNAs have been shown to be processed into small vault RNAs (svRNAs) through a DICER mechanism. These svRNAs then operate in a similar manner to miRNAs: [13] an svRNA binds an argonaute protein and down-regulates expression of CYP3A4, an enzyme involved in drug metabolism. [14]
One of the major causes of cancer treatment failures is the resistance that cancer cells develop towards chemotherapeutic drugs. vtRNAs have been shown to play a role in this phenomenon due to their interaction with certain chemotherapeutic drugs through specific binding sites. It is believed these interactions lead to the export of the chemical agents released by the chemotherapeutic drugs. [15]
These conclusions come from a study that showed abnormally high levels of vtRNA expression in cancer cells (derived from glioblastoma, leukemia, and osteocarcinoma cell lines) that were resistant to mitoxantrone. The same study showed that when vtRNA expression was weakened, the cancer cells became more sensitive to mitoxantrone. [15] Such studies suggest that vtRNAs might have a role in blocking drugs from reaching their target sites.
Vault non-coding RNAs have been shown to contain multiple cytosine residues that are methylated by the NSUN2 protein. In NSUN2-deficient cells, the loss of cytosine-5 methylation causes incorrect processing into small RNA fragments that end up functioning similarly to microRNAs. As a result, it has been suggested that impaired vault RNA processing may contribute to the symptoms seen in NSUN2 deficiency diseases. [16]
While the function of vault RNAs is still relatively unknown, their unique structure has made these molecules useful in developing new research methods. For example, vtRNAs have been used to benchmark the performance of the sequence query tool fragrep2. [citation needed]
Query tools are used to find regions of similar biological sequence across species. However, such tools (most famously BLAST) struggle to identify sequences that contain insertions and deletions. These highly variable changes can cause a tool to miss true matches or report erroneous ones.
Fragrep2 seeks to solve this problem by using a pattern-based algorithm that finds exact or near-exact matches to a series of sequence motifs within the desired molecule. To help build fragrep2, its developers needed a test molecule and found vault RNAs to be well suited, because vault RNAs generally have two very well conserved sequences surrounded by regions of high variability.
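The idea of searching for conserved motifs separated by variable regions can be illustrated with a minimal sketch. This is not fragrep2's actual algorithm or interface; it is a toy search, with hypothetical function names and made-up motif strings (not real vtRNA sequences), that looks for two short motifs allowing a few substitutions in each and a variable-length gap between them.

```python
def hamming(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def find_motif(seq, motif, max_mismatch):
    """Return start positions where motif matches seq with at most
    max_mismatch substitutions (no insertions or deletions inside the motif)."""
    m = len(motif)
    return [i for i in range(len(seq) - m + 1)
            if hamming(seq[i:i + m], motif) <= max_mismatch]


def find_fragment_pattern(seq, motif_a, motif_b, min_gap, max_gap, max_mismatch=1):
    """Return (start_a, start_b) pairs where motif_a is followed by motif_b
    and the number of bases between them lies in [min_gap, max_gap].
    The gap absorbs insertions and deletions in the variable region."""
    hits = []
    b_starts = find_motif(seq, motif_b, max_mismatch)
    for a in find_motif(seq, motif_a, max_mismatch):
        a_end = a + len(motif_a)
        for b in b_starts:
            if min_gap <= b - a_end <= max_gap:
                hits.append((a, b))
    return hits


# Hypothetical motifs and toy sequences, chosen only for illustration.
MOTIF_A = "GGCUGG"
MOTIF_B = "UUCGAG"
short_gap = "AA" + MOTIF_A + "ACGU" + MOTIF_B + "CC"   # 4-base variable region
long_gap = MOTIF_A + "ACGUACGUAC" + MOTIF_B            # 10-base variable region

for name, seq in [("short_gap", short_gap), ("long_gap", long_gap)]:
    print(name, find_fragment_pattern(seq, MOTIF_A, MOTIF_B, min_gap=2, max_gap=12))
```

Both toy sequences are reported as hits even though the spacer between the motifs differs in length, which is the case that a single ungapped alignment over the whole molecule would handle poorly.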
This tool is significant not only because it has helped advance research on vault RNA, but also because of its other applications within the RNA field. Vault RNAs are not the only kind of RNA with this mix of conserved and highly variable regions; other notable examples include RNase P, RNase MRP, telomerase RNA, and 7SK RNA. [17]