The double-stranded RNA-specific adenosine deaminase enzyme family are encoded by the ADAR family genes. [5] ADAR stands for adenosine deaminase acting on RNA . [6] [7] This article focuses on the ADAR proteins; This article details the evolutionary history, structure, function, mechanisms and importance of all proteins within this family. [5]
ADAR enzymes bind to double-stranded RNA (dsRNA) and convert adenosine to inosine (hypoxanthine) by deamination. [8] ADAR proteins act post-transcriptionally, changing the nucleotide content of RNA. [9] The conversion from adenosine to inosine (A to I) in the RNA disrupts the normal A:U pairing, destabilizing the RNA. Inosine is structurally similar to guanine (G) which leads to inosine to cytosine (I:C) binding. [10] Inosine typically mimics guanosine during translation but can also bind to uracil, cytosine, and adenosine, though it is not favored.
Codon changes may arise from RNA editing leading to changes in the coding sequences for proteins and their functions. [11] Most editing sites are found in noncoding regions of RNA such as untranslated regions (UTRs), Alu elements, and long interspersed nuclear elements (LINEs). [12] Codon changes can give rise to alternate transcriptional splice variants. ADAR impacts the transcriptome in editing-independent ways, likely by interfering with other RNA-binding proteins. [9]
Mutations in this gene are associated with several diseases including HIV, measles, and melanoma. Recent research supports a linkage between RNA-editing and nervous system disorders such as amyotrophic lateral sclerosis (ALS). Atypical RNA editing linked to ADAR may also correlate to mental disorders such as schizophrenia, epilepsy, and suicidal depression. [13]
The ADAR enzyme and its associated gene were discovered accidentally in 1987 as a result of research by Brenda Bass and Harold Weintraub. [14] These researchers were using antisense RNA inhibition to determine which genes play a key role in the development of Xenopus laevis embryos. Previous research on Xenopus oocytes was successful. However, when Bass and Weintraub applied identical protocols to Xenopus embryos, they were unable to determine the embryo's developmental genes. To understand why the method was unsuccessful, they began comparing duplex RNA in both oocytes and embryos. This led them to discover a developmentally regulated activity that denatures RNA:RNA hybrids in embryos.
In 1988, Richard Wagner et al. further studied the activity occurring on Xenopus embryos. [15] They determined a protein was responsible for unwinding of RNA due to the absence of activity after proteinase treatment. This protein is specific for dsRNA and does not require ATP. It became evident this protein's activity on dsRNA modifies it beyond a point of rehybridization but does not fully denature it. Finally, the researchers determined this unwinding is due to the deamination of adenosine residues to inosine. This modification results in mismatched base-pairing between inosine and uridine, leading to the destabilization and unwinding of dsRNA.
ADARs are one of the most common forms of RNA editing, and have both selective and non-selective activity. [16] ADAR is able to modify and regulate the output of gene product, as inosine is interpreted by the cell to be guanosine. ADAR can change the functionality of small RNA molecules. Recently, ADARs have also been discovered as a regulator on splicing and circRNA biogenesis with their editing capability or RNA binding function. [17] [18] [19] It is believed that ADAR evolved from ADAT (Adenosine Deaminase Acting on tRNA), a critical protein present in all eukaryotes, early in the metazoan period through the addition of a dsRNA binding domain. This likely occurred in the lineage which leads to the crown Metazoa. When a duplicate ADAT gene was coupled to another gene which encoded at least one double stranded RNA binding. The ADAR family of genes has been largely conserved over the history of its existence. This, along with its presence in the majority of modern phyla, indicates that RNA editing is essential in regulating genes for metazoan organisms. ADAR has not been discovered in a variety of non-metazoan eukaryotes, such as plants, fungi and choanoflagellates.
ADARs are suggested to have two functions: to increase diversity of the proteome by inducing creation of harmless non-genomically encoded proteins, and protecting crucial translational sites. The conventional belief is their primary role is to increase the diversity of transcripts and expand the protein variation, promoting evolution of proteins. [5]
In mammals, there are three types of ADAR enzymes: ADAR (ADAR1), ADARB1 (ADAR2), and ADARB2 (ADAR3). [5]
ADAR (ADAR1) and ADAR2 (ADARB1)
ADAR one and two are both found within various tissues of the body. These two forms of ADAR are also found to be catalytically active, meaning they can be used as a catalyst in a reaction. Both forms also have similar expression pattern structures of proteins and require substrate double-stranded RNA structures. [11] However, they differ in their editing activity in that both ADAR one and two can edit GluR-B pre-mRNA at the R/G site and only ADAR2 can alter the Q/R site. [20] ADAR1 has been found two have two isoforms, ADAR1p150 and ADARp110. ADAR1p110 is typically found in the nucleus, while ADAR1p150 shuffles between the nucleus and the cytoplasm, mostly present in the cytoplasm.
ADAR3 (ADARB2)
ADAR 3 varies from the other two forms of ADAR in that it is only found within brain tissue. It also is considered to be inactive when it comes to catalytic activity. [11] ADAR3 has been found to be linked to memory and learning in mice, showing that it plays a crucial role in the nervous system. In vitro studies have also shown that ADAR3 might play a role in the regulation of ADAR one and two. [21]
ADARs catalyze the hydrolytic deamination reaction from adenosine to inosine. [8] An activated water molecule will react with adenosine in a nucleophilic substitution reaction with the carbon-6 amine group. A hydrated intermediate will exist for a short period of time, then the amine group will leave as an ammonia ion.
In humans, ADAR enzymes have two to three amino-terminal dsRNA binding domains (dsRBDs), and one carboxy terminal catalytic deaminase domain. [22] In the dsRBD there is a conserved α-β-β-β-α configuration. [11] ADAR1 contains two areas for binding Z-DNA known as Zα and Zβ. [23] [24] ADAR2 and ADAR3 have an arginine rich single stranded RNA (ssRNA) binding domain. A crystal structure of ADAR2 has been solved. [22] In the enzyme active site, there is a glutamic acid residue(E396) that hydrogen bonds to a water. A histidine (H394) and two cysteine residues (C451 and C516) coordinate with a zinc ion. The zinc activates the water molecule for the nucleophilic hydrolytic deamination. Within the catalytic core there is an inositol hexakisphosphate (IP6), which stabilizes arginine and lysine residues.
In mammals the conversion from A to I requires homodimerization of ADAR1 and ADAR2, but not ADAR3. [11] In vivo studies have are not conclusive if RNA binding is required for dimerization. A study with ADAR family mutants showed the mutants were not able to bind to dsRNA but were still able to dimerize, suggesting they may bind based on protein-protein interactions. [11] [25]
ADAR1 is one of multiple genes which often contribute to Aicardi–Goutières syndrome when mutated. [26] Aicardi–Goutières syndrome is a genetic inflammatory disease primarily affecting the skin and the brain and it is characterized by high levels of IFN-α in cerebral spinal fluid. [27] The inflammation is caused by incorrect activation of interferon inducible genes such as those activated to fight off viral infections. Mutation and loss of function of ADAR1 prevents destabilization of double stranded RNA (dsRNA). [28] This buildup of dsRNA stimulates IFN production without a viral infection, causing an inflammatory reaction and autoimmune response. [29] The phenotype in the knock-out mice is rescued by the p150 form of ADAR1 containing the Zα domain that binds specifically to the left-handed double-stranded conformation found in Z-DNA and Z-RNA, but not by the p110 isoform lacking this domain. [30] In humans, the P193A mutation in the Zα domain is causal for Aicardi–Goutières syndrome [26] and for the more severe phenotype found in Bilateral Striatal Necrosis/Dystonia. [31] The findings establish a biological role for the left-handed Z-DNA conformation. [32]
In motor neurons, the most well-grounded marker of amyotrophic lateral sclerosis (ALS) is the TAR DNA-binding protein (TDP-43). When there is failure of RNA-editing due to downregulation of TDP-43, motor neurons devoid of ADAR2 enzymes express unregulated, leading to abnormally permeable Ca2+ channels. ADAR2 knockout mice show signs of ALS phenotype similarity. Current researchers are developing a molecular targeting therapy by normalizing expression of ADAR2. [33]
(ADAR)-induced A-to-I RNA editing may elicit dangerous amino acid mutations. Editing mRNA typically imparts missense mutations leading to alterations in the beginning and terminating regions of translation. However, crucial amino acid changes can occur, resulting in change of function of several cellular processes. Amino acid changes can result in protein structural changes at secondary, tertiary, and quaternary structures. Researchers observed high levels of oncogenetic A-to-I editing in circular RNA precursors, directly confirming ADAR's relationship to cancer. A list of tumor related RNA editing sites can be found here. [34]
Studies of patients with hepatocellular carcinoma (HCC) have shown trends of upregulated ADAR1 and downregulated ADAR2. Results suggest the irregular regulation is responsible for the disrupted A to I editing pattern seen in HCC and that ADAR1 acts as an oncogene in this context whilst ADAR2 has tumor suppressor activities. [35] The imbalance in ADAR expression could change the frequency of A to I transitions in the protein coding region of genes, resulting in mutated proteins which drive the disease. The dysregulation of ADAR1 and ADAR2 could be used as a possible prognostic marker.
Studies have indicated that loss of ADAR1 contributes to melanoma growth and metastasis. ADAR enzymes can act on microRNA and affect its biogenesis, stability and/or its binding target. [36] ADAR1 may be downregulated by cAMP- response element binding protein (CREB), limiting its ability to act on miRNA. [37] One such example is miR-455-5p which is edited by ADAR1. When ADAR is downregulated by CREB the unedited miR-455-5p downregulates a tumor suppressor protein called CPEB1, contributing to melanoma progression in an in vivo model.
A Gly1007Arg mutation in ADAR1, as well as other truncated versions, have been implicated as a cause in some cases of DSH1. [38] This is a disease characterized by hyperpigmentation in the hands and feet and can occur in Japanese and Chinese families.
Expression levels of the ADAR1 protein have shown to be elevated during HIV infection and it has been suggested that it is responsible for A to G mutations in the HIV genome, inhibiting replication. [39] The mutation in the HIV genome by ADAR1 might in some cases lead to beneficial viral mutations which could contribute to drug resistance.
ADAR1 is an interferon ( IFN )-inducible protein (one released by a cell in response to a pathogen or virus), able to assist in a cell's immune pathway. Evidence shows elimination of HCV replicon, Lymphocytic choriomeningitis LCMV, and polyomavirus. [40] [41]
ADAR1 is proviral in other circumstances. ADAR1’s A to I editing has been found in many viruses including measles virus, [42] [41] [43] influenza virus, [44] lymphocytic choriomeningitis virus, [45] polyomavirus, [46] hepatitis delta virus, [47] and hepatitis C virus. [48] Although ADAR1 has been seen in other viruses, it has only been studied extensively in a few. Research on measles virus shows ADAR1 enhancing viral replication through two different mechanisms: RNA editing and inhibition of dsRNA-activated protein kinase (PKR). [40] [41] Specifically, viruses are thought to use ADAR1 as a positive replication factor by selectively suppressing dsRNA-dependent and antiviral pathways. [49]
Z-DNA is one of the many possible double helical structures of DNA. It is a left-handed double helical structure in which the helix winds to the left in a zigzag pattern, instead of to the right, like the more common B-DNA form. Z-DNA is thought to be one of three biologically active double-helical structures along with A-DNA and B-DNA.
RNA editing is a molecular process through which some cells can make discrete changes to specific nucleotide sequences within an RNA molecule after it has been generated by RNA polymerase. It occurs in all living organisms and is one of the most evolutionarily conserved properties of RNAs. RNA editing may include the insertion, deletion, and base substitution of nucleotides within the RNA molecule. RNA editing is relatively rare, with common forms of RNA processing not usually considered as editing. It can affect the activity, localization as well as stability of RNAs, and has been linked with human diseases.
Potassium voltage-gated channel subfamily A member 1 also known as Kv1.1 is a shaker related voltage-gated potassium channel that in humans is encoded by the KCNA1 gene. Isaacs syndrome is a result of an autoimmune reaction against the Kv1.1 ion channel.
Glutamate receptor 3 is a protein that in humans is encoded by the GRIA3 gene.
The 5-HT2C receptor is a subtype of the 5-HT2 receptor that binds the endogenous neurotransmitter serotonin (5-hydroxytryptamine, 5-HT). Like all 5-HT2 receptors, it is a G protein-coupled receptor (GPCR) that is coupled to Gq/G11 and mediates excitatory neurotransmission. HTR2C denotes the human gene encoding for the receptor, that in humans is located on the X chromosome. As males have one copy of the gene and females have one of the two copies of the gene repressed, polymorphisms at this receptor can affect the two sexes to differing extent.
Filamin A, alpha (FLNA) is a protein that in humans is encoded by the FLNA gene.
Glutamate ionotropic receptor kainate type subunit 2, also known as ionotropic glutamate receptor 6 or GluR6, is a protein that in humans is encoded by the GRIK2 gene.
Insulin-like growth factor-binding protein 7 is a protein that in humans is encoded by the IGFBP7 gene. The major function of the protein is the regulation of availability of insulin-like growth factors (IGFs) in tissue as well as in modulating IGF binding to its receptors. IGFBP7 binds to IGF with low affinity compared to IGFBPs 1-6. It also stimulates cell adhesion. The protein is implicated in some cancers.
Double-stranded RNA-specific editase 1 is an enzyme that in humans is encoded by the ADARB1 gene. The enzyme is a member of ADAR family.
Glutamate receptor, ionotropic, kainate 1, also known as GRIK1, is a protein that in humans is encoded by the GRIK1 gene.
Cytoplasmic FMR1-interacting protein 2 is a protein that in humans is encoded by the CYFIP2 gene. Cytoplasmic FMR1 interacting protein is a 1253 amino acid long protein and is highly conserved sharing 99% sequence identity to the mouse protein. It is expressed mainly in brain tissues, white blood cells and the kidney.
Glutamate receptor 4 is a protein that in humans is encoded by the GRIA4 gene.
Gamma-aminobutyric acid receptor subunit alpha-3 is a protein that in humans is encoded by the GABRA3 gene.
Bladder cancer-associated protein is a protein that in humans is encoded by the BLCAP gene.
Double-stranded RNA-specific editase B2 is an enzyme that in humans is encoded by the ADARB2 gene.
ADP-ribosylation-like factor 6 interacting protein 4 (ARL6IP4), also called SRp25 is the product of the ARL6IP4 gene located on chromosome 12q24. 31. Its function is unknown.
Within the science of molecular biology and cell biology, for human genetics, the GRIA2 gene is located on chromosome 4q32-q33. The gene product is the ionotropic AMPA glutamate receptor 2. The protein belongs to a family of ligand-activated glutamate receptors that are sensitive to alpha-amino-3-hydroxy-5-methyl-4-isoxazole propionate (AMPA). Glutamate receptors function as the main excitatory neurotransmitter at many synapses in the central nervous system. L-glutamate, an excitatory neurotransmitter, binds to the Gria2 resulting in a conformational change. This leads to the opening of the channel converting the chemical signal to an electrical impulse. AMPA receptors (AMPAR) are composed of four subunits, designated as GluR1 (GRIA1), GluR2 (GRIA2), GluR3 (GRIA3), and GluR4(GRIA4) which combine to form tetramers. They are usually heterotrimeric but can be homodimeric. Each AMPAR has four sites to which an agonist can bind, one for each subunit.[5]
In molecular biology, the protein domain Adenosine deaminase z-alpha domain refers to an evolutionary conserved protein domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA-editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this region to B-DNA has been shown to be disfavoured by steric hindrance.
LEAPER is a genetic engineering technique in molecular biology by which RNA can be edited. The technique relies on engineered strands of RNA to recruit native ADAR enzymes to swap out different compounds in RNA. Developed by researchers at Peking University in 2019, the technique, some have claimed, is more efficient than the CRISPR gene editing technique. Initial studies have claimed that editing efficiencies of up to 80%.
An RNA timestamp is a technology that enables the age of any given RNA transcript to be inferred by exploiting RNA editing. In this technique, the RNA of interest is tagged to an adenosine rich reporter motif that consists of multiple MS2 binding sites. These MS2 binding sites recruit a complex composed of ADAR2 and MCP. The binding of the ADAR2 enzyme to the RNA timestamp initiates the gradual conversion of adenosine to inosine molecules. Over time, these edits accumulate and are then read through RNA-seq. This technology allows us to glean cell-type specific temporal information associated with RNA-seq data, that until now, has not been accessible.