Pur-alpha is a protein that in humans is encoded by the PURA gene [5] located at chromosome 5, band q31. [6] [7]
Pur-alpha is an ancient, multi-functional DNA- and RNA-binding protein. [5] [8] PURA is expressed in every human tissue, where it exists as a protein of 322 amino acids. According to convention, PURA, the gene, is written italicized in all upper case letters. Pur-alpha, the protein, is written with the first letter capitalized and can be found listed as Pur-alpha, Pur-α, Pura, Puralpha, Pur alpha and Pur1.
Pur-alpha was the first sequence-specific single-stranded DNA-binding protein to be discovered in higher organisms (GenBank M96684.1; GI:190749). [5] It binds to both single-stranded and double-stranded DNA, making contact with G residues in the purine-rich strand of its binding site. Cumulative data shows that Pur-alpha preferentially binds to the sequence (G2-4N1-3)n, where N is not G. N denotes a nucleotide, and n denotes the number of repeats of this small sequence. N may be repeated up to three times in this sequence. [5] [9] Following the identification of a Pur factor, which specifically bound a purine-rich sequence in the control region of the c-MYC gene, [10] the gene, PURA, encoding the protein, Pur-alpha, was cloned and sequenced for both human [5] and mouse (GenBank U02098.1). [8] Pur-alpha belongs to the four-member Pur protein family, which also includes Pur-beta (GenBank AY039216.1; GI:14906267) [5] and two forms of Pur–gamma (Variant A, GenBank AF195513.2; Variant B, GenBank AY077841). [11]
Pur protein sequences from bacteria through humans contain an amino acid segment that is strongly conserved (see NCBI smart00712). [5] [12] Human Pur-alpha contains three repeats of this Pur domain and bacterial Pur-alpha contains one. [5] [13] This evolutionary conservation means that the specific sequence of this domain is important for the survival of most species throughout the spectrum of living organisms. This essential nature of the Pur domain piques interest because the functions of Pur-alpha in lower organisms and in humans differ greatly. For example, Pur-alpha is essential for brain and blood cell development in mammals, [14] but bacteria have no brain and no blood. In humans Pur-alpha functions to activate transcription in the nucleus, to facilitate RNA transport in the cytoplasm and to regulate DNA replication in the cell cycle. [12] In certain functions Pur-alpha interacts with family member Pur-beta. [15] [16] Several cell cycle regulatory functions may be mediated by Pur-alpha binding to Cyclin/Cdk protein kinases, which phosphorylate proteins regulating cell cycle transition points. [17] [18] Requirements for Pur-alpha in all organisms are united by Pur-alpha's ability to bind nucleic acids coupled to its ability to interact with regulatory and transport proteins.
PURA, located at chromosome 5 band q31, is frequently deleted in myelodysplastic syndrome (MDS), [19] a disorder of white blood cells, that may progress to acute myelogenous leukemia (AML). [6] Loss of one copy of chromosome 7 is also frequent in MDS. PURB, the gene encoding Pur-beta, is located at 7p13. A visual fluorescence analysis of chromosomes from MDS patients shows that deletions of PURA at 5q31 are more strongly linked to progression of MDS to AML when combined with deletions of the PURB gene, including complete loss of chromosome 7. [6] All of the PURA deletions noted, involve only one of the two paired, parentally-derived chromosomes. The implication is that Pur-alpha and -beta are each codominantly expressed, and that haploid levels are insufficient for a protective effect against cancer. All known PURA deletions in people occur in only one of the two copies of chromosome 5. [20]
Inducing increased levels of Pur-alpha in several different cultured cancer cell lines blocks cell proliferation. It also blocks anchorage-independent colony formation, a hallmark of cancer. [17] [21] This is true whether Pur-alpha is microinjected or expressed after introducing a cloned PURA cDNA into cells. [22] The Pur-alpha inhibition of cancer cell proliferation occurs at specific points in the cell division cycle, primarily at checkpoints for transition to DNA replication or mitosis. [22] These cell cycle effects are consistent with an interaction between Pur-alpha and CDK, cell cycle-dependent protein kinases. [17] They are also consistent with documented interaction between Pur-alpha and the tumor suppressor protein, Rb. [23]
Studies of genetic inactivation of PURA in the mouse provided evidence leading to that for PURA gene disorders in brain disease. Homozygous PURA knockouts die shortly after birth with severe defects in brain layer development, tissue wasting and movement disorders. Defects in blood cell development are also prominent, and it is not known how these may affect the brain. Heterozygous knockouts do not die early but exhibit seizure-like disorders. [14] In rat hippocampal neurons, Pur-alpha is found in the cytoplasm together with mRNA transcripts, in a complex including non-coding RNAs, Pur-beta, fragile X mental retardation proteins and microtubule-associated proteins. This complex is transported by a kinesin motor [24] [25] to sites of translation at junctions of nerve cell dendrites. [26] Recently PURA mutations have been found in multiple patients with brain disorders of a similar phenotype including hypotonia, developmental delay, movement disorders, and seizure or seizure-like movements. [27] [28] [29] This spectrum of brain disorders is similar to the phenotype of a central nervous system syndrome termed the 5q31.3 microdeletion syndrome, [27] and is the basis for a proposed PURA Syndrome [30] based on PURA mutations rather than just deletions.
In the brain Pur-alpha plays a role in diseases involving glial cells, cells that support nerve cells, as well as diseases involving nerve cells. These diseases include neuro-AIDS. Pur-alpha binds to a regulatory RNA element, called TAR, in the HIV-1 genome. [31] This activates the expression of Tat, a transcriptional activator of its own gene. Pur-alpha binds TAR, allowing Tat to bind an adjacent TAR site to stimulate transcription. Pur-alpha then binds to the Tat protein itself. Pur-alpha also binds Cyclin T1, a regulatory partner of Cdk9 protein kinase, necessary for Tat activity. Cyclin T1/Cdk9 phosphorylates a region of RNA polymerase II. Such phosphorylation of the polymerase enhances its ability to complete RNA synthesis and stimulates replication of the HIV-1 RNA genome. [32] [33]
Pur-alpha participates in development of progressive multifocal leukoencephalopathy (PML), a loss of the nerve sheath formed by oligodendroglial cells. [34] [35] [32] Although HIV-1 is not usually found in these glial cells, HIV-1 proteins can pass through cell membranes to enter them. JCV is considered the causative agent of PML. JCV is activated in the glial cells by certain states of immune system suppression, including HIV-1 infection. [36] There is a documented interaction between Pur-alpha, the HIV-1 protein, Tat, and a Pur-alpha-binding regulatory sequence in JCV DNA. [35] Pur-alpha acts by altering both replication and gene expression of JCV. [34] [37] [38] [35] [39]
Pur-alpha plays a role in ALS, otherwise known as Lou Gehrig's disease. ALS is a motor neuron disease involving both the brain and spinal cord, resulting in progressive loss of muscle control. ALS has several contributing causes, but the most common familial form is due to an expanded repeat of the hexanucleotide GGGGCC at the chromosomal locus C9ORF72. [40] [41] The C9ORF72 hexanucleotide repeat expansion (HRE) is capable of binding Pur-alpha very tightly. Pur-alpha may act in ALS directly by binding this DNA repeat expansion or its single-stranded RNA transcript. [42] [41] One potential consequence of this binding would be to influence an unconventional translation of this transcript repeat that results in long dipeptide repeats. This is termed RAN (Repeat Associated Non-ATG) translation initiation. [43] Aberrant Pur-alpha association with its RNA sequence segment may also be a feature of ALS types that do not involve C9ORF72 expansion. [44] Addition of Pur-alpha suppresses neurodegeneration in mouse neuronal cells and in Drosophila expressing the C9ORF72 HRE. [41] Pur-alpha also reverses neuronal changes caused by defects in the gene, FUS, which can lead to ALS. [44] [45] The mechanism of action of Pur-alpha in ALS is not known. There is presently no evidence that the PURA sequence itself is mutated in the C9ORF72 form of ALS. Rather, it is a regulatory nucleic acid sequence to which Pur-alpha binds that is altered.
The 2017 version of this article was updated by an external expert under a dual publication model. The corresponding academic peer reviewed article was published in Gene and can be cited as: Dianne C Daniel; Edward M Johnson (5 December 2017). "PURA, the gene encoding Pur-alpha, member of an ancient nucleic acid-binding protein family with mammalian neurological functions". Gene . Gene Wiki Review Series. doi:10.1016/J.GENE.2017.12.004. ISSN 0378-1119. PMC 5770235 . PMID 29221753. Wikidata Q46703061. |
Alternative splicing, or alternative RNA splicing, or differential splicing, is an alternative splicing process during gene expression that allows a single gene to code for multiple proteins. In this process, particular exons of a gene may be included within or excluded from the final, processed messenger RNA (mRNA) produced from that gene. This means the exons are joined in different combinations, leading to different (alternative) mRNA strands. Consequently, the proteins translated from alternatively spliced mRNAs usually contain differences in their amino acid sequence and, often, in their biological functions.
RNA-binding proteins are proteins that bind to the double or single stranded RNA in cells and participate in forming ribonucleoprotein complexes. RBPs contain various structural motifs, such as RNA recognition motif (RRM), dsRNA binding domain, zinc finger and others. They are cytoplasmic and nuclear proteins. However, since most mature RNA is exported from the nucleus relatively quickly, most RBPs in the nucleus exist as complexes of protein and pre-mRNA called heterogeneous ribonucleoprotein particles (hnRNPs). RBPs have crucial roles in various cellular processes such as: cellular function, transport and localization. They especially play a major role in post-transcriptional control of RNAs, such as: splicing, polyadenylation, mRNA stabilization, mRNA localization and translation. Eukaryotic cells express diverse RBPs with unique RNA-binding activity and protein–protein interaction. According to the Eukaryotic RBP Database (EuRBPDB), there are 2961 genes encoding RBPs in humans. During evolution, the diversity of RBPs greatly increased with the increase in the number of introns. Diversity enabled eukaryotic cells to utilize RNA exons in various arrangements, giving rise to a unique RNP (ribonucleoprotein) for each RNA. Although RBPs have a crucial role in post-transcriptional regulation in gene expression, relatively few RBPs have been studied systematically.It has now become clear that RNA–RBP interactions play important roles in many biological processes among organisms.
Transcription factor Sp1, also known as specificity protein 1* is a protein that in humans is encoded by the SP1 gene.
In molecular biology, G-quadruplex secondary structures (G4) are formed in nucleic acids by sequences that are rich in guanine. They are helical in shape and contain guanine tetrads that can form from one, two or four strands. The unimolecular forms often occur naturally near the ends of the chromosomes, better known as the telomeric regions, and in transcriptional regulatory regions of multiple genes, both in microbes and across vertebrates including oncogenes in humans. Four guanine bases can associate through Hoogsteen hydrogen bonding to form a square planar structure called a guanine tetrad, and two or more guanine tetrads can stack on top of each other to form a G-quadruplex.
Eukaryotic DNA replication is a conserved mechanism that restricts DNA replication to once per cell cycle. Eukaryotic DNA replication of chromosomal DNA is central for the duplication of a cell and is necessary for the maintenance of the eukaryotic genome.
The HIV trans-activation response (TAR) element is an RNA element which is known to be required for the trans-activation of the viral promoter and for virus replication. The TAR hairpin is a dynamic structure that acts as a binding site for the Tat protein, and this interaction stimulates the activity of the long terminal repeat promoter.
Transcription factor E2F1 is a protein that in humans is encoded by the E2F1 gene.
DNA-directed RNA polymerases I, II, and III subunit RPABC1 is a protein that in humans is encoded by the POLR2E gene.
Transcription factor RelB is a protein that in humans is encoded by the RELB gene.
DNA-directed RNA polymerases I, II, and III subunit RPABC5 is a protein that in humans is encoded by the POLR2L gene.
DNA excision repair protein ERCC-1 is a protein that in humans is encoded by the ERCC1 gene. Together with ERCC4, ERCC1 forms the ERCC1-XPF enzyme complex that participates in DNA repair and DNA recombination.
Filamin A, alpha (FLNA) is a protein that in humans is encoded by the FLNA gene.
Interleukin enhancer-binding factor 3 is a protein that in humans is encoded by the ILF3 gene.
Y box binding protein 1 also known as Y-box transcription factor or nuclease-sensitive element-binding protein 1 is a protein that in humans is encoded by the YBX1 gene.
Transcriptional enhancer factor TEF-1 also known as TEA domain family member 1 (TEAD1) and transcription factor 13 (TCF-13) is a protein that in humans is encoded by the TEAD1 gene. TEAD1 was the first member of the TEAD family of transcription factors to be identified.
Double-stranded RNA-binding protein Staufen homolog 1 is a protein that in humans is encoded by the STAU1 gene.
Interleukin enhancer-binding factor 2 is a protein that in humans is encoded by the ILF2 gene.
Transcriptional activator protein Pur-beta is a protein that in humans is encoded by the PURB gene.
RNA-binding motif, single-stranded-interacting protein 1 is a protein that in humans is encoded by the RBMS1 gene.
In molecular biology, Tat is a protein that is encoded for by the tat gene in HIV-1. Tat is a regulatory protein that drastically enhances the efficiency of viral transcription. Tat stands for "Trans-Activator of Transcription". The protein consists of between 86 and 101 amino acids depending on the subtype. Tat vastly increases the level of transcription of the HIV dsDNA. Before Tat is present, a small number of RNA transcripts will be made, which allow the Tat protein to be produced. Tat then binds to cellular factors and mediates their phosphorylation, resulting in increased transcription of all HIV genes, providing a positive feedback cycle. This in turn allows HIV to have an explosive response once a threshold amount of Tat is produced, a useful tool for defeating the body's response.
{{cite journal}}
: Cite journal requires |journal=
(help)