Poison exons (PEs); also called premature termination codon (PTC) exons or nonsense-mediated decay (NMD) exons] are a class of cassette exons that contain PTCs. Inclusion of a PE in a transcript targets the transcript for degradation via NMD. PEs are generally highly conserved elements of the genome and are thought to have important regulatory roles in biology. [1] [2] Targeting PE inclusion or exclusion in certain transcripts is being evaluated as a therapeutic strategy.
In 2002, a model termed regulated unproductive splicing and translation (RUST) was proposed based on the finding that many (~one-third) alternatively spliced transcripts contain PEs. In this model, coupling alternative splicing to NMD (AS-NMD) is thought to tune transcript levels to regulate protein expression. [3] Alternative splicing may also lead to NMD via other pathways besides PE inclusion, e.g., intron retention. [4] [5]
PEs were initially characterized in RNA-binding proteins from the SR protein family. [1] [2] Genes for other RNA-binding proteins (RBPs) such as those for heterogenous nuclear ribonucleoprotein (hnRNP) also contain PEs. [2] Numerous chromatin regulators also contain PEs, though these are less conserved than PEs within RBPs such as the SR proteins. [6] Multiple spliceosomal components contain PEs. [7]
PE-containing transcripts generally represent a minority of the overall transcript population, in part due to their active degradation via NMD, though this relative abundance can be elevated upon inhibition of NMD or certain biological states. [2] [8] [9] [10] [11] Certain PE-containing transcripts are resistant to NMD and may be translated into truncated proteins. [12]
Cis-regulatory elements neighboring PEs have been found to affect PE inclusion. [13]
Many proteins whose corresponding genes contain PEs autoregulate PE inclusion in their respective transcripts and thereby control their own levels via a feedback loop. [12] [14] [15] [16] [17] [18] Cross-regulation of PE inclusion has also been observed. [19] [20] [21]
Differential splicing of PEs is implicated in biological processes such as differentiation, [22] [23] neurodevelopment, [24] dispersal of nuclear speckles during hypoxia, [25] tumorigenesis, [23] [26] organism growth, [15] and T cell expansion. [27]
PE inclusion can be regulated by external variables such as temperature and electrical activity. For example, PE inclusion in RBM3 transcript is lowered during hypothermia. This is mediated by temperature-dependent binding of the splicing factor HNRNPH1 to the RBM3 transcript. [9] The neuronal RBPs NOVA1/2 are translocated from the nucleus to the cytoplasm during pilocarpine-induced seizure in mice, and it was found that NOVA1/2 regulates the expression of cryptic PEs. [28] The glycosyltransferase O-GlcNAc transferase is responsible for installing the O-GlcNAc post-translational modification and contains a PE. [29] It has been frequently observed that pharmacological or genetic perturbations that elevate cellular O-GlcNAc levels increase PE inclusion in the OGT transcript. [30]
Proper regulation of PE inclusion and exclusion is important for health. Genetic mutations can affect inclusion of PEs and cause disease. For example, loss of CCAR1 leads to PE inclusion in the FANCA transcript, resulting in a Fanconi anemia phenotype. [31]
Dysregulation of components of the splicing machinery can also cause dysregulation of PE inclusion. Mutations in the splicing factor SF3B1 have been found to promote PE inclusion in BRD9 , reducing BRD9 mRNA and protein levels and leading to melanomagenesis. [32] Mutations in U2AF1 promote PE inclusion in EIF4A2 , leading to impaired global mRNA translation and acute myeloid leukemia (AML) chemoresistance through the integrated stress response pathway. [33] The splicing factor SRSF6 contains a PE whose skipping is connected to T cell acute lymphoblastic leukemia (T-ALL), [34] and PE inclusion in SRSF10 is linked to acute lymphoblastic leukemia (ALL). [35]
Intronic mutations can lead to PE inclusion, such as in the case of SCN1A , where mutations within intron 20 promote inclusion of the nearby PE 20N, leading to Dravet syndrome-like phenotypes in mouse models. [36] [37] An intronic mutation in FLNA has been found to impair binding of the splicing regulator PTBP1, leading to inclusion of a poison exon in FLNA transcripts that causes a brain-specific malformation. [24]
With the advent of next-generation sequencing technologies, [38] diagnostic genetic testing has emerged as a powerful tool to diagnose afflictions associated with specific genetic variants. Many diagnostic genetic testing efforts have focused on exome sequencing. [39] PE annotations may improve the diagnostic yield of these tests for certain diseases. For example, variants that affect PE inclusion in sodium channel genes (SCN1A, SCN2A , and SCN8A ) have been found to be associated with epilepsies, and analogous variants in SNRPB have been found to be associated with cerebrocostomandibular syndrome. [40] [41]
As PE inclusion results in transcript degradation, targeted PE inclusion or exclusion is being evaluated as a therapeutic strategy. [42] This strategy may prove especially applicable towards targets whose gene products are not easily ligandable such as "undruggable" proteins. Targeting PE inclusion/exlusion has been demonstrated with both small molecules [43] [44] and antisense oligonucleotides (ASOs). [23] [45] Small molecules may modulate splicing by stabilizing alternative splice sites. [43] [46] ASOs may block specific splice sites or target certain cis-regulatory elements to promote splicing at other sites. [47] [48] These ASOs may also be referred to as splice-switching oligonucleotides (SSOs). [23] [48] ASO walks tiling different ASOs across a gene sequence may be necessary to identify ASOs that have the desired effect on PE inclusion. [45]
Stoke Therapeutics is evaluating a strategy termed Targeted Augmentation of Nuclear Gene Output (TANGO). [45] Targeting exon 20N in SCN1A mRNA with the antisense oligonucleotide STK-001 blocks inclusion of this PE, leading to elevated levels of the productive SCN1A transcript and the gene product sodium channel protein 1 subunit alpha (NaV1.1). In mouse models of Dravet syndrome, which is driven by mutations in SCN1A, [36] [37] [49] STK-001 was able to reduce incidence of electrographic seizures and sudden unexpected death in epilepsy and prolong survival. [50] [51] As of October 2024, STK-001 is being evaluated in phase 2 clinical trials (NCT04740476). [52]
Stoke Therapeutics is also evaluating the ASO STK-002 for treatment of autosomal dominant optic atrophy (ADOA). STK-002 promotes removal of a PE in the transcript of OPA1 , leading to elevated OPA1 protein levels. [53]
Remix Therapeutics developed REM-422, which is an oral small molecule that promotes PE inclusion in the oncogene MYB . REM-422 was discovered through a screening campaign for molecules that promote PE inclusion in MYB. Subsequent in vitro experiments showed that REM-422 selectively facilitates binding of the U1 snRNP complex to oligonucleotides containing the MYB 5' splice site sequence. In various AML cell lines, REM-422 leads to degradation of MYB mRNA and lower MYB protein levels. REM-422 demonstrated antitumor activity in mouse xenograft models of acute myeloid leukemia. [43] As of October 2024, REM-422 is being evaluated in phase 1 clinical trials (NCT06118086, NCT06297941). [54] [55] The splicing modulator small molecule risdiplam, originally developed to promote exon 7 inclusion in the SMN2 transcript for treatment of spinal muscular atrophy, [56] [57] dose-dependently promotes PE inclusion in the MYB transcript as well. [58]
PTC Therapeutics is evaluating the oral small molecule PTC518 as a treatment for Huntington's disease. [44] PTC518 was well-tolerated and showed dose-dependent decreases in HTT mRNA and HTT protein levels in a phase 1 clinical trial. [59] As of October 2024, PTC518 is being evaluated in phase 2 clinical trials (NCT05358717). [60]
Therapeutic targeting of poison exon inclusion/exclusion has also been proposed for oncogenic splicing factors, [23] [26] BRD9 (for treatment of cancer), [32] SYNGAP1, [61] RBM3 (for treatment of neurodegeneration), [47] and CFTR (for treatment of cystic fibrosis). [62]
RNA splicing is a process in molecular biology where a newly-made precursor messenger RNA (pre-mRNA) transcript is transformed into a mature messenger RNA (mRNA). It works by removing all the introns and splicing back together exons. For nuclear-encoded genes, splicing occurs in the nucleus either during or immediately after transcription. For those eukaryotic genes that contain introns, splicing is usually needed to create an mRNA molecule that can be translated into protein. For many eukaryotic introns, splicing occurs in a series of reactions which are catalyzed by the spliceosome, a complex of small nuclear ribonucleoproteins (snRNPs). There exist self-splicing introns, that is, ribozymes that can catalyze their own excision from their parent RNA molecule. The process of transcription, splicing and translation is called gene expression, the central dogma of molecular biology.
Alternative splicing, or alternative RNA splicing, or differential splicing, is an alternative splicing process during gene expression that allows a single gene to produce different splice variants. For example, some exons of a gene may be included within or excluded from the final RNA product of the gene. This means the exons are joined in different combinations, leading to different splice variants. In the case of protein-coding genes, the proteins translated from these splice variants may contain differences in their amino acid sequence and in their biological functions.
A protein isoform, or "protein variant", is a member of a set of highly similar proteins that originate from a single gene and are the result of genetic differences. While many perform the same or similar biological roles, some isoforms have unique functions. A set of protein isoforms may be formed from alternative splicings, variable promoter usage, or other post-transcriptional modifications of a single gene; post-translational modifications are generally not considered. Through RNA splicing mechanisms, mRNA has the ability to select different protein-coding segments (exons) of a gene, or even different parts of exons from RNA to form different mRNA sequences. Each unique sequence produces a specific form of a protein.
Trans-splicing is a special form of RNA processing where exons from two different primary RNA transcripts are joined end to end and ligated. It is usually found in eukaryotes and mediated by the spliceosome, although some bacteria and archaea also have "half-genes" for tRNAs.
SR proteins are a conserved family of proteins involved in RNA splicing. SR proteins are named because they contain a protein domain with long repeats of serine and arginine amino acid residues, whose standard abbreviations are "S" and "R" respectively. SR proteins are ~200-600 amino acids in length and composed of two domains, the RNA recognition motif (RRM) region and the RS domain. SR proteins are more commonly found in the nucleus than the cytoplasm, but several SR proteins are known to shuttle between the nucleus and the cytoplasm.
A primary transcript is the single-stranded ribonucleic acid (RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs are modified in preparation for translation. For example, a precursor mRNA (pre-mRNA) is a type of primary transcript that becomes a messenger RNA (mRNA) after processing.
RNA-binding proteins are proteins that bind to the double or single stranded RNA in cells and participate in forming ribonucleoprotein complexes. RBPs contain various structural motifs, such as RNA recognition motif (RRM), dsRNA binding domain, zinc finger and others. They are cytoplasmic and nuclear proteins. However, since most mature RNA is exported from the nucleus relatively quickly, most RBPs in the nucleus exist as complexes of protein and pre-mRNA called heterogeneous ribonucleoprotein particles (hnRNPs). RBPs have crucial roles in various cellular processes such as: cellular function, transport and localization. They especially play a major role in post-transcriptional control of RNAs, such as: splicing, polyadenylation, mRNA stabilization, mRNA localization and translation. Eukaryotic cells express diverse RBPs with unique RNA-binding activity and protein–protein interaction. According to the Eukaryotic RBP Database (EuRBPDB), there are 2961 genes encoding RBPs in humans. During evolution, the diversity of RBPs greatly increased with the increase in the number of introns. Diversity enabled eukaryotic cells to utilize RNA exons in various arrangements, giving rise to a unique RNP (ribonucleoprotein) for each RNA. Although RBPs have a crucial role in post-transcriptional regulation in gene expression, relatively few RBPs have been studied systematically. It has now become clear that RNA–RBP interactions play important roles in many biological processes among organisms.
Nonsense-mediated mRNA decay (NMD) is a surveillance pathway that exists in all eukaryotes. Its main function is to reduce errors in gene expression by eliminating mRNA transcripts that contain premature stop codons. Translation of these aberrant mRNAs could, in some cases, lead to deleterious gain-of-function or dominant-negative activity of the resulting proteins.
An exonic splicing silencer (ESS) is a short region of an exon and is a cis-regulatory element. A set of 103 hexanucleotides known as FAS-hex3 has been shown to be abundant in ESS regions. ESSs inhibit or silence splicing of the pre-mRNA and contribute to constitutive and alternate splicing. To elicit the silencing effect, ESSs recruit proteins that will negatively affect the core splicing machinery.
Sodium channel protein type 1 subunit alpha (SCN1A), is a protein which in humans is encoded by the SCN1A gene.
Transformer-2 protein homolog beta, also known as TRA2B previously known as splicing factor, arginine/serine-rich 10 (SFRS10), is a protein that in humans is encoded by the TRA2B gene.
Regulator of nonsense transcripts 2 is a protein that in humans is encoded by the UPF2 gene.
Regulator of nonsense transcripts 3B is a protein that in humans is encoded by the UPF3B gene.
RNA-binding motif 10 is a protein that is encoded by the RBM10 gene. This gene maps on the X chromosome at Xp11.23 in humans. RBM10 is a regulator of alternative splicing. Alternative splicing is a process associated with gene expression to produce multiple protein isoforms from a single gene, thereby creating functional diversity and cellular complexity. RBM10 influences the expression of many genes, participating in various cellular processes and pathways such as cell proliferation and apoptosis. Its mutations are associated with various human diseases such as TARP syndrome, an X-linked congenital disorder in males resulting in pre‐ or postnatal lethality, and various cancers in adults.
Regulator of nonsense transcripts 3A is a protein that in humans is encoded by the UPF3A gene.
RNA binding motif protein 9 (RBM9), also known as Rbfox2, is a protein which in humans is encoded by the RBM9 gene.
Fox-1 homolog A, also known as ataxin 2-binding protein 1 (A2BP1) or hexaribonucleotide-binding protein 1 (HRNBP1) or RNA binding protein, fox-1 homolog (Rbfox1), is a protein that in humans is encoded by the RBFOX1 gene.
Serine/arginine-rich splicing factor 1 (SRSF1) also known as alternative splicing factor 1 (ASF1), pre-mRNA-splicing factor SF2 (SF2) or ASF1/SF2 is a protein that in humans is encoded by the SRSF1 gene. ASF/SF2 is an essential sequence specific splicing factor involved in pre-mRNA splicing. SRSF1 is the gene that codes for ASF/SF2 and is found on chromosome 17. The resulting splicing factor is a protein of approximately 33 kDa. ASF/SF2 is necessary for all splicing reactions to occur, and influences splice site selection in a concentration-dependent manner, resulting in alternative splicing. In addition to being involved in the splicing process, ASF/SF2 also mediates post-splicing activities, such as mRNA nuclear export and translation.
mRNA surveillance mechanisms are pathways utilized by organisms to ensure fidelity and quality of messenger RNA (mRNA) molecules. There are a number of surveillance mechanisms present within cells. These mechanisms function at various steps of the mRNA biogenesis pathway to detect and degrade transcripts that have not properly been processed.
An exon junction complex (EJC) is a protein complex which forms on a pre-messenger RNA strand at the junction of two exons which have been joined together during RNA splicing. The EJC has major influences on translation, surveillance, localization of the spliced mRNA, and m6A methylation. It is first deposited onto mRNA during splicing and is then transported into the cytoplasm. There it plays a major role in post-transcriptional regulation of mRNA. It is believed that exon junction complexes provide a position-specific memory of the splicing event. The EJC consists of a stable heterotetramer core, which serves as a binding platform for other factors necessary for the mRNA pathway. The core of the EJC contains the protein eukaryotic initiation factor 4A-III bound to an adenosine triphosphate (ATP) analog, as well as the additional proteins Magoh and Y14. The binding of these proteins to nuclear speckled domains has been measured recently and it may be regulated by PI3K/AKT/mTOR signaling pathways. In order for the binding of the complex to the mRNA to occur, the eIF4AIII factor is inhibited, stopping the hydrolysis of ATP. This recognizes EJC as an ATP dependent complex. EJC also interacts with a large number of additional proteins; most notably SR proteins. These interactions are suggested to be important for mRNA compaction. The role of EJC in mRNA export is controversial.
This article needs additional or more specific categories .(November 2024) |