Surp | |||||||||
---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||
Symbol | Surp | ||||||||
Pfam | PF01805 | ||||||||
InterPro | IPR000061 | ||||||||
|
In molecular biology, the protein domain SWAP is derived from the term Suppressor-of-White-APricot, a splicing regulator from the model organism Drosophila melanogaster. The protein domain is found in regulators that control splicing. It is found in splicing regulatory proteins. [1] When a gene is expressed the DNA must be transcribed into messenger RNA (mRNA). However, it sometimes contains intervening or interrupting sequences named introns. mRNA splicing helps to remove these sequences, leaving a more favourable sequence. mRNA splicing is an essential event in the post-transcriptional modification process of gene expression. [2] SWAP helps to control this process in all cells except gametes.
The role of the protein domain SWAP is to control sex-independent pre-mRNA processing in somatic cells, that is, in every cell except the sex cells This includes autoregulation, whereby it regulates the splicing of its own pre-mRNA. [3] The mammalian homologue of SWAP acts as a thyroid hormone regulated gene. This mean it is controlled by the thyroid. [4] [5]
SWAP proteins share a colinearly arrayed series of novel sequence motifs. [3] This means that they have been conserved over time. The SWAP protein in different organisms share some similarity in terms of sequence and may have been related at some point in evolutionary history.
RNA splicing, in molecular biology, is a form of RNA processing in which a newly made precursor messenger RNA (pre-mRNA) transcript is transformed into a mature messenger RNA (mRNA). During splicing, introns are removed and exons are joined together. For nuclear-encoded genes, splicing takes place within the nucleus either during or immediately after transcription. For those eukaryotic genes that contain introns, splicing is usually required in order to create an mRNA molecule that can be translated into protein. For many eukaryotic introns, splicing is carried out in a series of reactions which are catalyzed by the spliceosome, a complex of small nuclear ribonucleoproteins (snRNPs). Self-splicing introns, or ribozymes capable of catalyzing their own excision from their parent RNA molecule, also exist.
Alternative splicing, or alternative RNA splicing, or differential splicing, is an alternative splicing process during gene expression that allows a single gene to code for multiple proteins. In this process, particular exons of a gene may be included within or excluded from the final, processed messenger RNA (mRNA) produced from that gene. This means the exons are joined in different combinations, leading to different (alternative) mRNA strands. Consequently, the proteins translated from alternatively spliced mRNAs will contain differences in their amino acid sequence and, often, in their biological functions. Notably, alternative splicing allows the human genome to direct the synthesis of many more proteins than would be expected from its 20,000 protein-coding genes.
SR proteins are a conserved family of proteins involved in RNA splicing. SR proteins are named because they contain a protein domain with long repeats of serine and arginine amino acid residues, whose standard abbreviations are "S" and "R" respectively. SR proteins are ~200-600 amino acids in length and composed of two domains, the RNA recognition motif (RRM) region and the RS domain. SR proteins are more commonly found in the nucleus than the cytoplasm, but several SR proteins are known to shuttle between the nucleus and the cytoplasm.
A primary transcript is the single-stranded ribonucleic acid (RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs are modified in preparation for translation. For example, a precursor mRNA (pre-mRNA) is a type of primary transcript that becomes a messenger RNA (mRNA) after processing.
The 5′ untranslated region is the region of an mRNA that is directly upstream from the initiation codon. This region is important for the regulation of translation of a transcript by differing mechanisms in viruses, prokaryotes and eukaryotes. While called untranslated, the 5′ UTR or a portion of it is sometimes translated into a protein product. This product can then regulate the translation of the main coding sequence of the mRNA. In many organisms, however, the 5′ UTR is completely untranslated, instead forming complex secondary structure to regulate translation.
Heterogeneous nuclear ribonucleoproteins (hnRNPs) are complexes of RNA and protein present in the cell nucleus during gene transcription and subsequent post-transcriptional modification of the newly synthesized RNA (pre-mRNA). The presence of the proteins bound to a pre-mRNA molecule serves as a signal that the pre-mRNA is not yet fully processed and therefore not ready for export to the cytoplasm. Since most mature RNA is exported from the nucleus relatively quickly, most RNA-binding protein in the nucleus exist as heterogeneous ribonucleoprotein particles. After splicing has occurred, the proteins remain bound to spliced introns and target them for degradation.
The minor spliceosome is a ribonucleoprotein complex that catalyses the removal (splicing) of an atypical class of spliceosomal introns (U12-type) from eukaryotic messenger RNAs in plants, insects, vertebrates and some fungi. This process is called noncanonical splicing, as opposed to U2-dependent canonical splicing. U12-type introns represent less than 1% of all introns in human cells. However they are found in genes performing essential cellular functions.
Timeless (tim) is a gene in multiple species but is most notable for its role in Drosophila for encoding TIM, an essential protein that regulates circadian rhythm. Timeless mRNA and protein oscillate rhythmically with time as part of a transcription-translation negative feedback loop involving the period (per) gene and its protein.
Period (per) is a gene located on the X chromosome of Drosophila melanogaster. Oscillations in levels of both per transcript and its corresponding protein PER have a period of approximately 24 hours and together play a central role in the molecular mechanism of the Drosophila biological clock driving circadian rhythms in eclosion and locomotor activity. Mutations in the per gene can shorten (perS), lengthen (perL), and even abolish (per0) the period of the circadian rhythm.
The PER1 gene encodes the period circadian protein homolog 1 protein in humans.
RNA-binding protein 8A is a protein that in humans is encoded by the RBM8A gene.
60S ribosomal protein L7a is a protein that in humans is encoded by the RPL7A gene.
Splicing factor 1 also known as zinc finger protein 162 (ZFM162) is a protein that in humans is encoded by the SF1 gene.
RNA binding motif protein 9 (RBM9), also known as Rbfox2, is a protein which in humans is encoded by the RBM9 gene.
Transformer-2 protein homolog alpha is a protein that in humans is encoded by the TRA2A gene.
Fox-1 homolog A, also known as ataxin 2-binding protein 1 (A2BP1) or hexaribonucleotide-binding protein 1 (HRNBP1) or RNA binding protein, fox-1 homolog (Rbfox1), is a protein that in humans is encoded by the RBFOX1 gene.
In molecular biology the DM domain is a protein domain first discovered in the doublesex proteins of Drosophila melanogaster and is also seen in C. elegans and mammalian proteins. In D. melanogaster the doublesex gene controls somatic sexual differentiation by producing alternatively spliced mRNAs encoding related sex-specific polypeptides. These proteins are believed to function as transcription factors on downstream sex-determination genes, especially on neuroblast differentiation and yolk protein genes transcription.
Splicing factor, suppressor of white-apricot homolog is a protein in humans that is encoded by the SFSWAP gene.
A minigene is a minimal gene fragment that includes an exon and the control regions necessary for the gene to express itself in the same way as a wild type gene fragment. This is a minigene in its most basic sense. More complex minigenes can be constructed containing multiple exons and intron(s). Minigenes provide a valuable tool for researchers evaluating splicing patterns both in vivo and in vitro biochemically assessed experiments. Specifically, minigenes are used as splice reporter vectors and act as a probe to determine which factors are important in splicing outcomes. They can be constructed to test the way both cis-regulatory elements and trans-regulatory elements affect gene expression.
dClock (clk) is a gene located on the 3L chromosome of Drosophila melanogaster. Mapping and cloning of the gene indicates that it is the Drosophila homolog of the mouse gene CLOCK (mClock). The Jrk mutation disrupts the transcription cycling of per and tim and manifests dominant effects.