RBFOX1 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | RBFOX1 , 2BP1, A2BP1, FOX-1, FOX1, HRNBP1, RNA binding protein, fox-1 homolog 1, RNA binding fox-1 homolog 1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 605104 MGI: 1926224 HomoloGene: 69339 GeneCards: RBFOX1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Fox-1 homolog A, also known as ataxin 2-binding protein 1 (A2BP1) or hexaribonucleotide-binding protein 1 (HRNBP1) or RNA binding protein, fox-1 homolog (Rbfox1), is a protein that in humans is encoded by the RBFOX1 gene. [5]
The RBFOX1 gene was first studied in Caenorhabditis elegans (nematodes), Drosophila melanogaster (fruit flies), and Danio rerio (zebrafish) with origins in embryology and development. The derivation of the nomenclature for RBFOX1 comes from the original sexual differentiation studies in C. elegans where the gene was denoted as 'Feminizing locus On X' (Fox-1). This refers to a lethal splicing event which causes an increase in the chromosomal X:A ratio; feminizing XO males. In Drosophila, the gene is known as CG3206 and was noted to code for an RNA-binding protein, be affected by Notch-signaling, and be associated with non-D/V (dorso-ventral) cells of the wing discs during wing development. The 'RB' portion of the gene's name extends from the RNA-binding (RB) properties of the coded protein. In zebrafish, rbfox genes were identified as being essential for cardiac and skeletal muscle development, causing reduced heart rate and paralysis respectively in morphants. The discovery of RBFOX1 in humans was due to the interaction of Rbfox1 with ataxin-2, hence the alternative name of A2BP1 (or ataxin-2 binding protein-1). [6]
RBFOX1 is located on chromosome 16 and consists of 30 exons. The Rbfox1 protein consists of 397 amino acids (AA) and is 42,784 Da. The canonical folding of the protein includes three beta sheets and two alpha helices. The localization of Rbfox1 protein is determined by its own alternative splicing via RBFOX proteins. If exon 19 is included, Rbfox1 will be cytoplasmic, but if exon 19 is excluded, Rbfox1 will be nuclear.
There are five isoforms of Rbfox1 due to alternative splicing. The canonical variant, isoform 1, is also known as gamma. This RBFOX1 transcript includes three conserved domains in its sequence. The most clinically relevant of these domains is the RNA recognition motif (RRM) located between 137-212. This domain allows for the important property of RNA binding for the Rbfox1 protein. Another conserved domain of RBFOX1 is the calcitonin gene-related peptide regulator C terminal. RBFOX1's C terminal is located between 273-363 and, as the name suggests, regulates the calcitonin gene-related peptide. The third conserved domain of RBFOX1 is the ELAV/HuD family splicing factor. HUD is human paraneoplastic encephalomyelitis antigen D whereas ELAV is Drosophila embryonic lethal abnormal visual protein. ELAV-like splicing factors are also known in humans as HuB (ELAV-like protein 2), HuC (ELAV-like protein 3, Paraneoplastic cerebellar degeneration-associated antigen), and HuR (ELAV-like protein 1). This super family domain contains three RRMs and is located between 25-208. [8]
SNP | Alleles | AA Change | Type | Location | Associated Disease | Ref | |
---|---|---|---|---|---|---|---|
Common | rs147023054 | C>T | intron variation | [9] | |||
rs372761949 | G/A | V180M | missense variation | [9] | |||
rs974157467 | ACTGCCG/A | inframe deletion | [9] | ||||
rs145873257 | G/A | G353S | missense variation | [9] | |||
rs2093621567 | CA/C | frameshift variation | [9] | ||||
Disease Relevant | rs12921846 | A>T | intron variation | intron 3 | Conduct Disorder in ADHD | [9] [10] | |
rs10153149 | A>C | intron variation | intron 3 | Conduct Disorder in ADHD | [10] | ||
rs9940753 | G>C | intron variation | ADHD, ASD | [9] | |||
rs12447542 | Unknown | Schizophrenia | [10] | ||||
rs12444931 | G>A | intron variation | Schizophrenia, Bipolar Disorder | [9] [10] | |||
rs133341055 | T>G | intron variation | intron 1 | Anxiety | [10] | ||
rs809682 | Unknown | Anxiety | [10] | ||||
rs142723691 | A>G | intron variation | Hepatitis A | [9] | |||
rs6500818 | C>T | intron variation | Dengue Shock Syndrome | [9] | |||
rs192187627 | A>C | intron variation | COVID-19 | [9] |
There are forty possible isoforms but only five beyond the canonical sequence are understood and confirmed in the population. Isoform 2 of RBFOX1, also known as alpha, is a shorter form of the canonical sequence as it is missing an in-frame segment on the 3' coding region. The third variant, beta, is also a shorter version of isoform 1. This shortening is caused by an alternate exon in the 3' coding region. Because of this, isoform 3 has a differing C-terminus located between 273-360. RBFOX1's isoform 4 differs in that the 5'UTR (untranslated region) lacks an in-frame section of the 3' coding region. This shorter isoform is encoded by variants 4 and 6 and has an alternate N-terminus. This isoform includes changes of locations of two of the conserved domains and one other domain: cell division protein ZipA becomes located between 4-122, calcitonin gene-related peptide regulator C terminal becomes located between 253-342, and the RNA recognition motif becomes located between 117-192. Isoform 5 contains a different 5'UTR as well as multiple coding region differences. Beyond these internal differences, isoform 5 also has a shorter and distinct N-terminus. The C terminus is located between 226-315 while the RRM domain is located between 117-192. The ZipA protein domain is located between 4-122. The differences of isoform 6 results in the use of an alternate start codon and a frameshift in the 3' coding region. The UTR is changed and multiple coding regions are altered. Uniquely, this isoform contains a longer rather than shorter N-terminus and a distinct C-terminus. The locations for the ZipA protein, calcitonin gene-related peptide regulator C terminal, and RRM are 33-165, 296-385, and 160-235, respectively. [9]
RBFOX1 is expressed in human heart, muscle, and neuronal tissues. The primary function is regulation of alternative splicing of associated genes. Several alternatively spliced transcript variants have been found for this gene with some localizing to the nucleus and others to the cytoplasm. Nuclear variants have a well-established role in tissue specific alternative splicing. Rbfox1 cytoplasmic variants modulate mRNA stability and translation. In stressed cells, Rbfox1 has been demonstrated to localize to cytoplasmic stress granules. Rbfox1 has an RNA recognition motif that is highly conserved among RNA-binding proteins. Rbfox1, and the related protein Rbfox2, bind the consensus RNA sequence motif (U)GCAUG within introns to exert their functions as alternative splicing factors. The C terminus of RBFOX1 contains the code for a protein, calcitonin gene-related peptide, involved with mediating neuron-specific splicing. Together, Rbfox1 and Rbfox2 repress exon 4 inclusion. In particular, for Drosophila, two cytoplasmic Rbfox1 isoforms bind Pumilio mRNA for silencing. Because of this destabilization, germline development is promoted and reversion to earlier stages is prevented. [12] [13] The alternative splicing activity of RBFOX1 also aids in neuronal development specifically for CaV1.2 voltage-gated calcium channels and N-methyl-D-aspartate (NMDA receptors). [14] [15] The overall activity and molecular mechanism of alternative splicing mediation for RBFOX1 is not fully understood, but some qualities have been established in recent studies. For example, exon inclusion is sufficiently promoted with only the carboxy terminal tethered downstream of the alternative exon. Conversely, for repression, both the RNA binding motif and carboxy terminal are required when tethered upstream of the alternative exon. Possible proteins that aid in the inclusion or skipping process are not confirmed, though both hnRNPH and RALY have been shown to bind Rbfox1. Thus, the specific mechanisms of alternative splicing maintenance via RBFOX1 are unknown. In one study, dominant-negative RBFOX protein interfered with exon activation, though not exon skipping. Because of this knowledge, repression maintenance most likely includes other proteins or outside factors near the binding sites. In C. elegans, co-operative binding between SUP-12 and RBFOX1 is noted to account for tissue-specific splicing. In mammals, there is a more universal cooperativity between RBFOX and NOVA family of proteins. The overall repression and inclusion activity of exons via RBFOX1 seem to be positionally-related. That is, a location downstream of an intron would lead to exon inclusion and a location upstream of an intron would lead to exon exclusion. [16] [6] [17]
Autism spectrum disorder is a neurodevelopmental disorder of social communication and repetitive behaviors as well as fixated interests and/or sensory behavior. Autism spectrum disorder is typically diagnosed in adolescence, but it is possible to be diagnosed in later stages of life. According to the DSM-5-TR, a diagnosis of Autism spectrum disorder requires at least two of the four restricted repetitive behaviors and all three verbal or nonverbal communication deficits. [18] [19] Mutations of RBFOX1 are not sufficient to single-handedly develop autism, but rather also require an environmental risk factor. Numerous autism spectrum disorder samples from cohorts and isolated autistic patients have been linked to de novo copy number variations of RBFOX1. Universally, cases from these studies involved intragenic deletions of either exons 5, 6, or 1D. In human progenitor cell lines (a stem cell culture method) modeling haploinsufficiency in neuronal differentiation, a knockdown (interference with gene or protein activity) study of RBFOX1 revealed significant changes in RNA splicing and gene expression. Similarly, whole transcriptome analysis of patients with autism spectrum disorder showcased a reduction of RBFOX1 and dysregulation of RBFOX1-dependent alternative splicing. [6] [10] RBFOX1 also contributes to mRNA stability of autism-related genes by blocking miRNA binding. [13]
While epilepsy, episodes of recurrent seizures, is most notably a neurological disorder, there are some cases which link the disease to issues with neuronal development. The two types of seizures are convulsive (60%) and non-convulsive (40%) with varying subcategories in each branch. A seizure is sporadic neural activity with no purpose. [20] [21] Interestingly, there is some comorbidity between autism spectrum disorder and epilepsy. Though it is unknown the specifics of how RBFOX1 affects neuronal development, it has been shown in neural-specific mouse knockouts that synaptic transmission and increased membrane excitability occur, causing a predisposition to seizures. [6] [22] [23] RBFOX1 potentially provides mRNA stability for synaptic genes by blocking miRNA binding. [13]
While the causes of ADHD are not agreed upon, it is known there are genetic risk factors that can contribute to the predisposition to the disorder. Oftentimes, a diagnosis requires a series of tests, observations, and questionnaires with the patient proving at least six of the nine inattentive and at least six of the nine hyperactivity and impulsivity symptoms (according to the DSM-5). [24] Because RBFOX1 has been noted to affect neuronal migration and synapse formation, there may be reasonable concern for its contribution to predisposition of ADHD. [25]
Schizophrenia is a disorder with both positive (delusions, hallucinations, and disorganized thought) and negative (povery of speech, social withdrawal, and flattened effect) symptoms. In some individual studies, copy number variations of RBFOX1 have been linked with schizophrenia at low levels with a notable increase in risk for male-specific schizophrenia. This increased risk is said to be due to a duplication before exon 6. [13]
Spinocerebellar ataxia is a neurodegenerative disease that slowly impedes gait, causes slurred speech, and causes an inability to control motor functions such as balance and coordination. This group of ataxias typically do not begin until adulthood. Several mechanisms play into the manifestation of this disease including ion channel dysfunction, RNA toxicity, and proteotoxicity. Due to the heterogenous nature of spinocerebellar ataxia, therapies are very difficult to develop and would most likely require specificity for each type. Rbfox1 is noted to be a possible contributor to spinocerebellar ataxia type 2 (SCA2), one of twelve dominant repeat expansion SCAs. The repeat is a CAG and causes an excessive string of glutamines to be translated. Unlike most other spinocerebellar ataxias which are purely cerebral, SCA2 also includes neurodegeneration. The Rbfox1/A2BP1 protein binds to the C-terminus of ataxin-2, and may contribute to the restricted pathology of SCA2. Ataxin-2 is the gene product of the SCA2 gene which causes familial diseases. The polyglutamine spinocerebellar aAtaxias not only have RNA foci and proteinaceous inclusions, but also the misfolded proteins themselves seem to aggregate in neuronal nuclei. [26]
Alzheimer's disease is a complex disease with different contributing pathological aspects. The most agreed upon pathologies are amyloid plaques, neurofibrillary tau tangles, and neuroinflammation. The amyloid plaques are extracellular with respect to neurons and occur early on in neurodegenerative diseases. Tau aids in the intracellular structure of the neuron by binding to and strengthening microtubules. When mutated, the tau can abnormally phosphorylate or misfold and bind to itself, causing tangles that damage the neuron. These tangles are typically seen in the later stages of neurodegenerative diseases. [28] In healthy humans, misfolded tau can be cleared from the system by the ubiquitin-proteasome system (UPS) or the autophagy-lysosome pathway. [29] In genetically predisposed or aged humans, these systems lose efficiency and can no longer handle the accumulating amount of misfolded tau, causing tangles to form more often without a way of clearing. One aspect of predisposition includes different isoforms of the beta amyloid precursor protein (APP). These isoforms are caused by varying cleavages of APP by either beta secretase and gamma secretase or alpha secretase. The longer forms of APP are prone to aggregating and causing disruptions of the system. [30] In particular, within in vitro experiments, RBFOX1 upregulation seems to be associated with an increase in the APP714 isoform. This isoform excludes exon 7 without including exon 8 of the APP, causing a shorter form of APP. In the brains of people with Alzheimer's disease, RBFOX1 was downregulated in the dorsolateral prefrontal cortex tissue; this points to the possibility of RBFOX1 playing a role in alternative splicing within the prefrontal cortex and contributing to control of plaques. [31] With regards to neuroinflammatory contribution to AD, RBFOX1 also may have ties with microglia. According to genome wide association (GWA) data, moduleQTL (modQTL) RBFOX1 SNP may alter gene expression of microglia. [32] [10]
Alternative splicing, or alternative RNA splicing, or differential splicing, is an alternative splicing process during gene expression that allows a single gene to code for multiple proteins. In this process, particular exons of a gene may be included within or excluded from the final, processed messenger RNA (mRNA) produced from that gene. This means the exons are joined in different combinations, leading to different (alternative) mRNA strands. Consequently, the proteins translated from alternatively spliced mRNAs usually contain differences in their amino acid sequence and, often, in their biological functions.
A protein isoform, or "protein variant", is a member of a set of highly similar proteins that originate from a single gene or gene family and are the result of genetic differences. While many perform the same or similar biological roles, some isoforms have unique functions. A set of protein isoforms may be formed from alternative splicings, variable promoter usage, or other post-transcriptional modifications of a single gene; post-translational modifications are generally not considered. Through RNA splicing mechanisms, mRNA has the ability to select different protein-coding segments (exons) of a gene, or even different parts of exons from RNA to form different mRNA sequences. Each unique sequence produces a specific form of a protein.
Aromatic L-amino acid decarboxylase, also known as DOPA decarboxylase (DDC), tryptophan decarboxylase, and 5-hydroxytryptophan decarboxylase, is a lyase enzyme, located in region 7p12.2-p12.1.
L1, also known as L1CAM, is a transmembrane protein member of the L1 protein family, encoded by the L1CAM gene. This protein, of 200-220 kDa, is a neuronal cell adhesion molecule with a strong implication in cell migration, adhesion, neurite outgrowth, myelination and neuronal differentiation. It also plays a key role in treatment-resistant cancers due to its function. It was first identified in 1984 by M. Schachner who found the protein in post-mitotic mice neurons.
Tropomyosin is a two-stranded alpha-helical, coiled coil protein found in many animal and fungal cells. In animals, it is an important component of the muscular system which works in conjunction with troponin to regulate muscle contraction. It is present in smooth and striated muscle tissues, which can be found in various organs and body systems, including the heart, blood vessels, respiratory system, and digestive system. In fungi, tropomyosin is found in cell walls and helps maintain the structural integrity of cells.
RNA-binding proteins are proteins that bind to the double or single stranded RNA in cells and participate in forming ribonucleoprotein complexes. RBPs contain various structural motifs, such as RNA recognition motif (RRM), dsRNA binding domain, zinc finger and others. They are cytoplasmic and nuclear proteins. However, since most mature RNA is exported from the nucleus relatively quickly, most RBPs in the nucleus exist as complexes of protein and pre-mRNA called heterogeneous ribonucleoprotein particles (hnRNPs). RBPs have crucial roles in various cellular processes such as: cellular function, transport and localization. They especially play a major role in post-transcriptional control of RNAs, such as: splicing, polyadenylation, mRNA stabilization, mRNA localization and translation. Eukaryotic cells express diverse RBPs with unique RNA-binding activity and protein–protein interaction. According to the Eukaryotic RBP Database (EuRBPDB), there are 2961 genes encoding RBPs in humans. During evolution, the diversity of RBPs greatly increased with the increase in the number of introns. Diversity enabled eukaryotic cells to utilize RNA exons in various arrangements, giving rise to a unique RNP (ribonucleoprotein) for each RNA. Although RBPs have a crucial role in post-transcriptional regulation in gene expression, relatively few RBPs have been studied systematically.It has now become clear that RNA–RBP interactions play important roles in many biological processes among organisms.
Ataxin-1 is a DNA-binding protein which in humans is encoded by the ATXN1 gene.
Ataxin 7 (ATXN7) is a protein of the SCA7 gene, which contains 892 amino acids with an expandable poly(Q) region close to the N-terminus. The expandable poly(Q) motif region in the protein contributes crucially to spinocerebellar ataxia (SCA) pathogenesis by the induction of intranuclear inclusion bodies. ATXN7 is associated with both olivopontocerebellar atrophy type 3 (OPCA3) and spinocerebellar ataxia type 7 (SCA7).
An exonic splicing silencer (ESS) is a short region of an exon and is a cis-regulatory element. A set of 103 hexanucleotides known as FAS-hex3 has been shown to be abundant in ESS regions. ESSs inhibit or silence splicing of the pre-mRNA and contribute to constitutive and alternate splicing. To elicit the silencing effect, ESSs recruit proteins that will negatively affect the core splicing machinery.
Ataxin-2 is a protein that in humans is encoded by the ATXN2 gene. Mutations in ATXN2 cause spinocerebellar ataxia type 2 (SCA2).
Ataxin-3 is a protein that in humans is encoded by the ATXN3 gene.
CUGBP, Elav-like family member 2, also known as Etr-3 is a protein that in humans is encoded by the CELF2 gene.
RNA-binding motif 10 is a protein that is encoded by the RBM10 gene. This gene maps on the X chromosome at Xp11.23 in humans. RBM10 is a regulator of alternative splicing. Alternative splicing is a process associated with gene expression to produce multiple protein isoforms from a single gene, thereby creating functional diversity and cellular complexity. RBM10 influences the expression of many genes, participating in various cellular processes and pathways such as cell proliferation and apoptosis. Its mutations are associated with various human diseases such as TARP syndrome, an X-linked congenital disorder in males resulting in pre‐ or postnatal lethality, and various cancers in adults.
Ataxin-2-like protein was initially identified in 1996 and designated Ataxin-2 Related protein (A2RP) as the search for the gene causing SCA2 lead to the identification of 2 cDNA clones with high similarity to ATXN2. It was later renamed as ATXN2L. It is a protein that in humans is encoded by the ATXN2L gene.
RNA binding motif protein 9 (RBM9), also known as Rbfox2, is a protein which in humans is encoded by the RBM9 gene.
Autosomal dominant cerebellar ataxia (ADCA) is a form of spinocerebellar ataxia inherited in an autosomal dominant manner. ADCA is a genetically inherited condition that causes deterioration of the nervous system leading to disorder and a decrease or loss of function to regions of the body.
Messenger RNP is mRNA with bound proteins. mRNA does not exist "naked" in vivo but is always bound by various proteins while being synthesized, spliced, exported, and translated in the cytoplasm.
Spinocerebellar ataxia type 1 (SCA1) is a rare autosomal dominant disorder, which, like other spinocerebellar ataxias, is characterized by neurological symptoms including dysarthria, hypermetric saccades, and ataxia of gait and stance. This cerebellar dysfunction is progressive and permanent. First onset of symptoms is normally between 30 and 40 years of age, though juvenile onset can occur. Death typically occurs within 10 to 30 years from onset.
ZTTK syndrome is a rare multisystem disease caused in humans by a genetic mutation of the SON gene. Common symptoms include developmental delay and often mild to severe intellectual disability.
Diane Lipscombe is a British neuroscientist who is a professor of neuroscience and the Reliance Dhirubhai Ambani Director of Brown University’s Robert J. and Nancy D. Carney Institute for Brain Science. She served as the president of the Society for Neuroscience in 2019, the world’s largest organization for the study of the brain and nervous system.