The PAX3 (paired box gene 3) gene encodes a member of the paired box or PAX family of transcription factors. [5] The PAX family consists of nine human (PAX1-PAX9) and nine mouse (Pax1-Pax9) members arranged into four subfamilies. Human PAX3 and mouse Pax3 are present in a subfamily along with the highly homologous human PAX7 and mouse Pax7 genes. The human PAX3 gene is located in the 2q36.1 chromosomal region, and contains 10 exons within a 100 kb region.
Alternative splicing and processing generates multiple PAX3 isoforms that have been detected at the mRNA level. [6] PAX3e is the longest isoform and consists of 10 exons that encode a 505 amino acid protein. In other mammalian species, including mouse, the longest mRNAs correspond to the human PAX3c and PAX3d isoforms, which consist of the first 8 or 9 exons of the PAX3 gene, respectively. Shorter PAX3 isoforms include mRNAs that skip exon 8 (PAX3g and PAX3h) and mRNAs containing 4 or 5 exons (PAX3a and PAX3b). In limited studies comparing isoform expression, PAX3d is expressed at the highest levels. From a functional standpoint, PAX3c, PAX3d, and PAX3h stimulate activities such as cell growth whereas PAX3e and PAX3g inhibit these activities, and PAX3a and PAX3b show no activity or inhibit these endpoints.
A common alternative splice affecting the PAX3 mRNA involves the sequence CAG at the 5’ end of exon 3. [7] This splice either includes or excludes these three bases, thus resulting in the presence or absence of a glutamine residue in the paired box motif. Limited sequencing studies of full-length human cDNAs identified this splicing event as a variant of the PAX3d isoform, and this spliced isoform has been separately termed the PAX3i isoform. The Q+ and Q− isoforms of PAX3 are generally co-expressed in cells. At the functional level, the Q+ isoform shows similar or less DNA binding and transcriptional activation than the Q− isoform.
PAX3 encodes a transcription factor with an N-terminal DNA binding domain consisting of a paired box (PD) encoded by exons 2, 3, and 4, and an octapeptide and complete homeodomain (HD) encoded by exons 5 and 6. [8] In addition, the PAX3 protein has a C-terminal transcriptional activation domain encoded by exons 7 and 8. The highly conserved PD consists of a 128 amino acid region that binds to DNA sequences related to the TCACGC/G motif. [9] The HD motif usually consists of 60 amino acids and binds to sequences containing a TAAT core motif. [10] The combination of these two DNA binding domains enable the PAX3 protein to recognize longer sequences containing PD and HD binding sites. [11] In the C-terminus of PAX3, there is a proline, serine and threonine (PST)-rich region measuring 78 amino acids that functions to stimulate transcriptional activity. [12] There are also transcriptional repression domains in the HD and N-terminal region (including the first half of the PD) that repress the C-terminal transcriptional activation domain. [13]
PAX3 functions as a transcriptional activator for most target genes, but also may repress a smaller subset of target genes. [14] These expression changes are effected through binding of PAX3 to specific recognition sites, which are situated in various genomic locations. [15] Some binding sites are located in or near target genes, such as the 5’ promoter, first intron and 3’ untranslated region. A substantial number of PAX3 binding sites are located at larger distances upstream and downstream of target genes. Among the PAX3 target genes, there is one group associated with muscle development and a second group associated with neural and melanocyte development. The proteins encoded by these target genes regulate various functional activities in these lineages, including differentiation, proliferation, migration, adhesion, and apoptosis.
PAX3 interacts with other nuclear proteins, which modulate PAX3 transcriptional activity. Dimerization of PAX3 with another PAX3 molecule or a PAX7 molecule enables binding to a palindromic HD binding site (TAATCAATTA). [16] Interaction of PAX3 with other transcription factors (such as SOX10) or chromatin factors (such as PAX3/7BP) enables synergistic activation of PAX3 target genes. [17] [18] In contrast, binding of PAX3 to co-repressors, such as calmyrin, inhibits activation of PAX3 target genes. [19] These co-repressors may function by altering chromatin structure at target genes, inhibiting PAX3 recognition of its DNA binding site or directly altering PAX3 transcriptional activity.
Finally, PAX3 protein expression and function can be modulated by post-translational modifications. PAX3 can be phosphorylated at serines 201, 205 and 209 by kinases such as GSK3b, which in some settings will increase PAX3 protein stability. [20] [21] In addition, PAX3 can also undergo ubiquitination and acetylation at lysines 437 and 475, which regulates protein stability and function. [22] [23]
Table 1. Representative PAX3 transcriptional target genes.
Protein category | Name | Phenotypic Activity |
Cell adhesion molecule | NRCAM | Intercellular adhesion |
Chemokine receptor | CXCR4 | Motility |
Receptor tyrosine kinase | FGFR4 | Proliferation, differentiation, migration |
MET | Proliferation, migration, survival | |
RET | Proliferation, migration, differentiation | |
Transcription factor | MITF | Differentiation, proliferation, survival |
MYF5 | Differentiation | |
MYOD1 | Differentiation |
During development, one of the major lineages expressing Pax3 is the skeletal muscle lineage. [24] Pax3 expression is first seen in the pre-somitic paraxial mesoderm, and then ultimately becomes restricted to the dermomyotome, which forms from the dorsal region of the somites. To form skeletal muscle in central body segments, PAX3-expressing cells detach from the dermomyotome and then Pax3 expression is turned off as Myf5 and MyoD1 expression is activated. To form other skeletal muscles, Pax3-expressing cells detach from the dermomyotome and migrate to more distant sites, such as the limbs and diaphragm. A subset of these Pax3-expressing dermomyotome-derived cells also serves as an ongoing progenitor pool for skeletal muscle growth during fetal development. During later developmental stages, myogenic precursors expressing Pax3 and/or Pax7 form satellite cells within the skeletal muscle, which contribute to postnatal muscle growth and muscle regeneration. These adult satellite cells remain quiescent until injury occurs, and then are stimulated to divide and regenerate the injured muscle.
Pax3 is also involved in the development of the nervous system. [25] Expression of Pax3 is first detected in the dorsal region of the neural groove and, as this neural groove deepens to form the neural tube, Pax3 is expressed in the dorsal portion of the neural tube. As the neural tube enlarges, Pax3 expression is localized to proliferative cells in the inner ventricular zone and then this expression is turned off as these cells migrate to more superficial regions. Pax3 is expressed along the length of the neural tube and throughout much of the developing brain, and this expression is subsequently turned off during later developmental stages in a rostral to caudal direction.
During early development, Pax3 expression also occurs at the lateral and posterior margins of the neural plate, which is the region from which the neural crest arises. [26] Pax3 is later expressed by various cell types and structures arising from the neural crest, such as melanoblasts, Schwann cell precursors, and dorsal root ganglia. In addition, Pax3-expressing cells derived from the neural crest contribute to the formation of other structures, such as the inner ear, mandible and maxilla. [27]
Pax3 controls the location of the nasion (a facial feature between the eyes and at the top of the nose), and is associated with the presence of a unibrow. [28]
Germline mutations of the Pax3 gene cause the splotch phenotype in mice. [29] [30] At the molecular level, this phenotype is caused by point mutations or deletions that alter or abolish Pax3 transcriptional function. In the heterozygous state, the splotch phenotype is characterized by white patches in the belly, tail and feet. These white spots are attributed to localized deficiencies in pigment-forming melanocytes resulting from neural crest cell defects. In the homozygous state, these Pax3 mutations cause embryonic lethality, which is associated with prominent neural tube closure defects and abnormalities of neural crest-derived structures, such as melanocytes, dorsal root ganglia and enteric ganglia. Heart malformations also result from the loss of cardiac neural crest cells, which normally contribute to the cardiac outflow tract and innervation of the heart. Finally, limb musculature does not develop in the homozygotes and axial musculature demonstrates varying abnormalities. These myogenic effects are caused by increased cell death of myogenic precursors in the dermomyotome and diminished migration from the dermomyotome.
Germline mutations of the PAX3 gene occur in the human disease Waardenburg syndrome, [31] which consists of four autosomal dominant genetic disorders (WS1, WS2, WS3 and WS4). [32] Of the four subtypes, WS1 and WS3 are usually caused by PAX3 mutations. All four subtypes are characterized by hearing loss, eye abnormalities and pigmentation disorders. In addition, WS1 is frequently associated with a midfacial alteration called dystopia canthorum, while WS3 (Klein-Waardenburg syndrome) is frequently distinguished by musculoskeletal abnormalities affecting the upper limbs. Most WS1 cases are caused by heterozygous PAX3 mutations while WS3 is caused by either partial or total deletion of PAX3 and contiguous genes or by smaller PAX3 mutations in the heterozygous or homozygous state. These PAX3 mutations in WS1 and WS3 include missense, nonsense and splicing mutations; small insertions; and small or gross deletions. Though these changes are usually not recurrent, the mutations generally occur in exons 2 through 6 with exon 2 mutations being most common. As these exons encode the paired box and homeodomain, these mutations often affect DNA binding function.
This section needs more reliable medical references for verification or relies too heavily on primary sources .(January 2018) |
Alveolar rhabdomyosarcoma (ARMS) is an aggressive soft tissue sarcoma that occurs in children and is usually characterized by a recurrent t(2;13)(q35;q14) chromosomal translocation. [33] [34] This 2;13 translocation breaks and rejoins portions of the PAX3 and FOXO1 genes to generate a PAX3-FOXO1 fusion gene that expresses a PAX3-FOXO1 fusion transcript encoding a PAX3-FOXO1 fusion protein. [35] PAX3 and FOXO1 encode transcription factors, and the translocation results in a fusion transcription factor containing the N-terminal PAX3 DNA-binding domain and the C-terminal FOXO1 transactivation domain. A smaller subset of ARMS cases is associated with less common fusions of PAX7 to FOXO1 or rare fusions of PAX3 to other transcription factors, such as NCOA1. [36] [37] Compared to the wild-type PAX3 protein, the PAX3-FOXO1 fusion protein more potently activates PAX3 target genes. [13] In ARMS cells, PAX3-FOXO1 usually functions as a transcriptional activator and excessively increases expression of downstream target genes. [38] [39] In addition, PAX3-FOXO1 binds along with MYOD1, MYOG and MYCN as well as chromatin structural proteins, such as CHD4 and BRD4, to contribute to the formation of super enhancers in the vicinity of a subset of these target genes. [40] These dysregulated target genes contribute to tumorigenesis by altering signaling pathways that affect proliferation, cell death, myogenic differentiation, and migration.
A t(2;4)(q35;q31.1) chromosomal translocation that fuses the PAX3 and MAML3 genes occurs in biphenotypic sinonasal sarcoma (BSNS), a low-grade adult malignancy associated with both myogenic and neural differentiation. [41] MAML3 encodes a transcriptional coactivator involved in Notch signaling. The PAX3-MAML3 fusion juxtaposes the N-terminal PAX3 DNA binding domain with the C-terminal MAML3 transactivation domain to create another potent activator of target genes with PAX3 binding sites. Of note, PAX3 is rearranged without MAML3 involvement in a smaller subset of BSNS cases, and some of these variant cases contain a PAX3-NCOA1 or PAX3-FOXO1 fusion. [42] [43] Though PAX3-FOXO1 and PAX3-NCOA1 fusions can be formed in both ARMS and BSNS, there are differences in the pattern of activated downstream target genes suggesting that the cell environment has an important role in modulating the output of these fusion transcription factors.
In addition to tumors with PAX3-related fusion genes, there are several other tumor categories that express the wild-type PAX3 gene. The presence of PAX3 expression in some tumors can be explained by their derivation from developmental lineages normally expressing wild-type PAX3. For example, PAX3 is expressed in cancers associated with neural tube-derived lineages, (e.g., glioblastoma), neural crest-derived lineages (e.g., melanoma) and myogenic lineages (e.g., embryonal rhabdomyosarcoma). [44] [45] [46] [47] However, PAX3 is also expressed in other cancer types without a clear relationship to a PAX3-expressing developmental lineages, such as breast carcinoma and osteosarcoma. [48] [49] In these wild-type PAX3-expressing cancers, PAX3 function impacts on the control of proliferation, apoptosis, differentiation and motility. [44] [45] Therefore, wild-type PAX3 exerts a regulatory role in tumorigenesis and tumor progression, which may be related to its role in normal development.
The 2018 version of this article was updated by an external expert under a dual publication model. The corresponding academic peer reviewed article was published in Gene and can be cited as: Salah Boudjadi; Bishwanath Chatterjee; Wenyue Sun; Prasantha Vemu; Frederic G. Barr (5 August 2018). "The expression and function of PAX3 in development and disease". Gene . Gene Wiki Review Series. 666: 145–157. doi:10.1016/J.GENE.2018.04.087. ISSN 0378-1119. PMC 6624083 . PMID 29730428. Wikidata Q65666079. |
Alternative splicing, or alternative RNA splicing, or differential splicing, is an alternative splicing process during gene expression that allows a single gene to code for multiple proteins. In this process, particular exons of a gene may be included within or excluded from the final, processed messenger RNA (mRNA) produced from that gene. This means the exons are joined in different combinations, leading to different (alternative) mRNA strands. Consequently, the proteins translated from alternatively spliced mRNAs usually contain differences in their amino acid sequence and, often, in their biological functions.
Rhabdomyosarcoma (RMS) is a highly aggressive form of cancer that develops from mesenchymal cells that have failed to fully differentiate into myocytes of skeletal muscle. Cells of the tumor are identified as rhabdomyoblasts.
L1, also known as L1CAM, is a transmembrane protein member of the L1 protein family, encoded by the L1CAM gene. This protein, of 200-220 kDa, is a neuronal cell adhesion molecule with a strong implication in cell migration, adhesion, neurite outgrowth, myelination and neuronal differentiation. It also plays a key role in treatment-resistant cancers due to its function. It was first identified in 1984 by M. Schachner who found the protein in post-mitotic mice neurons.
A neurula is a vertebrate embryo at the early stage of development in which neurulation occurs. The neurula stage is preceded by the gastrula stage; consequentially, neurulation is preceded by gastrulation. Neurulation marks the beginning of the process of organogenesis.
In genetics, a silencer is a DNA sequence capable of binding transcription regulation factors, called repressors. DNA contains genes and provides the template to produce messenger RNA (mRNA). That mRNA is then translated into proteins. When a repressor protein binds to the silencer region of DNA, RNA polymerase is prevented from transcribing the DNA sequence into RNA. With transcription blocked, the translation of RNA into proteins is impossible. Thus, silencers prevent genes from being expressed as proteins.
In evolutionary developmental biology, Paired box (Pax) genes are a family of genes coding for tissue specific transcription factors containing an N-terminal paired domain and usually a partial, or in the case of four family members, a complete homeodomain to the C-terminus. An octapeptide as well as a Pro-Ser-Thr-rich C terminus may also be present. Pax proteins are important in early animal development for the specification of specific tissues, as well as during epimorphic limb regeneration in animals capable of such.
Paired box protein Pax-6, also known as aniridia type II protein (AN2) or oculorhombin, is a protein that in humans is encoded by the PAX6 gene.
Myogenesis is the formation of skeletal muscular tissue, particularly during embryonic development.
Fibroblast growth factor receptor 1 (FGFR1), also known as basic fibroblast growth factor receptor 1, fms-related tyrosine kinase-2 / Pfeiffer syndrome, and CD331, is a receptor tyrosine kinase whose ligands are specific members of the fibroblast growth factor family. FGFR1 has been shown to be associated with Pfeiffer syndrome, and clonal eosinophilias.
The steroidogenic factor 1 (SF-1) protein is a transcription factor involved in sex determination by controlling the activity of genes related to the reproductive glands or gonads and adrenal glands. This protein is encoded by the NR5A1 gene, a member of the nuclear receptor subfamily, located on the long arm of chromosome 9 at position 33.3. It was originally identified as a regulator of genes encoding cytochrome P450 steroid hydroxylases, however, further roles in endocrine function have since been discovered.
Alveolar rhabdomyosarcoma (ARMS) is a subtype of the rhabdomyosarcoma soft tissue cancer family whose lineage is from mesenchymal cells and are related to skeletal muscle cells. ARMS tumors resemble the alveolar tissue in the lungs. Tumor location varies from patient to patient, but is commonly found in the head and neck region, male and female urogenital tracts, the torso, and extremities. Two fusion proteins can be associated with ARMS, but are not necessary, PAX3-FKHR. and PAX7-FKHR. In children and adolescents ARMS accounts for about 1 percent of all malignancies, has an incidence rate of 1 per million, and most cases occur sporadically with no genetic predisposition. PAX3-FOXO1 is now known to drive cancer-promoting gene expression programs through creation of distant genetic elements called super enhancers.
Runt-related transcription factor 1 (RUNX1) also known as acute myeloid leukemia 1 protein (AML1) or core-binding factor subunit alpha-2 (CBFA2) is a protein that in humans is encoded by the RUNX1 gene.
Protein C-ets-1 is a protein that in humans is encoded by the ETS1 gene. The protein encoded by this gene belongs to the ETS family of transcription factors.
Malignant ectomesenchymoma(MEM) is a rare, fast-growing tumor of the nervous system or soft tissue that occurs in children and young adults. MEM is part of a group of small round blue cell tumors which includes neuroblastoma, rhabdomyosarcoma, non-Hodgkin's lymphoma, and the Ewing's family of tumors.
Paired box gene 8, also known as PAX8, is a protein which in humans is encoded by the PAX8 gene.
Cux1 is a homeodomain protein that in humans is encoded by the CUX1 gene.
Paired box protein Pax-7 is a protein that in humans is encoded by the PAX7 gene.
Forkhead box protein O1 (FOXO1), also known as forkhead in rhabdomyosarcoma (FKHR), is a protein that in humans is encoded by the FOXO1 gene. FOXO1 is a transcription factor that plays important roles in regulation of gluconeogenesis and glycogenolysis by insulin signaling, and is also central to the decision for a preadipocyte to commit to adipogenesis. It is primarily regulated through phosphorylation on multiple residues; its transcriptional activity is dependent on its phosphorylation state.
Embryonal rhabdomyosarcoma (EMRS) is a rare histological form of cancer in the connective tissue wherein the mesenchymally-derived malignant cells resemble the primitive developing skeletal muscle of the embryo. It is the most common soft tissue sarcoma occurring in children. Embryonal rhabdomyosarcoma is also known as PAX-fusion negative or fusion-negative rhabdomyosarcoma, as tumors of this subtype are unified by their lack of a PAX3-FOXO1 fusion oncogene. Fusion status refers to the presence or absence of a fusion gene, which is a gene formed from joining two different genes together through DNA rearrangements. These types of tumors are classified as embryonal rhabdomyosarcoma "because of their remarkable resemblance to developing embryonic and fetal skeletal muscle."
Gooseberry (gsb) is a segment polarity gene located on chromosome 2 of the Drosophila genome. Gooseberry is known for its interactions with key embryonic signaling pathways Wingless and Hedgehog. The gene also has clinal significance, being linked to diseases such as Waardenburg Syndrome and rhabdomyosarcoma.