T-box transcription factor T, also known as Brachyury protein, is encoded for in humans and other apes by the TBXT gene. [5] [6] [7] Brachyury functions as a transcription factor within the T-box family of genes. [8] Brachyury homologs have been found in all bilaterian animals that have been screened, as well as the freshwater cnidarian Hydra. [8]
The brachyury mutation was first described in mice by Nadezhda Alexandrovna Dobrovolskaya-Zavadskaya in 1927 as a mutation that affected tail length and sacral vertebrae in heterozygous animals. In homozygous animals, the brachyury mutation is lethal at around embryonic day 10 due to defects in mesoderm formation, notochord differentiation and the absence of structures posterior to the forelimb bud (Dobrovolskaïa-Zavadskaïa, 1927). The name brachyury comes from the Greek brakhus meaning short and oura meaning tail.
In 2018, HGNC updated the human gene name from T to TBXT, presumably to overcome difficulties associated with searching for a single letter gene symbol.
Tbxt was cloned by Bernhard Herrmann and colleagues [9] and proved to encode a 436 amino acid embryonic nuclear transcription factor. Tbxt binds to a specific DNA element, a near palindromic sequence TCACACCT through a region in its N-terminus, called the T-box. Tbxt is the founding member of the T-box family which in mammals currently consists of 18 T-box genes.
The crystal structure of the human brachyury protein was solved in 2017 by Opher Gileadi and colleagues at the Structural Genomics Consortium in Oxford. [10]
The gene brachyury appears to have a conserved role in defining the midline of a bilaterian organism, [11] and thus the establishment of the anterior-posterior axis; this function is apparent in chordates and molluscs. [12] Its ancestral role, or at least the role it plays in the Cnidaria, appears to be in defining the blastopore. [8] It also defines the mesoderm during gastrulation. [13] Tissue-culture based techniques have demonstrated one of its roles may be in controlling the velocity of cells as they leave the primitive streak. [14] [15] It effects transcription of genes required for mesoderm formation and cellular differentiation.[ clarification needed ]
Brachyury has also been shown to help establish the cervical vertebral blueprint during fetal development. The number of cervical vertebrae is highly conserved among all mammals; however, a spontaneous vertebral and spinal dysplasia (VSD) mutation in this gene has been associated with the development of six or fewer cervical vertebrae instead of the usual seven. [16]
In mice, T is expressed in the inner cell mass of the blastocyst stage embryo (but not in the majority of mouse embryonic stem cells) followed by the primitive streak (see image). In later development, expression is localised to the node and notochord.
In Xenopus laevis , Xbra (the XenopusT homologue, also recently renamed t) is expressed in the mesodermal marginal zone of the pre-gastrula embryo followed by localisation to the blastopore and notochord at the mid-gastrula stage.
The Danio rerio ortholog is known as ntl (no tail).
TBXT is a transcription factor observed in vertebrate organisms. As such, it is primarily responsible for the genotype that codes for tail formation due to its observed role in axial development and the construction of posterior mesoderm within the lumbar and sacral regions. [17] [13] TBXT transcribes genes that form notochord cells, which are responsible for the flexibility, length, and balance of the spine, including tail vertebrae. [18] Because of the role that the transcription factor plays in spinal development, it is cited as being the protein that is primarily responsible for tail development in mammals. [5] [19] However, due to being a genetically-induced phenotype, it is possible for tail-encoding material to be effectively silenced by mutation. This is the mechanism by which the ntl ortholog developed in the hominidae taxa.
In particular, an Alu element in TBXT is responsible for the taillessness (ntl) ortholog. An Alu element is evolved, mobile RNA that is exclusively in primates. These elements are capable of mobilizing around a genome, making Alu elements transposons. [20] The Alu element that is observed to catalyze taillessness in TBXT is AluY. [21] [22] While normally Alu elements are not individually impactful, the presence of another Alu element active in TBXT, AluSx1, is coded such that its nucleotides are the inverse of AluY’s. Because of this, the two elements are paired together in the replication process, leading up to the formation of a stem-loop structure and an alternative splicing event that fundamentally influences transcription. [23] The structure isolates and positions codons held between the two Alu elements in a hairpin-esque loop that consequently cannot be paired or transcribed. The trapped material, most notably, includes the 6th exon that codes in TBXT. [21] [24] In a stem-loop structure, genetic material trapped within the loop is recognized by transcription-coupled nucleotide excision repair (TC NER) proteins as damage due to RNA polymerase being ostensibly stalled at the neck of the loop. This is also how lesions are able to occur at all–the stalled transcription process serves as a beacon for TC NER proteins to ascertain the location of the stem-loop. [25] Once TBXT is cleaved, trapped nucleotides–including exon 6–are excised from the completed transcription process by the TC NER mechanisms. Because of the resulting excision of exon 6, information contained within the exon is, too, removed from transcription. Consequently, it is posited that the material stored in exon 6 is, in part, responsible for full hominid tail growth. [21] [24]
As a result of the effect on TBXT's tail-encoding material that AluY has alongside AluSx1, isoform TBXT-Δexon6 is created. [21] [26] Isoforms are often a result of mutation, polymorphism, and recombination, and happen to share often highly similar functions to the proteins they derive from. However often they can have some key differences due to either containing added instructions or missing instructions the original protein is known to possess. [27] TBXT-Δexon6 falls into this category, as it is an isoform that lacks the ability to process the code that enables proper tail formation in TBXT-containing organisms. This is because exon 6's material that helps encode for tail formation is excised from the contents of the transcribed RNA. As a result, it is effectively missing in the isoform, and is thus the key factor in determining the isoform's name. Other common examples of influential isoforms include those involved in AMP-induced protein kinase that insert phosphate groups into specific sites of the cell depending on the subunit. [28]
The first insertion of the AluY element occurred approximately 20-25 million years ago, with the earliest hominid ancestor known to exhibit this mutation being the Hominoidea family of apes. [21] Taillessness has become an overwhelmingly dominant phenotype, such that it contributes to speciation. Over time, the mutation occurred more regularly due to the influence of natural selection and fixation to stabilize and expand its presence in the ape gene pool prior to the eventual speciation of homo sapiens. [29] There are several potential reasons for why taillessness has become the standard phenotype in the Hominidae taxa that offset the genetically disadvantageous aspects of tail mitigation, but little is known with certainty. [22] Some experts hypothesize that taillessness contributes to a stronger, more upright stance. The stance observed by primates with a smaller lumbar is seen to be effective. Grounded mobility and maintaining balance in climbing are more feasible given the evenly distributed body weight observed in hominids. [30] The presence of an additional appendage can also mean another appendage for predators to grab, and one that also consumes energy to move and takes up more space.
Brachyury is implicated in the initiation and/or progression of a number of tumor types including chordoma, germ cell tumors, hemangioblastoma, GIST, lung cancer, small cell carcinoma of the lung, breast cancer, colon cancer, hepatocellular carcinoma, prostate cancer, and oral squamous carcinoma. [31]
In breast cancer, brachyury expression is associated with recurrence, metastasis and reduced survival. [32] [33] [34] [35] It is also associated with resistance to tamoxifen [36] and to cytotoxic chemotherapy. [32]
In lung cancer, brachyury expression is associated with recurrence and decreased survival. [37] [38] [39] [40] It is also associated with resistance to cytotoxic chemotherapy, [41] radiation, [42] and EGFR kinase inhibitors. [37]
In prostate cancer, brachyury expression is associated with Gleason score, perineural, invasion and capsular invasion. [43]
In addition to its role in common cancers, brachyury has been identified as a definitive diagnostic marker, key driver and therapeutic target for chordoma, a rare malignant tumor that arises from remnant notochordal cells lodged in the vertebrae. The evidence regarding brachyury's role in chordoma includes:
Brachyury is an important factor in promoting the epithelial–mesenchymal transition (EMT). Cells that over-express brachyury have down-regulated expression of the adhesion molecule E-cadherin, which allows them to undergo EMT. This process is at least partially mediated by the transcription factors AKT [49] and Snail. [19]
Overexpression of brachyury has been linked to hepatocellular carcinoma (HCC, also called malignant hepatoma), a common type of liver cancer. While brachyury is promoting EMT, it can also induce metastasis of HCC cells. Brachyury expression is a prognostic biomarker for HCC, and the gene may be a target for cancer treatments in the future. [49]
Research posits that there are some downsides that are more likely to occur in the embryonic stage due to the tailless mutation of TBXT-Δexon6. Exon 6's excision fundamentally affects the manner in which TBXT-encoded cells divide, distribute information, and form tissue because of how stem-loop sites create genetic instability. [25] [21] As such, it is seen by experts that tail loss has contributed to the existence and frequency of developmental defects in the neural tube and sacral region. Primarily, spina bifida and sacral agenesis are the most likely suspects due to their direct relation to lumbar development. [22] Spina bifida is an error in the build of the spinal neural tube, causing it to not fully close and leaving nerves exposed within the spinal cord. Sacral agenesis, on the other hand, is a series of physical malformations in the hips that result from the omission of sacral matter during the developmental process. Because both of these developmental disorders result in the displacement of organs and other bodily mechanisms, they are both directly related to outright malfunction of the kidney, bladder, and nervous system. [50] [51] This can lead to higher likelihood of diseases related to their functionality or infrastructure, such as neurogenic bladder dysfunction or hydrocephalus. [51]
Overexpression of brachyury may play a part in EMT associated with benign disease such as renal fibrosis. [19]
Because brachyury is expressed in tumors but not in normal adult tissues it has been proposed as a potential drug target with applicability across tumor types. In particular, brachyury-specific peptides are presented on HLA receptors of cells in which it is expressed, representing a tumor specific antigen. Various therapeutic vaccines have been developed which are intended to stimulate an immune response to brachyury expressing cells. [31]
Hepatocyte growth factor receptor is a protein that in humans is encoded by the MET gene. The protein possesses tyrosine kinase activity. The primary single chain precursor protein is post-translationally cleaved to produce the alpha and beta subunits, which are disulfide linked to form the mature receptor.
The epithelial–mesenchymal transition (EMT) is a process by which epithelial cells lose their cell polarity and cell–cell adhesion, and gain migratory and invasive properties to become mesenchymal stem cells; these are multipotent stromal cells that can differentiate into a variety of cell types. EMT is essential for numerous developmental processes including mesoderm formation and neural tube formation. EMT has also been shown to occur in wound healing, in organ fibrosis and in the initiation of metastasis in cancer progression.
The CD44 antigen is a cell-surface glycoprotein involved in cell–cell interactions, cell adhesion and migration. In humans, the CD44 antigen is encoded by the CD44 gene on chromosome 11. CD44 has been referred to as HCAM, Pgp-1, Hermes antigen, lymphocyte homing receptor, ECM-III, and HUTCH-1.
Mothers against decapentaplegic homolog 3 also known as SMAD family member 3 or SMAD3 is a protein that in humans is encoded by the SMAD3 gene.
Forkhead box protein C2 (FOXC2) also known as forkhead-related protein FKHL14 (FKHL14), transcription factor FKH-14, or mesenchyme fork head protein 1 (MFH1) is a protein that in humans is encoded by the FOXC2 gene. FOXC2 is a member of the fork head box (FOX) family of transcription factors.
p16, is a protein that slows cell division by slowing the progression of the cell cycle from the G1 phase to the S phase, thereby acting as a tumor suppressor. It is encoded by the CDKN2A gene. A deletion in this gene can result in insufficient or non-functional p16, accelerating the cell cycle and resulting in many types of cancer.
High-mobility group AT-hook 2, also known as HMGA2, is a protein that, in humans, is encoded by the HMGA2 gene.
The basal-like carcinoma is a recently proposed subtype of breast cancer defined by its gene expression and protein expression profile.
Paired box gene 8, also known as PAX8, is a protein which in humans is encoded by the PAX8 gene.
Forkhead box C1, also known as FOXC1, is a protein which in humans is encoded by the FOXC1 gene.
Integrin beta-6 is a protein that in humans is encoded by the ITGB6 gene. It is the β6 subunit of the integrin αvβ6. Integrins are αβ heterodimeric glycoproteins which span the cell’s membrane, integrating the outside and inside of the cell. Integrins bind to specific extracellular proteins in the extracellular matrix or on other cells and subsequently transduce signals intracellularly to affect cell behaviour. One α and one β subunit associate non-covalently to form 24 unique integrins found in mammals. While some β integrin subunits partner with multiple α subunits, β6 associates exclusively with the αv subunit. Thus, the function of ITGB6 is entirely associated with the integrin αvβ6.
Epithelial cell adhesion molecule (EpCAM), also known as CD326 among other names, is a transmembrane glycoprotein mediating Ca2+-independent homotypic cell–cell adhesion in epithelia. EpCAM is also involved in cell signaling, migration, proliferation, and differentiation. Additionally, EpCAM has oncogenic potential via its capacity to upregulate c-myc, e-fabp, and cyclins A & E. Since EpCAM is expressed exclusively in epithelia and epithelial-derived neoplasms, EpCAM can be used as diagnostic marker for various cancers. It appears to play a role in tumorigenesis and metastasis of carcinomas, so it can also act as a potential prognostic marker and as a potential target for immunotherapeutic strategies.
Zinc finger protein SNAI1 is a protein that in humans is encoded by the SNAI1 gene. Snail is a family of transcription factors that promote the repression of the adhesion molecule E-cadherin to regulate epithelial to mesenchymal transition (EMT) during embryonic development.
Zinc finger E-box-binding homeobox 1 is a protein that in humans is encoded by the ZEB1 gene.
Zinc finger protein SNAI2 is a transcription factor that in humans is encoded by the SNAI2 gene. It promotes the differentiation and migration of certain cells and has roles in initiating gastrulation.
T-box transcription factor TBX3 is a protein that in humans is encoded by the TBX3 gene.
ETS homologous factor is a protein that in humans is encoded by the EHF gene. This gene encodes a protein that belongs to an ETS transcription factor subfamily characterized by epithelial-specific expression (ESEs). The encoded protein acts as a transcriptional repressor and may be associated with asthma susceptibility. This protein may be involved in epithelial differentiation and carcinogenesis.
Cadherin-1 or Epithelial cadherin(E-cadherin), is a protein that in humans is encoded by the CDH1 gene. Mutations are correlated with gastric, breast, colorectal, thyroid, and ovarian cancers. CDH1 has also been designated as CD324. It is a tumor suppressor gene.
TOX high mobility group box family member 3, also known as TOX3, is a human gene.
Vestigial-like family member 3 is a protein that in humans is encoded by the VGLL3 gene.