TEX9 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | TEX9 , testis expressed 9 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1201610 HomoloGene: 32072 GeneCards: TEX9 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. [5] The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. [6] The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities. [5]
TEX9 is located at 15q21.3 and has 18 exons. [5] However, some exons overlap; therefore, there are only 13 distinguishable exons in the human genome. [7] TEX9 is on the sense strand and spans from base 56,365,573 to 56,428,441. TEX9 is located in the gene neighborhood of CD24P2, RFX7, MNS1, and HMGB1P33. [5] [8]
The promoter for TEX9 was determined using 19 supporting transcripts to be GXP_7531542, spanning from base 56,364,254 to base 56,365,775 on the sense strand of chromosome 15. [9] A number of transcription factors with a matrix similarity greater than or equal to 0.780 that are predicted to regulate transcription of TEX9 are listed below with their respective binding site:
Transcription Factor | Binding Site | Strand |
---|---|---|
Estrogen response elements (ER alpha) | ATTGGTCAGGCTGGTCTTG | + |
Retinoid receptor-related testis-associated receptor | CCGACCAGAACTTGAGGGT and TTGTAATTCAAGGTCATAA | - and + |
Hypermethylated in cancer 1 | CTCTGCCCAGCCT and CTTCACCCGTGAT | + and - |
T-box TF TBX21, dimeric binding site | TACTGCTTTTGGTGTCATATCTAAG | + |
Sine oculis homeobox homolog 4 | CTTTTGGTGTCATAT | + |
Ecotropic viral integration site 1 encoded factor, amino-terminal zinc finger domain | AAAACCACAGTATAGAT | - |
Estrogen-related receptor alpha | GAATTGTAATTCAAGGTCATAAA and AGTGATTTGCCCAAGG/CCATATA | + and + |
Regulatory factor X, 4 | TTAGGTCTTTGATACATT and AGCCATTGGCGCAGCGTCA | + and - |
Thyroid hormone receptor, beta | TCGAGGATTCAAATCCAGAAACT and CTGGTATGTAGTATAGTGCCA | - and - |
Homeodomain protein NKX3.2 | ACTGTGAAGTGGGCACTAT | + |
Lentivirus LTR TATA box | CCATATAACTGGTAAGT | + |
Cdx-2 mammalian caudal related intestinal TF | GTTCCGGTATATTGACCAT | - |
GA binding protein TF, alpha | CTCTCGCGGGAAGATGCGTCG | + |
Olfactory neuron-specific factor | ACCTTTGAGAGCGCCCTTCTACG | - |
Kidney-enriched kruppel-like factor | GAAGATGGCGGGGCGAAGT | + |
The expression of TEX9 is highest in the testis, followed by the thyroid, duodenum, and kidney, although other tissues have been shown to express TEX9. [5] TEX9 is expected to have a subcellular localization in the cytoplasm or nucleus. [10]
Isoform 1 of TEX9 has a 5' UTR region of 27 base pairs and a 3' UTR region of 356 base pairs. [11] The transcript is 1,559 base pairs long. [12]
Less common isoforms of TEX9 include isoforms: 2, X1, X2, X3, X4, X5, and X6. [5]
The theoretical molecular weight of the 391 amino acid TEX9 protein is 45kDa and the theoretical pI is 6. [13] However, the experimental molecular weight has been shown to be ~55kDa. [14]
The most pronounced domains in TEX9 are the two coiled-coil regions, which include amino acids 32-59 and 194-351. [15] Repetitive domains within the protein include ALEE (34-37 and 302-305) and EKYK (251-254 and 307-310). [16] TEX9 has more glutamate, lysine, and glutamine residues and less glycine residues compared to a typical human protein. [16]
TEX9 has been shown to be phosphorylated at tyrosine (Y) residues 85 and 264, and have a ubiquitylation site at the lysine (K) residue at 159. [12] It is predicted that there are multiple other phosphorylation, glycation, 0-beta-GlcNAc, and SUMO protein attachment sites. [17] [18] [19] [20]
TEX9's two coiled-coil regions make up its tertiary structure and can be visualized using the predicted structure from Phyre2. [10] Shown on the structure are the two known phosphorylated sites and one ubiquitylation site.
TEX9 has been experimentally determined to have interactions including coiled-coil containing 112 (CCDC112), chromosome 20 open reading frame 112 (C20orf112), and nucleolar protein 4 (NOL4). [21] Textmining has suggested that TEX9 also interacts with olfactory receptor, family 4, subfamily C, member 3, odorant receptor (OR4C3). [21] Other interactions include gene products of human genes NOL4-2 (at an unknown location), GOGA2 (in the cis-Golgi network membrane, spindle pole of cytoskeleton, and ER-Golgi intermediate compartment membrane), and KDM1A (in the nucleus). [13] Another proposed interaction between TEX9 involves attachment with the SUMO protein, which has a molecular weight of 11kDa. [20] The realized MW of TEX9 is 55kDa but the theoretical MW is 45kDa, which provides evidence for this interaction. [22]
TEX9 has homologs in over 260 other organisms, including vertebrates, invertebrates, archaea, and one species of bacteria. [5] TEX9 has been found in all clades of organisms except land plants. [23]
Genus species | Common Name | Taxonomic Group | Divergence (MYA) | Accession Number | Seq. Length (aa) | Corr. ID to HP (%) | Corr. Sim. To HP (%) |
Homo sapiens | Human | Hominini | 0 | NP_940926.1 | 391 | 100 | 100 |
Pan paniscus | Bonobo | Primate | 6.65 | XP_008951441.1 | 391 | 99 | 99 |
Loxodonta africana | African bush/savanna elephant | Mammal | 105 | XP_010596294.1 | 391 | 83 | 90 |
Apteryx rowi | Okarito (brown) kiwi | Bird | 312 | XP_025916696.1 | 422 | 61 | 74 |
Gekko japonicus | Calling gecko | Reptile | 312 | XP_015264647.1 | 359 | 49 | 63 |
Xenopus laevis | African clawed frog | Amphibian | 352 | XP_018108534.1 | 434 | 60 | 76 |
Astyanax mexicanus | Mexican tetra/blind cave fish | Bony fish | 432 | XP_007244936.2 | 394 | 51 | 68 |
Apostichopus japonicus | Japanese (spiky) sea cucumber | Echinodermata | 684 | PIK45906.1 | 404 | 43 | 56 |
Capitella teleta | Capitella | Annelida | 797 | ELT92672.1 | 257 | 34 | 45 |
Anoplophora glabripennis | Asian long-horned beetle | Mollusca | 797 | XP_018561745.1 | 259 | 18 | 31 |
Pocillopora damicornis | Cauliflower (lace) coral | Cnidaria | 824 | XP_027039795.1 | 387 | 42 | 60 |
Clonorchis sinensis | Chinese liver fluke | Platyhelminthes | 824 | RJW72461.1 | 952 | 27 | 41 |
Echinococcus multilocularis | Echinococcus | Platyhelminthes | 824 | CDS43228.1 | 299 | 18 | 33 |
Trichoplax sp. H2 | Trichoplax | Placozoa | 948 | RDD37208.1 | 451 | 30 | 44 |
Amphimedon queenslandica | Amphimedon | Porifera | 951.8 | XP_003384031.2 | 339 | 27 | 41 |
Spizellomyces punctatus DAOM BR117 | Spizellomyces | Chytrid (fungi) | 1105 | XP_016611327.1 | 373 | 31 | 52 |
Planoprotostelium fungivorum | Planoprotostelium | Amoebozoa (protist) | 1480 | PRP73397.1 | 373 | 12 | 18 |
Klebsormidium nitens | Klebsormidium | Charophyte (green algae) | 1496 | GAQ91967.1 | 345 | 29 | 45 |
Hondaea fermentalgiana | Hondaea | Stramenopiles (protist) | 1768 | GBG25987.1 | 379 | 22 | 35 |
Thecamonas trahens ATCC 50062 | Thecamonas | Apusozoa (protist) | 2101 | XP_013753981.1 | 324 | 13 | 23 |
Chlamydia trachomatis | Chlamydia | Bacteria | 4290 | CPS19605.1 | 72 | 14 | 14 |
The relative rate of change for TEX9 is fairly slow compared to fibrinogen and beta-globin, but not as slow as cytochrome c. [24]
TEX9 sequences that are most conserved between humans and other organisms are found within the two coiled-coil regions, where some amino acids are conserved in vertebrates, invertebrates, and microorganisms. The bacterial ortholog is most similar to vertebrates than invertebrates or microorganisms.
All of the orthologs of TEX9 are derived from the same common ancestor except the gene found in Chlamydia, which is thought to have transferred from humans into the bacterium. [25]
No diseases have been shown to be directly linked to TEX9, but some correlations have been found regarding estrogen receptor knockdown and increased TEX9 expression [26] as well as colorectal cancer cells with decreased TEX9 expression. [27]
Reduced expression of TEX9 has been shown to boost tumor growth in immunocompetent mice but not in immunocompromised mice. [28] This result suggested that TEX9 may function as a tumor antigen in some tumors. Mutations of the TEX9 protein have been found in 1-2% of tumors taken from certain cancers, including endometrial, head and neck, colorectal, and squamous lung. [29]
Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.
Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.
Glutamate rich protein 5 is a protein in humans encoded by the ERICH5 gene, also known as chromosome 8 open reading frame 47 (C8orf47).
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.
Neuroblastoma breakpoint family member 19, or NBPF19, is a protein that in humans is encoded by the NBPF19 gene. This protein is included in the neuroblastoma breakpoint family of proteins.
C2orf81 is a human gene encoding protein c2orf81, which is predicted to have nuclear localization.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.
KRBA1 is a protein that in humans is encoded by the KRBA1 gene. It is located on the plus strand of chromosome 7 from 149,411,872 to 149,431,664. It is also commonly known under two other aliases: KIAA1862 and KRAB A Domain Containing 1 gene and encodes the KRBA1 protein in humans. The KRBA family of genes is understood to encode different transcriptional repressor proteins
C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
ZNF839 or zinc finger protein 839 is a protein which in humans is encoded by the ZNF839 gene. It is located on the long arm of chromosome 14. Zinc finger protein 839 is speculated to pay a role in humoral immune response to cancer as a renal carcinoma antigen (NY-REN-50). This is because NY-REN-50 was found to be over expressed in cancer patients, especially those with renal carcinoma. Zinc finger protein 839 also plays a role in transcription regulation by metal-ion binding since it binds to DNA via C2H2-type zinc finger repeats.