TTC39C | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | TTC39C , C18orf17, HsT2697, tetratricopeptide repeat domain 39C | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1919997 HomoloGene: 124438 GeneCards: TTC39C | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Tetratricopeptide repeat protein 39C is a protein that in humans is encoded by the TTC39C gene. TTC39C is one of three TTC39. [5] Its function is currently unknown; however, there is some evidence suggesting that it plays a role in anaphase. [6] [7] It also contains a relatively well-characterized structural motif called the tetratricopeptide repeat (TPR).
TTC39C is located on the long arm of human chromosome 18 at 18q11.2. [8] Its most common aliases are FLJ33761, C18orf17, and HsT2697. [8]
The TTC39C protein is 583 amino acids long, and appears to be localized to the cytoplasm of the cell based on its predicted secondary structure [9] It has three isoforms, the longest of which is transcript variant 1. [8] It contains three tetratricopeptide repeats, as well as the domain of unknown function DUF3808. [5] The protein product has a molecular weight of 65.9 kDa and an isoelectric point of 6.584. [10] [11] There are several predicted phosphorylation, acetylation, and palmitoylation sites, which are shown in the table below. [12] Its whole predicted secondary structure is composed almost entirely of helices, and forms a tertiary structure that is predicted to match the structure of the cut9 protein in yeast with 100% accuracy and 85% coverage. [6] [13]
Type | Location [12] | Function [14] | Quality of Sites |
---|---|---|---|
Phosphorylation |
| Regulation of cell cycle, apoptosis, cell signaling, etc. | High |
Acetylation |
| Regulates gene expression | Moderate |
Palmitoylation |
| Anchors Protein to membrane | Moderate |
The TTC39C protein is over 50% conserved in most vertebrates, and has conservation levels as low as about 24% in invertebrates. [15] At least one ortholog and several homologs have also been identified in fungi. [8] No orthologs have been identified in plants. [8] Several of its orthologs are shown in the table below.
Scientific Name | Common Name | Mean Predicted Date of Divergence (MYA) [16] | Protein Length (aa) [15] | RNA NCBI Accession Number [15] | Protein NCBI Accession Number [15] | RNA Identity [15] | Protein Identity [15] |
---|---|---|---|---|---|---|---|
Callithrix jacchus | Marmoset | 42.6 | 522 | XM_002807499 | XP_002807545 | 42.2% | 99.4% |
Anolis carolinensis | Anole | 296.0 | 574 | XM_003219667 | XP_003219715 | 27.7% | 81.7% |
Gallus gallus | Chicken | 296.0 | 576 | XM_419163 | XP_419163 | 36.1% | 88.0% |
Xenopus tropicalis | Western Clawed Frog | 371.2 | 570 | NM_001113821 | NP_001107293 | 55.20% | 88.0% |
Oreochromis niloticus | Tilapia | 400.1 | 578 | XM_003444269 | XP_003444317 | 24.0% | 70.2% |
Apis mellifera | Honeybee | 782.7 | 877 | XM_001120231 | XP_001120231 | 38.7% | 24.3% |
Saccharomyces cerevisiae | Baker's Yeast | 1215.8 | 725 | NM_001179808 | NP_012943 | 29.7% | 17.6% |
TTC39C has two paralogs from the TTC39 family: TTC39A, and TTC39B, which are located on chromosome 1, at 1p32.3 and chromosome 9, at 9p22.3 respectively. TTC39B has been associated with the management of HDL cholesterol, and may be involved in the prevention of cardiovascular disease [17]
The promoter region controlling expression of transcript variant 1 begins approximately 952 bases upstream of the start codon, and includes the entire 5' UTR. [19] Several possible transcription factor binding sites have been identified using the program El Dorado including the CCCTC binding factor and a site for CTCF, an insulator protein that binds CCCTC. [19] This transcription factor is associated with a number of functions including organization of chromatin. [20] There were also several sites that appear to be the general transcription factor TFIIB, and both E2F and E2F transcription factor binding sites. [19] The E2F transcription factors are involved in mediating the cell cycle, which could be a potential link to the hypothesized role of TTC39C in anaphase. [21] Several microarray studies of humans, dogs and mice have provided evidence that TTC39C is most highly expressed in the liver. [18] [22] [23] It exhibits relatively high expression in all tissues, and had a percentile rank above 50% in all tissues except in kidney, spinal cord, and skeletal muscle samples of humans. [18]
The function of TTC39C is currently unknown. [8] However, one of its structural motifs, the tetratricopeptide repeat is relatively well characterized, and has been shown to be active primarily in four categories of protein-protein interactions: interacting with molecular chaperones, mediating the start of anaphase during cell division, transcription repression, and the transport of proteins. [7] Of these four areas of functionality, the most evidence exists for its involvement in the initiation of anaphase. Two of the three possible interacting proteins identified by STRING play a role in anaphase. [24] Finally, the protein that had a structure that appears to perfectly match 85% of the TTC39C sequence is involved in anaphase in the yeast, Schizosaccharomyces pombe . [6] However, the role of TTC39C in anaphase must be confirmed through additional studies.
Textmining studies have identified several proteins that TTC39C may interact with. [24] These protein-protein interactions have not been confirmed; however, two of the identified proteins, AGBL1 and HAUS4, are likely candidates due to their roles in anaphase. [24] Additional potential protein-protein interactions were identified for TTC39C in mice; however, there is no apparent connection to TTC39C besides coexpression.
Transcriptional repressor CTCF also known as 11-zinc finger protein or CCCTC-binding factor is a transcription factor that in humans is encoded by the CTCF gene. CTCF is involved in many cellular processes, including transcriptional regulation, insulator activity, V(D)J recombination and regulation of chromatin architecture.
Tetratricopeptide repeat 39A is a human protein encoded by the TTC39A gene. TTC39A is also known as DEME-6, KIAA0452, and c1orf34. The function of TTC39A is currently not well understood. The main feature within tetratricopeptide repeat 39A is the domain of unknown function 3808 (DUF3808), spanning almost the entire protein. KIAA0452 can also be seen as an isoform of TTC39A because of differences in genome sequence, but overlap in DUF domain.
Protein KIAA1958 is a protein that in humans is encoded by the KIAA1958 gene. Orthologs of KIAA1958 go as far back in evolution to chordates, although, it is closer in homology to primates than any other orthologs. KIAA1958 has no known paralogs.
POTE ankyrin domain family, member B is a protein in humans that is encoded by the POTEB gene.(Prostate, Ovary, Testes Expressed ankyrin domain family member B).It is most likely involved in mediating protein-protein interaction via its 5 ankyrin domains. POTEB is most probably aids in intracellular signaling, but is not likely to be a secreted or nuclear protein. POTEB's function is likely to be regulated via 17 potential phosphorylation sites. There is currently no evidence to suggest that POTEB has nuclear localization signals.
Protein FAM46B also known as family with sequence similarity 46 member B is a protein that in humans is encoded by the FAM46B gene. FAM46B contains one protein domain of unknown function, DUF1693. Yeast two-hybrid screening has identified three proteins that physically interact with FAM46B. These are ATX1, PEPP2 and DAZAP2.
Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.
Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.
Coiled-coil domain containing 94 (CCDC94) is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.
Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.
WW and C2 domain containing 2 (WWC2) is a protein that in humans is encoded by the WWC2 gene (4q35.1). Though function of WWC2 remains unknown, it has been predicted that WWC2 may play a role in cancer.
KIAA1841 is a gene in humans that encodes a protein known as KIAA1841. KIAA1841 is targeted for the nucleus and it predicted to play a role in regulating transcription.
EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.
Transmembrane protein 268 is a protein that in humans is encoded by TMEM268 gene. The protein is a transmembrane protein of 342 amino acids long with eight alternative splice variants. The protein has been identified in organisms from the common fruit fly to primates. To date, there has been no protein expression found in organisms simpler than insects.
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.
Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.
C22orf31 is a protein which in humans is encoded by the C22orf31 gene. The C22orf31 mRNA transcript has an upstream in-frame stop codon, while the protein has a domain of unknown function (DUF4662) spanning the majority of the protein-coding region. The protein has orthologs with high percent similarity in mammals. The most distant orthologs are found in species of bony fish, but C22orf31 is not found in any species of birds or amphibians.