TTC39A | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | TTC39A , C1orf34, DEME-6, Tetratricopeptide Repeat 39A, tetratricopeptide repeat domain 39A | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2444350 HomoloGene: 17739 GeneCards: TTC39A | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Tetratricopeptide repeat 39A is a human protein encoded by the TTC39A gene. [5] TTC39A is also known as DEME-6 (differentially expressed in MCF7 with estradiol protein 6), KIAA0452, and c1orf34. [5] The function of TTC39A is currently not well understood. The main feature within tetratricopeptide repeat 39A is the domain of unknown function 3808 (DUF3808), spanning almost the entire protein. [6] KIAA0452 can also be seen as an isoform of TTC39A because of differences in genome sequence, but overlap in DUF domain. [7]
The gene for TTC39A is located on the first chromosome at 1p32.3. [5] The genomic DNA is 57,859 bases long, consists of 19 exons and is located on the minus strand. [8] The mRNA has a length of 2759 bases. [8] TTC39A is surrounded by genes for RNF11 and EPS15. [6] Neither of these genes have a known interaction or similar function as TTC39A. [6] There are not any known medical disorders resulting from the deletion of TTC39A. [6]
The gene for KIAA0452 is also located on the first chromosome at 1p32.3. The genomic DNA is 34,096 bases long, consists of 11 exons and is located on the minus strand. [9] The mRNA has a length of 6256 bases. [10]
TTC39A has several orthologs that have been found in eukaryotes, archea, and bacteria. [8] Two paralogs, TTC39B and TTC39C, have also been identified. [8] The functions of TTC39B and TTC39C are not known. [8]
Species | Common name | Accession number | Length | Protein identity | Protein similarity |
---|---|---|---|---|---|
Homo sapiens | Human | NP_001073963.1 | 613aa | 100% | 100% |
Mus musculus | Mouse | NP_001139420.1 | 578aa | 87.9% | 91.0% |
Macaca mulatta | Rhesus monkey | XP_002801613.1 | 550aa | 92.0% | 93.0% |
Gallus gallus | Chicken | XP_422466.3 | 575aa | 71.3% | 81.0% |
Taeniopygia guttata | Zebra finch | XP_002193183 | 575aa | 73.0% | 82.0% |
Anolis carolinensis | Green anole | XP_003216860.1 | 570aa | 58.8% | 74.0% |
Danio rerio | Zebra fish | XP_002663143.1 | 598aa | 60.8% | 76.0% |
Branchiostoma floridae | Lancelet | XP_002591442.1 | 544aa | 41.1% | 61.0% |
Physcomitrella patens | Moss | XP_001766901.1 | 601aa | 24.2% | 41.0% |
Leishmania braziliensis | Euglenozoa | XP_001566995.1 | 584aa | 21.8% | 40.0% |
Aster yellows | Phytoplasma | AAO61940.1 | 299aa | 15.1% | 42.0% |
KIAA0452 has several similar orthologs as TTC39A from eukaryotes, achea and bacteria but different protein identities and similarities. [9]
Species | Common name | Accession number | Length | Protein identity | Protein similarity |
---|---|---|---|---|---|
Homo sapiens | Human | AB007921 | 453aa | 100% | 100% |
Homo sapiens | Human | BC028374.2 | 629aa | 99% | 99% |
Mus musculus | Mouse | AK172951.1 | 574aa | 85% | 87% |
Macaca mulatta | Rhesus monkey | XP_002801613 | 550aa | 88% | 88% |
Taeniopygia gutlata | Zebra finch | XP_002193183.2 | 549aa | 72% | 79% |
Anolis carolinensis | Green anole | XP_003216860.1 | 570aa | 56% | 69% |
Danio rerio | Zebra fish | XP_002663143 | 598aa | 60% | 71% |
Branchiostoma floridae | Lanclet | XP_002591442.1 | 544aa | 42% | 62% |
Physcomitrella patens | Moss | XP_001766901.1 | 601aa | 25% | 41% |
Leishmania braziliensis | Euglenoza | XP_001566995.1 | 584aa | 28% | 48% |
Aster yellows | Phytoplasma | AAO61940.1 | 299aa | 24% | 42% |
The predicted promoter for the TTC39A gene spans 713 bp from 51,796,048 to 51,796,760. [11]
A predicted promoter for KIAA0452 gene spans 600 bp from 51,796,517 to 51,797,117. [11]
Below are some of transcription factors that exist for the predicted the promoter sequence. [11]
Detailed family information | Start | End | Strand |
---|---|---|---|
Myeloid zinc finger protein MZF1 | 286 | 296 | + |
Fork head domain factors FOXK2 | 54 | 70 | - |
X-linked zinc finger protein ZFX and ZFY | 118 | 128 | - |
E2F transcription factor 4, p107/p130-binding protein | 475 | 491 | - |
SPI-1 proto-oncogene; hematopoietic transcription factor PU.1 | 205 | 225 | + |
Hematopoietically expressed homeobox, proline-rich homeodomain protein | 108 | 126 | + |
Below are some of transcription factors, with similarity of 1, that exist for the predicted promoter sequence. [11] [12]
Detailed family information | Start | End | Strand |
---|---|---|---|
AP1 activating protein 1 | 22 | 34 | + |
Myeloid Zinc finger protein MZF1 | 577 | 587 | + |
Myeloid transforming protein EVI1 (Ectotropic viral integration site 1 encoded factor, amino-terminal zinc finger domain) | 16 | 32 | + |
C2H2 xinc finger transcription factor 2 | 539 | 561 | - |
Fork Head domain factor alternative splice variant of FOXP1 (activated in ESCs) | 410 | 426 | - |
Pleomorphic adenoma gene 1 | 209 | 231 | + |
The longest protein sequence consists of 613 amino acids. [13] Within TTC39A, there is a domain of unknown function, DUF 3808. [13] DUF 3808 is generally considered to be an outer mitochondrial membrane protein and has been conserved from fungi to humans. [13] DUF 3808 also contains a tetratricopeptide, TPR_16 and a predicted transmembrane domain. [13] [14] The structure of TTC39A is predicted to have 12 alpha helices and a single beta sheet. [15]
KIAA0452 has a protein sequence with 453 amino acids. [16] The structure of KIAA0452 is predicted to have 11 alpha helices and 9 beta sheets. [15] KIAA0452 overlaps TTC39A via a domain of unknown function, DUF 3808. [13] KIAA0452 has a predicted molecular weight of 49.4 kdal [17] and an isoelectric point of 6.766000. [18]
There are 17 splice variants for TTC39A. [19] TTC39A is a highly expressed protein in the human body. [19] The highest levels of expression are located in mammary glands and testis. [13] The lowest levels of expression are found in the immune system. [13]
KIAA0452 expression is most prevalent in the brain, with the highest levels in the trigeminal ganglion and occipital lobe [20]
TTC39A is induced by TFAP2C in hormone responsive breast carcinoma cells [21] and TTC39A is expressed in estrogen receptor positive carcinoma cell lines. [22]
There are several predicted phosphorylation sites but none have been experimentally confirmed. [23] A single sulfination site is predicted at position 175. [24]
KIAA0452 has a single N-acetalation site at position 274. [25]
TTC39A has been found to interact with BTNL2 [26] and MAPK3. [27]
MALSU1 is a gene on chromosome 7 in humans that encodes the protein MALSU1. This protein localizes to mitochondria and is probably involved in mitochondrial translation or the biogenesis of the large subunit of the mitochondrial ribosome.
METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.
DEPDC5 is a human protein of poorly understood function but has been associated with cancer in several studies. It is encoded by a gene of the same name, located on chromosome 22.
Transmembrane protein 131 (TMEM131) is a protein that is encoded by the TMEM131 gene in humans. The TMEM131 protein contains three domains of unknown function 3651 (DUF3651) and two transmembrane domains. This protein has been implicated as having a role in T cell function and development. TMEM131 also resides in a locus (2q11.1) that is associated with Nievergelt's Syndrome when deleted.
Tetratricopeptide repeat protein 39C is a protein that in humans is encoded by the TTC39C gene. TTC39C is one of three TTC39. Its function is currently unknown; however, there is some evidence suggesting that it plays a role in anaphase. It also contains a relatively well-characterized structural motif called the tetratricopeptide repeat (TPR).
Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.
Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.
Family with sequence similarity 149, member A is a protein that in humans is encoded by the FAM149A gene. It is well conserved in primates, dog, cow, mouse, rat, and chicken. It has one paralog, FAM149B.
Cilia And Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs (trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38
TM6SF2 is the Transmembrane 6 superfamily 2 human gene which codes for a protein by the same name. This gene is otherwise called KIAA1926. Its exact function is currently unknown.
KIAA1841 is a gene in humans that encodes a protein known as KIAA1841. KIAA1841 is targeted for the nucleus and it predicted to play a role in regulating transcription.
Family with sequence similarity 63, member A is a protein that, is encoded by the FAM63A gene in humans,. It is located on the minus strand of chromosome 1 at locus 1q21.3.
C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.