Tetratricopeptide repeat protein 39C

Last updated
TTC39C
Identifiers
Aliases TTC39C , C18orf17, HsT2697, tetratricopeptide repeat domain 39C
External IDs MGI: 1919997 HomoloGene: 124438 GeneCards: TTC39C
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001135993
NM_001243425
NM_001292030
NM_153211

NM_028341

RefSeq (protein)

NP_001129465
NP_001230354
NP_001278959
NP_694943

NP_082617

Location (UCSC) Chr 18: 23.99 – 24.14 Mb Chr 18: 12.73 – 12.87 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Tetratricopeptide repeat protein 39C is a protein that in humans is encoded by the TTC39C gene. TTC39C is one of three TTC39. [5] Its function is currently unknown; however, there is some evidence suggesting that it plays a role in anaphase. [6] [7] It also contains a relatively well-characterized structural motif called the tetratricopeptide repeat (TPR).

Contents

Gene

TTC39C is located on the long arm of human chromosome 18 at 18q11.2. [8] Its most common aliases are FLJ33761, C18orf17, and HsT2697. [8]

Protein

The TTC39C protein is 583 amino acids long, and appears to be localized to the cytoplasm of the cell based on its predicted secondary structure [9] It has three isoforms, the longest of which is transcript variant 1. [8] It contains three tetratricopeptide repeats, as well as the domain of unknown function DUF3808. [5] The protein product has a molecular weight of 65.9 kDa and an isoelectric point of 6.584. [10] [11] There are several predicted phosphorylation, acetylation, and palmitoylation sites, which are shown in the table below. [12] Its whole predicted secondary structure is composed almost entirely of helices, and forms a tertiary structure that is predicted to match the structure of the cut9 protein in yeast with 100% accuracy and 85% coverage. [6] [13]

TypeLocation [12] Function [14] Quality of Sites
Phosphorylation
  • Threonine
    • 1, 4, and 9
  • Serine
    • 8, 9, 29, 31, 32, 35, and 39
  • Tyrosine
    • 5, 10, and 18
Regulation of cell cycle, apoptosis, cell signaling, etc.High
Acetylation
  • Possible acetylation site immediately downstream of the N-terminal methionine
Regulates gene expressionModerate
Palmitoylation
  • Cysteines 87 and 497
Anchors Protein to membraneModerate

Homology

The TTC39C protein is over 50% conserved in most vertebrates, and has conservation levels as low as about 24% in invertebrates. [15] At least one ortholog and several homologs have also been identified in fungi. [8] No orthologs have been identified in plants. [8] Several of its orthologs are shown in the table below.

Scientific NameCommon NameMean Predicted Date of Divergence (MYA) [16] Protein Length (aa) [15] RNA NCBI Accession Number [15] Protein NCBI Accession Number [15] RNA Identity [15] Protein Identity [15]
Callithrix jacchus Marmoset 42.6522XM_002807499XP_00280754542.2%99.4%
Anolis carolinensis Anole 296.0574XM_003219667XP_00321971527.7%81.7%
Gallus gallus Chicken 296.0576XM_419163XP_41916336.1%88.0%
Xenopus tropicalis Western Clawed Frog 371.2570NM_001113821NP_00110729355.20%88.0%
Oreochromis niloticus Tilapia 400.1578XM_003444269XP_00344431724.0%70.2%
Apis mellifera Honeybee 782.7877XM_001120231XP_00112023138.7%24.3%
Saccharomyces cerevisiae Baker's Yeast 1215.8725NM_001179808NP_01294329.7%17.6%

TTC39C has two paralogs from the TTC39 family: TTC39A, and TTC39B, which are located on chromosome 1, at 1p32.3 and chromosome 9, at 9p22.3 respectively. TTC39B has been associated with the management of HDL cholesterol, and may be involved in the prevention of cardiovascular disease [17]

Expression

Expression in Various Tissues TTC39C expression in samples of various human tissues.png
Expression in Various Tissues

[18]

The promoter region controlling expression of transcript variant 1 begins approximately 952 bases upstream of the start codon, and includes the entire 5' UTR. [19] Several possible transcription factor binding sites have been identified using the program El Dorado including the CCCTC binding factor and a site for CTCF, an insulator protein that binds CCCTC. [19] This transcription factor is associated with a number of functions including organization of chromatin. [20] There were also several sites that appear to be the general transcription factor TFIIB, and both E2F and E2F transcription factor binding sites. [19] The E2F transcription factors are involved in mediating the cell cycle, which could be a potential link to the hypothesized role of TTC39C in anaphase. [21] Several microarray studies of humans, dogs and mice have provided evidence that TTC39C is most highly expressed in the liver. [18] [22] [23] It exhibits relatively high expression in all tissues, and had a percentile rank above 50% in all tissues except in kidney, spinal cord, and skeletal muscle samples of humans. [18]

Function

Anaphase IF Anaphase IF.jpg
Anaphase IF

The function of TTC39C is currently unknown. [8] However, one of its structural motifs, the tetratricopeptide repeat is relatively well characterized, and has been shown to be active primarily in four categories of protein-protein interactions: interacting with molecular chaperones, mediating the start of anaphase during cell division, transcription repression, and the transport of proteins. [7] Of these four areas of functionality, the most evidence exists for its involvement in the initiation of anaphase. Two of the three possible interacting proteins identified by STRING play a role in anaphase. [24] Finally, the protein that had a structure that appears to perfectly match 85% of the TTC39C sequence is involved in anaphase in the yeast, Schizosaccharomyces pombe . [6] However, the role of TTC39C in anaphase must be confirmed through additional studies.

Interactions

TTC39C Interaction Network TTC39C Interaction Network.png
TTC39C Interaction Network

[24]

Textmining studies have identified several proteins that TTC39C may interact with. [24] These protein-protein interactions have not been confirmed; however, two of the identified proteins, AGBL1 and HAUS4, are likely candidates due to their roles in anaphase. [24] Additional potential protein-protein interactions were identified for TTC39C in mice; however, there is no apparent connection to TTC39C besides coexpression.

Related Research Articles

<span class="mw-page-title-main">CTCF</span> Transcription factor

Transcriptional repressor CTCF also known as 11-zinc finger protein or CCCTC-binding factor is a transcription factor that in humans is encoded by the CTCF gene. CTCF is involved in many cellular processes, including transcriptional regulation, insulator activity, V(D)J recombination and regulation of chromatin architecture.

<span class="mw-page-title-main">Tetratricopeptide repeat 39A</span> Protein-coding gene in the species Homo sapiens

Tetratricopeptide repeat 39A is a human protein encoded by the TTC39A gene. TTC39A is also known as DEME-6, KIAA0452, and c1orf34. The function of TTC39A is currently not well understood. The main feature within tetratricopeptide repeat 39A is the domain of unknown function 3808 (DUF3808), spanning almost the entire protein. KIAA0452 can also be seen as an isoform of TTC39A because of differences in genome sequence, but overlap in DUF domain.

<span class="mw-page-title-main">KIAA1958</span> Protein-coding gene in the species Homo sapiens

Protein KIAA1958 is a protein that in humans is encoded by the KIAA1958 gene. Orthologs of KIAA1958 go as far back in evolution to chordates, although, it is closer in homology to primates than any other orthologs. KIAA1958 has no known paralogs.

<span class="mw-page-title-main">POTEB</span> Protein-coding gene in the species Homo sapiens

POTE ankyrin domain family, member B is a protein in humans that is encoded by the POTEB gene.(Prostate, Ovary, Testes Expressed ankyrin domain family member B).It is most likely involved in mediating protein-protein interaction via its 5 ankyrin domains. POTEB is most probably aids in intracellular signaling, but is not likely to be a secreted or nuclear protein. POTEB's function is likely to be regulated via 17 potential phosphorylation sites. There is currently no evidence to suggest that POTEB has nuclear localization signals.

<span class="mw-page-title-main">Protein FAM46B</span> Protein-coding gene in the species Homo sapiens

Protein FAM46B also known as family with sequence similarity 46 member B is a protein that in humans is encoded by the FAM46B gene. FAM46B contains one protein domain of unknown function, DUF1693. Yeast two-hybrid screening has identified three proteins that physically interact with FAM46B. These are ATX1, PEPP2 and DAZAP2.

<span class="mw-page-title-main">QSER1</span> Protein-coding gene in the species Homo sapiens

Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.

<span class="mw-page-title-main">FAM203B</span> Protein-coding gene in the species Homo sapiens

Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.

<span class="mw-page-title-main">CCDC94</span> Protein found in humans

Coiled-coil domain containing 94 (CCDC94) is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.

<span class="mw-page-title-main">Tetratricopeptide repeat protein 39B</span> Protein-coding gene in the species Homo sapiens

Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.

<span class="mw-page-title-main">WWC2</span> Protein-coding gene in the species Homo sapiens

WW and C2 domain containing 2 (WWC2) is a protein that in humans is encoded by the WWC2 gene (4q35.1). Though function of WWC2 remains unknown, it has been predicted that WWC2 may play a role in cancer.

<span class="mw-page-title-main">KIAA1841</span> Protein-coding gene in the species Homo sapiens

KIAA1841 is a gene in humans that encodes a protein known as KIAA1841. KIAA1841 is targeted for the nucleus and it predicted to play a role in regulating transcription.

<span class="mw-page-title-main">EVI5L</span> Protein-coding gene in the species Homo sapiens

EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.

<span class="mw-page-title-main">Transmembrane protein 268</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 268 is a protein that in humans is encoded by TMEM268 gene. The protein is a transmembrane protein of 342 amino acids long with eight alternative splice variants. The protein has been identified in organisms from the common fruit fly to primates. To date, there has been no protein expression found in organisms simpler than insects.

<span class="mw-page-title-main">ANKRD24</span> Protein-coding gene in the species Homo sapiens

Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

<span class="mw-page-title-main">CRACD-like protein</span>

CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

<span class="mw-page-title-main">C22orf31</span> Protein-coding gene in the species Homo sapiens

C22orf31 is a protein which in humans is encoded by the C22orf31 gene. The C22orf31 mRNA transcript has an upstream in-frame stop codon, while the protein has a domain of unknown function (DUF4662) spanning the majority of the protein-coding region. The protein has orthologs with high percent similarity in mammals. The most distant orthologs are found in species of bony fish, but C22orf31 is not found in any species of birds or amphibians.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000168234 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000024424 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 NCBI. "TTC39C tetratricopeptide repeat domain 39C [ Homo sapiens (human) ]". U.S. National Library of Medicine. Retrieved 4 May 2013.
  6. 1 2 3 "Phyre2" . Retrieved 18 April 2013.[ permanent dead link ]
  7. 1 2 Blatch GL, Lässle M (November 1999). "The tetratricopeptide repeat: a structural motif mediating protein-protein interactions". BioEssays. 21 (11): 932–9. doi:10.1002/(SICI)1521-1878(199911)21:11<932::AID-BIES5>3.0.CO;2-N. PMID   10517866.
  8. 1 2 3 4 5 6 Genecards. "TTC39C Gene". Weizmann Institute of Science. Retrieved 4 May 2013.
  9. Nakai and Horton. "PSORTII" . Retrieved 4 May 2013.[ permanent dead link ]
  10. San Diego Supercomputer's Biology Workbench. "SAPS" . Retrieved 9 May 2013.[ permanent dead link ]
  11. San Diego Supercomputer's Biology Workbench. "PI" . Retrieved 9 May 2013.[ permanent dead link ]
  12. 1 2 SIB Swiss Institute of Bioinformatics. "ExPASy" . Retrieved 9 May 2013.
  13. San Diego Supercomputer-Biology Workbench. "PELE" . Retrieved 4 May 2013.[ permanent dead link ]
  14. Thermo Scientific. "Overview o Post-Translational Modifications". Thermo Fisher Scientific Inc. Retrieved 9 May 2013.
  15. 1 2 3 4 5 6 NCBI. "BLAST". National Library of Medicine. Retrieved 8 May 2013.
  16. Kimar and Hedges. "Time Tree" . Retrieved 11 May 2013.
  17. Kathiresan S, Willer CJ, Peloso GM, Demissie S, Musunuru K, Schadt EE, Kaplan L, Bennett D, Li Y, Tanaka T, Voight BF, Bonnycastle LL, Jackson AU, Crawford G, Surti A, Guiducci C, Burtt NP, Parish S, Clarke R, Zelenika D, Kubalanza KA, Morken MA, Scott LJ, Stringham HM, Galan P, Swift AJ, Kuusisto J, Bergman RN, Sundvall J, Laakso M, Ferrucci L, Scheet P, Sanna S, Uda M, Yang Q, Lunetta KL, Dupuis J, de Bakker PI, O'Donnell CJ, Chambers JC, Kooner JS, Hercberg S, Meneton P, Lakatta EG, Scuteri A, Schlessinger D, Tuomilehto J, Collins FS, Groop L, Altshuler D, Collins R, Lathrop GM, Melander O, Salomaa V, Peltonen L, Orho-Melander M, Ordovas JM, Boehnke M, Abecasis GR, Mohlke KL, Cupples LA (January 2009). "Common variants at 30 loci contribute to polygenic dyslipidemia". Nat. Genet. 41 (1): 56–65. doi:10.1038/ng.291. PMC   2881676 . PMID   19060906.
  18. 1 2 3 Affymetrix, Inc. "GDS424 / 64649_at / TTC39C". NCBI- GEO Profiles. Retrieved 4 May 2013.
  19. 1 2 3 "El Dorado-TTC39C". Genomatix. Archived from the original on 2 December 2021. Retrieved 4 May 2013.
  20. Phillips JE, Corces VG (June 2009). "CTCF: master weaver of the genome". Cell. 137 (7): 1194–211. doi:10.1016/j.cell.2009.06.001. PMC   3040116 . PMID   19563753.
  21. Zheng N, Fraenkel E, Pabo CO, Pavletich NP (March 1999). "Structural basis of DNA recognition by the heterodimeric cell cycle transcription factor E2F-DP". Genes Dev. 13 (6): 666–74. doi:10.1101/gad.13.6.666. PMC   316551 . PMID   10090723.
  22. Affymetrix, Inc. "GDS4164 / CfaAffx.27815.1.S1_s_at / TTC39C". NCBI-GeoProfiles. Retrieved 4 May 2013.
  23. EBI. "Ttc39c differential expression summary" . Retrieved 4 May 2013.
  24. 1 2 3 4 "STRING - Known and Predicted Protein-Protein Interactions" . Retrieved 2 May 2013.