C16orf78

Last updated

Uncharacterized protein C16orf78(NP_653203.1) is a protein that in humans is encoded by the chromosome 16 open reading frame 78 gene. [1]

Contents

Gene

The C16orf78 gene(123970) is located at 16q12.1 on the plus strand, spanning 25,609 bp from 49,407,734 to 49,433,342. [2]

mRNA

There is one mRNA transcript (NM_144602.3) and no other known splice isoforms. There are 5 exons, totaling a length of 1068 base pairs. [2]

Protein

Sequence

C16orf78 is 265 amino acids long with a predicted molecular weight of 30.8 kDal and pI of 9.8. [3] It is rich in both methionine and lysine, composed of 6.4% methionine and 13.6% lysine. [4] This methionine richness has been hypothesized to serve as a mitochondrial antioxidant. [5]

Post-Transnational Modifications

There are four verified ubiquitination sites and three verified phosphorylation sites. [6] [7]

Diagram of C16orf78 protein with ubiquitination sites marked in red and phosphorylation sites marked in gray. C16orf78Diagram.png
Diagram of C16orf78 protein with ubiquitination sites marked in red and phosphorylation sites marked in gray.

Structure

Predictions of C16orf78's secondary structure consist primarily of alpha helices and coiled coils. [9] [10] [11] Phyre2 also predicted C16orf78 is primarily helical, but 253 of 265 amino acids were modeled ab initio so the confidence of the model is low. [12]

Phyre2 generated model of C16orf78 rendered in Chimera. C16orf78Phyre2WikiShot.png
Phyre2 generated model of C16orf78 rendered in Chimera.

Subcellular Localization

C16orf78 is predicted to be localized to the cell nucleus. [13] There is also a predicted bipartite nuclear localization signal. [14]

Expression

C16orf78 has restricted expression toward the testis, with much lower expression in other tissues. [15]

Expression of C16orf78 across multiple human tissues C16orf78 Tissue Expression Profile Graph.png
Expression of C16orf78 across multiple human tissues

Interaction

C16orf78 has a physical association with DNA/RNA-binding protein KIN17 (NP_036443.1), suggesting C16orf78 may also play a role in DNA repair. [17] C16orf78 was found to be phosphorylated by SRPK1(NP_003128.3) and SPRK2 (AAH68547.1). [6]

Clinical Significance

Deletion of the C16orf78 gene has been identified as a determinant of prostate cancer. [18] A SNP in C16orf78 interacts with a SNP in LMTK2 and is associated with risk of prostate cancer. [19]

Amplification of the C16orf78 gene has been linked to metabolically adaptive cancer cells. [20] A duplication of the C16orf78 gene was associated with at least one case of Rolandic Epilepsy. [21]

Homology

Paralogs

C16orf78 has no known paralogs in humans. [22]

Orthologs

C16orf78 has over 80 orthologs, including animals as distant Ciona intestinalis (XP_002132057.1), which is estimated to have diverged from humans 676 million years ago. [2] [23] C16orf78 has orthologs in many types of mammals, reptiles, bony fish, and even some invertebrates, but has no known orthologs in amphibians or birds. [22] Below is a table with samples of orthologs, with divergence dates from TimeTree and similarity calculated by pairwise sequence alignment. [24]

Table of C16orf78 Orthologs
Species NameNCBI AccessionDivergence (mya) (estimated)Length (aa)% Identity% Similarity
Homo sapiensNP_653203.10265100%100%
Gorilla gorilla gorillaXP_004057673.29.0626596%98%
Macaca mulattaXP_001082258.129.4426789%93%
Galeopterus variegatusXP_008591134.17626665%77%
Oryctolagus cuniculusXP_008273281.19025562%76%
Mus musculusNP_808569.19027057%69%
Lipotes vexilliferXP_007459548.19626665%77%
Capra hircusXP_017918754.19627663%74%
Callorhinus ursinusXP_025708226.19625062%74%
Pteropus vampyrusXP_011358492.19626360%74%
Loxodonta africanaXP_023411324.110528548%55%
Sarcophilus harrisiiXP_003757266.115927038%53%
Vombatus ursinusXP_027723426.115927538%54%
Pogona vitticepsXP_020643996.131231526%43%
Gekko japonicusXP_015263322.131226125%47%
Python bivittatusXP_025030465.131231323%37%
Latimeria chalumnaeXP_014344069.141331019%42%
Acipenser ruthenusRXM34621.143520215%37%
Ciona intestinalisXP_002132057.167639610%32%
Apostichopus japonicusPIK46940.16842929%33%

Related Research Articles

C12orf66 is a protein that in humans is encoded by the C12orf66 gene. The C12orf66 protein is one of four proteins in the KICSTOR protein complex which negatively regulates mechanistic target of rapamycin complex 1 (mTORC1) signaling.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C9orf25</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

<span class="mw-page-title-main">C2orf16</span> Protein-coding gene in the species Homo sapiens

C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.

Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development

<span class="mw-page-title-main">C7orf26</span> Human protein-encoding gene on chromosome 7

c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

<span class="mw-page-title-main">SMCO3</span>

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

<span class="mw-page-title-main">C1orf185</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.

<span class="mw-page-title-main">C20orf202</span>

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">LSMEM2</span>

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">C12orf29</span>

C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.

References

  1. "uncharacterized protein C16orf78 [Homo sapiens] - Protein - NCBI". ncbi.nlm.nih.gov. Retrieved 2019-02-26.
  2. 1 2 3 "Gene: C16orf78 (ENSG00000166152) - Summary - Homo sapiens - Ensembl genome browser 96". useast.ensembl.org. Retrieved 2019-05-05.
  3. "ExPASy - ProtParam tool". web.expasy.org. Retrieved 2019-05-05.
  4. "SAPS < Sequence Statistics < EMBL-EBI". ebi.ac.uk. Retrieved 2019-05-05.
  5. Schindeldecker, Mario; Moosmann, Bernd (10 April 2015). "Protein-borne methionine residues as structural antioxidants in mitochondria". Amino Acids. 47 (7): 1421–1432. doi:10.1007/s00726-015-1955-8. PMID   25859649. S2CID   16953847.
  6. 1 2 "C16orf78 Result Summary | BioGRID". thebiogrid.org. Retrieved 2019-05-05.
  7. "C16orf78 (human)". phosphosite.org. Retrieved 2019-05-05.
  8. "PROSITE". prosite.expasy.org. Retrieved 2019-05-05.
  9. "CFSSP: Chou & Fasman Secondary Structure Prediction Server". biogem.org. Retrieved 2019-05-05.
  10. "NPS@ : GOR4 secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2019-05-05.
  11. "JPred: A Protein Secondary Structure Prediction Server". compbio.dundee.ac.uk. Retrieved 2019-05-05.
  12. Kelley, Lawrence A; Mezulis, Stefans; Yates, Christopher M; Wass, Mark N; Sternberg, Michael J E (7 May 2015). "The Phyre2 web portal for protein modeling, prediction and analysis". Nature Protocols. 10 (6): 845–858. doi:10.1038/nprot.2015.053. PMC   5298202 . PMID   25950237.
  13. Horton, P.; Park, K.-J.; Obayashi, T.; Fujita, N.; Harada, H.; Adams-Collier, C.J.; Nakai, K. (8 May 2007). "WoLF PSORT: protein localization predictor". Nucleic Acids Research. 35 (Web Server): W585–W587. doi:10.1093/nar/gkm259. PMC   1933216 . PMID   17517783.
  14. "Motif Scan". myhits.isb-sib.ch. Retrieved 2019-05-05.
  15. "C16orf78 chromosome 16 open reading frame 78 [Homo sapiens (human)] - Gene - NCBI". ncbi.nlm.nih.gov. Retrieved 2019-05-05.
  16. "49000288 - GEO Profiles - NCBI". ncbi.nlm.nih.gov. Retrieved 2019-05-05.
  17. IntAct. "IntAct Portal". ebi.ac.uk. Retrieved 2019-05-05.
  18. DePihno, R. A et al. (2016). U.S. Patent No. 9458510. Washington, DC: U.S. Patent and Trademark Office.
  19. Tao, Sha; Wang, Zhong; Feng, Junjie; Hsu, Fang-Chi; Jin, Guangfu; Kim, Seong-Tae; Zhang, Zheng; Gronberg, Henrik; Zheng, Lilly S.; Isaacs, William B.; Xu, Jianfeng; Sun, Jielin (March 2012). "A genome-wide search for loci interacting with known prostate cancer risk-associated genetic variants". Carcinogenesis. 33 (3): 598–603. doi:10.1093/carcin/bgr316. PMC   3291863 . PMID   22219177.
  20. Singh, Balraj; Shamsnia, Anna; Raythatha, Milan R.; Milligan, Ryan D.; Cady, Amanda M.; Madan, Simran; Lucci, Anthony; Das, Gokul M. (3 October 2014). "Highly Adaptable Triple-Negative Breast Cancer Cells as a Functional Model for Testing Anticancer Agents". PLOS ONE. 9 (10): e109487. Bibcode:2014PLoSO...9j9487S. doi: 10.1371/journal.pone.0109487 . PMC   4184880 . PMID   25279830.
  21. Reinthaler, Eva M.; Lal, Dennis; Lebon, Sebastien; Hildebrand, Michael S.; Dahl, Hans-Henrik M.; Regan, Brigid M.; Feucht, Martha; Steinböck, Hannelore; Neophytou, Birgit; Ronen, Gabriel M.; Roche, Laurian; Gruber-Sedlmayr, Ursula; Geldner, Julia; Haberlandt, Edda; Hoffmann, Per; Herms, Stefan; Gieger, Christian; Waldenberger, Melanie; Franke, Andre; Wittig, Michael; Schoch, Susanne; Becker, Albert J.; Hahn, Andreas; Männik, Katrin; Toliat, Mohammad R.; Winterer, Georg; Lerche, Holger; Nürnberg, Peter; Mefford, Heather; Scheffer, Ingrid E.; Berkovic, Samuel F.; Beckmann, Jacques S.; Sander, Thomas; Jacquemont, Sebastien; Reymond, Alexandre; Zimprich, Fritz; Neubauer, Bernd A.; Reinthaler, Eva M.; Zimprich, Fritz; Feucht, Martha; Steinböck, Hannelore; Neophytou, Birgit; Geldner, Julia; Gruber-Sedlmayr, Ursula; Haberlandt, Edda; Ronen, Gabriel M.; Roche, Laurian; Lal, Dennis; Nürnberg, Peter; Sander, Thomas; Lerche, Holger; Neubauer, Bernd; Zimprich, Fritz; Mörzinger, Martina; Feucht, Martha; Suls, Arvid; Weckhuysen, Sarah; Claes, Lieve; Deprez, Liesbet; Smets, Katrien; Van Dyck, Tine; Deconinck, Tine; De Jonghe, Peter; Møller, Rikke S; Klitten, Laura L.; Hjalgrim, Helle; Møller, Rikke S; Campus, Kiel; Helbig, Ingo; Muhle, Hiltrud; Ostertag, Philipp; von Spiczak, Sarah; Stephani, Ulrich; Nürnberg, Peter; Sander, Thomas; Trucks, Holger; Elger, Christian E.; Kleefuß-Lie, Ailing A.; Kunz, Wolfram S.; Surges, Rainer; Gaus, Verena; Janz, Dieter; Sander, Thomas; Schmitz, Bettina; Rosenow, Felix; Klein, Karl Martin; Reif, Philipp S.; Oertel, Wolfgang H.; Hamer, Hajo M.; Becker, Felicitas; Weber, Yvonne; Lerche, Holger; Koeleman, Bobby P.C.; de Kovel, Carolien; Lindhout, Dick; Lindhout, Dick; Ameil, Agnès; Andrieux, Joris; Bouquillon, Sonia; Boute, Odile; de Flandre, Jeanne; Cuisset, Jean Marie; Cuvellier, Jean-Christophe; Salengro, Roger; David, Albert; de Vries, Bert; Delrue, Marie-Ange; Doco-Fenzy, Martine; Fernandez, Bridget A.; Heron, Delphine; Keren, Boris; Lebel, Robert; Leheup, Bruno; Lewis, Suzanne; Mencarelli, Maria Antonietta; Mignot, Cyril; Minet, Jean-Claude; Moerman, Alexandre; Morice-Picard, Fanny; Mucciolo, Mafalda; Ounap, Katrin; Pasquier, Laurent; Petit, Florence; Ragona, Francesca; Rajcan-Separovic, Evica; Renieri, Alessandra; Rieubland, Claudine; Sanlaville, Damien; Sarrazin, Elisabeth; Shen, Yiping; van Haelst, Mieke; Silfhout, Anneke Vulto-van (15 November 2014). "16p11.2 600 kb Duplications confer risk for typical and atypical Rolandic epilepsy". Human Molecular Genetics. 23 (22): 6069–6080. doi: 10.1093/hmg/ddu306 . PMID   24939913.
  22. 1 2 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2019-05-05.
  23. "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2019-05-05.
  24. "Pairwise Sequence Alignment Tools < EMBL-EBI". ebi.ac.uk. Retrieved 2019-05-05.