C1orf112

Last updated
C1orf112
Identifiers
Aliases C1orf112 , chromosome 1 open reading frame 112
External IDs MGI: 3590554 HomoloGene: 10058 GeneCards: C1orf112
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_201364
NM_001357049

RefSeq (protein)

NP_958752
NP_001343978

Location (UCSC) Chr 1: 169.66 – 169.85 Mb Chr 1: 163.77 – 163.82 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. [5] C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. [6] C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence. [7] [8]

Gene

The gene spans 192,073 base pairs, with 29 different exons. C1orf112 is located at position 1q24.2. C1orf112 shares antisense coding regions with C1orf156 and SCYL3. [9]

Location of chromosome 1 open reading frame 112 in context of other genes within the region 1q24.2. Image courtesy of National Center for Biotechnology Information (NCBI). C1orf112 Location.png
Location of chromosome 1 open reading frame 112 in context of other genes within the region 1q24.2. Image courtesy of National Center for Biotechnology Information (NCBI).

Protein

There are currently eight experimentally determined RefSeq isoforms. [9] C1orf112 has a domain of unknown function DUF4487. [9]

Composition

Compositional analysis through SAPS predicted much less glycine and much more leucine than expected relative to other human protein sequences. This characteristic is conserved across primate orthologs. A mixed charge cluster was found in Isoform X1 from position 747 to 805, indicating that this segment may be aqueous and tightly bound. This mixed charge cluster is only partially conserved across orthologs. [11]

Transcripts

C1orf112 is determined to have 9 transcripts, or splice variants by Ensembl. [12]

Subcellular Localization

Antibody immunocytochemistry and immunofluorescent staining of human cell line A-431 indicates C1orf112 is localized to the mitochondria. [13] [14]

Immunofluorescent staining of human cell line A-431 indicates localization to mitochondria. Image courtesy of Human Protein Atlas and Sigma-Aldrich antibody HPA024451. C1orf112SubcellularLocalization.jpg
Immunofluorescent staining of human cell line A-431 indicates localization to mitochondria. Image courtesy of Human Protein Atlas and Sigma-Aldrich antibody HPA024451.

Regulation

Gene level regulation

Expression

Although tissue-level expression is ubiquitous, C1orf112 is expressed highest in the testes, lymph nodes, brain marrow, and cerebellum, with samples from 97 individual in 27 different tissues. [16] In-situ hybridization of the human transcriptome indicates expression is highest in the atrioventricular node, followed by the testis, testis germ cells, and testis interstitial tissue. [17]

Transcript level regulation

Transcription factor assessment indicates many potential TATA-binding protein and CCAAT-enhancer-binding proteins sites, along with transcription factors associated with the testis, thymus, kidneys, and cardiac tissue. [18]

Protein level regulation

There are two ubiquitination sites on C1orf112, at position lysine 73 and at position 783 on isoform X1. Downstream of reading frame, there are three polyadenylation signals. In addition, there is an N6-acetyllysine site at leucine 747 and a phosphoserine site at serine 23. [12] C1orf112 has been found experimentally to interact with ATG1, an aldosterone secretion whose overexpression characterizes certain forms of breast cancer. [19] Post-translational modifications predictions include O-glycosyl-oligosaccharide-glycoprotein N-acetylglucosaminyltransferase III and sumoylation, and sumoylation interaction sites. [20] [21]

Interacting proteins

C1orf112 is predicted to interact with a diverse range of proteins, including multiple mitosis-associated proteins. [22] C1orf112 is also predicted to interact with FIGNL1, a protein involved in DNA double-stranded break repair via homologous recombination. [23] Experimental findings indicate C1orf112 interacts with NUF2, a spindle-pole body protein that plays a critical role in nuclear division, and TTK, a protein kinase capable of phosphorylating serine, threonine, and tyrosine. [24]

Homology/evolution

Paralogs

There are no known paralogs of C1orf112. [9]

Orthologs

C1orf112 is highly conserved in Pan troglodytes, Rhinopithecus bieti, Castor canadensis, Miniopterus natalensis, and other select primates, with percent identity relative to Homo sapien C1orf112, with percent identity greater than 90%. [25] Orthologs with the greatest date of divergence (date of speciation) to human C1orf112 include Trichosporon asahii, a placozoa, and Amphimedon queenslandica, indicated that C1orf112 has been preserved over evolutionary time. [25]

C1orf112 Orthologs in Ascending Divergence
Genus/SpeciesCommon NameTaxonomic GroupDate of Divergence (Estimated Time)Accession #Sequence Length (aa)% Identity% SimilarityE Value
Homo sapiensHumansMammals0NP_001306976.1853100100
Pan troglodytesChimpanzeesPrimates6.65 myaXP_009436263.191199990
Rhinopithecus bietiBlack Stub-Nosed MonkeyPrimates29.44 myaXP_017723911.191197980
Castor canadensisAmerican BeaverRodentia90 myaXP_020026631.190887920
Miniopterus natalensisNatural long-fingered batChiroptera96 myaXP_016077003.191284900
Condylura cristataStar-Nosed MoleSoricomorpha96 myaXP_024409392.153177850
Bos indicusZebuCetartiodactyla96 myaXP_019832063.187584900
Acinonyx jubatusCheetahCarnivora96 myaXP_026902260.191286900
Aptenodytes forsteriEmperor PenguinAves312 myaXP_009271565.183959760
Chelonia mydasGreen Sea TurtleReptilia312 myaXP_007061247.184964780
Xenopus laevisAfrican Clawed FrogAmphibia352 myaXP_018114274.190455720
Nanorana parkeriNanora parkeriAmphibia352 myaXP_018428126.169852700
Salmo salarAtlantic SalmonActinopterygii435 myaXP_013979201.192447640
Helobdella robustaEarth wormClitellata797 myaXP_009029571.1100425430
Amphimedon queenslandicaSpongePorifera951.8 myaXP_019856681.190328460
Trichosporon asahiiFungiFungi1105 myaXP_014176969.1258843680.006

Date of divergence was calculated using TimeTree. [26] The E value indicates the number of "hits" one can expect to see by chance when using the NCBI database, with a low E value indicated a significant result. Percent identity is the percentage of character that align to Homo sapien C1orf112 Isoform X1, while percent similarity is the degree of resemblance when the two sequences are aligned with one another. [27]

Predicted tertiary structure for C1orf112 with highest statistical significance, generated by I-TASSER. C1orf112Predicted3DStructure.png
Predicted tertiary structure for C1orf112 with highest statistical significance, generated by I-TASSER.

Protein Structure

Secondary and Tertiary Structure

C1orf112 secondary structure is predicted to be predominately alpha helical, with < 5% of the protein composed of beta sheets. Ligand binding sites are predicted by I-TASSER from positions 377 to 530 in Isoform X1. [28] A leucine zipper motif is present in Isoform X1 from positions 831-852, predicted by MyHits. [29]

Clinical significance

C1orf112 was one of many genes found to be co-expressed with cancer-associated genes, and the knockdown of this gene in a HeLa cell line suppressed growth. [30]

Related Research Articles

<span class="mw-page-title-main">IFFO1</span> Protein-coding gene in the species Homo sapiens

Intermediate filament family orphan 1 is a protein that in humans is encoded by the IFFO1 gene. IFFO1 has uncharacterized function and a weight of 61.98 kDa. IFFO1 proteins play an important role in the cytoskeleton and the nuclear envelope of most eukaryotic cell types.

<span class="mw-page-title-main">C9orf152</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 152 is a protein that in humans is encoded by the C9orf152 gene. The exact function of the protein is not completely understood.

<span class="mw-page-title-main">C2orf73</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">Transmembrane protein 255A</span> Mammalian protein found in Homo sapiens

Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.

<span class="mw-page-title-main">Proline-rich protein 30</span>

Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">SHLD1</span> Protein-coding gene in the species Homo sapiens

SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">C12orf24</span> Protein-coding gene in humans

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

<span class="mw-page-title-main">C6orf136</span> Protein-coding gene in the species Homo sapiens

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

FAM237A is a protein coding gene which encodes a protein of the same name. Within Homo sapiens, FAM237A is believed to be primarily expressed within the brain, with moderate heart and lesser testes expression,. FAM237A is hypothesized to act as a specific activator of receptor GPR83.

<span class="mw-page-title-main">TBC1D30</span> Protein-coding gene in the species Homo sapiens

TBC1D30 is a gene in the human genome that encodes the protein of the same name. This protein has two domains, one of which is involved in the processing of the Rab protein. Much of the function of this gene is not yet known, but it is expressed mostly in the brain and adrenal cortex.

<span class="mw-page-title-main">TEKTIP1</span> Gene

TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene.

C4orf45 Human protein

Chromosome 4 Open Reading Frame 45 (C4orf45) is a protein which in humans is encoded by the C4orf45 gene. It is predicted to be localized in the cytoplasm and nucleus of a cell

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

<span class="mw-page-title-main">Chromosome 12 open reading frame 71</span> Protein encoded in humans by c12orf71 gene

Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000000460 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000041406 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: Chromosome 1 open reading frame 112".
  6. Bjellqvist B, Hughes GJ, Pasquali C, Paquet N, Ravier F, Sanchez JC, Frutiger S, Hochstrasser D (October 1993). "The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences". Electrophoresis. 14 (10): 1023–31. doi:10.1002/elps.11501401163. PMID   8125050. S2CID   38041111.
  7. Berglund L, Björling E, Oksvold P, Fagerberg L, Asplund A, Szigyarto CA, Persson A, Ottosson J, Wernérus H, Nilsson P, Lundberg E, Sivertsson A, Navani S, Wester K, Kampf C, Hober S, Pontén F, Uhlén M (October 2008). "A genecentric Human Protein Atlas for expression profiles based on antibodies". Molecular & Cellular Proteomics. 7 (10): 2019–27. CiteSeerX   10.1.1.549.7756 . doi: 10.1074/mcp.R800013-MCP200 . PMID   18669619.
  8. Claros MG (August 1995). "MitoProt, a Macintosh application for studying mitochondrial proteins". Computer Applications in the Biosciences. 11 (4): 441–7. doi:10.1093/bioinformatics/11.4.441. PMID   8521054.
  9. 1 2 3 4 "C1orf112 chromosome 1 open reading frame 112 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-02-25.
  10. "C1orf112 chromosome 1 open reading frame 112 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-05-04.
  11. Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–6. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  12. 1 2 "Gene: C1orf112 (ENSG00000000460) - Summary - Homo sapiens - Ensembl genome browser 96". uswest.ensembl.org. Retrieved 2019-04-29.[ permanent dead link ]
  13. "AB_1848667 Search - The Antibody Registry". antibodyregistry.org. Retrieved 2019-04-29.
  14. "C1orf112 - Antibodies - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2019-04-29.
  15. "C1orf112 - Antibodies - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2019-05-02.
  16. "GTEx Gene Expression for C1orf112". GTEx.
  17. Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB (April 2004). "A gene atlas of the mouse and human protein-encoding transcriptomes". Proceedings of the National Academy of Sciences of the United States of America. 101 (16): 6062–7. Bibcode:2004PNAS..101.6062S. doi: 10.1073/pnas.0400782101 . PMC   395923 . PMID   15075390.
  18. Cartharius K, Frech K, Grote K, Klocke B, Haltmeier M, Klingenhoff A, Frisch M, Bayerlein M, Werner T (July 2005). "MatInspector and beyond: promoter analysis based on transcription factor binding sites". Bioinformatics. 21 (13): 2933–42. doi: 10.1093/bioinformatics/bti473 . PMID   15860560.
  19. Deveaux Y, Alonso B, Pierrugues O, Godon C, Kazmaier M (October 2000). "Molecular cloning and developmental expression of AtGR1, a new growth-related Arabidopsis gene strongly induced by ionizing radiation" (PDF). Radiation Research. 154 (4): 355–64. doi:10.1667/0033-7587(2000)154[0355:MCADEO]2.0.CO;2. PMID   11023598. S2CID   31816421.
  20. Blom N, Sicheritz-Pontén T, Gupta R, Gammeltoft S, Brunak S (June 2004). "Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence". Proteomics. 4 (6): 1633–49. doi:10.1002/pmic.200300771. PMID   15174133. S2CID   18810164.
  21. Xue Y, Zhou F, Fu C, Xu Y, Yao X (July 2006). "SUMOsp: a web server for sumoylation site prediction". Nucleic Acids Research. 34 (Web Server issue): W254-7. doi:10.1093/nar/gkl207. PMC   1538802 . PMID   16845005.
  22. Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, Santos A, Doncheva NT, Roth A, Bork P, Jensen LJ, von Mering C (January 2017). "The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible". Nucleic Acids Research. 45 (D1): D362–D368. doi:10.1093/nar/gkw937. PMC   5210637 . PMID   27924014.
  23. Yuan J, Chen J (June 2013). "FIGNL1-containing protein complex is required for efficient homologous recombination repair". Proceedings of the National Academy of Sciences of the United States of America. 110 (26): 10640–5. Bibcode:2013PNAS..11010640Y. doi: 10.1073/pnas.1220662110 . PMC   3696823 . PMID   23754376.
  24. DeLuca JG, Dong Y, Hergert P, Strauss J, Hickey JM, Salmon ED, McEwen BF (February 2005). "Hec1 and nuf2 are core components of the kinetochore outer plate essential for organizing microtubule attachment sites". Molecular Biology of the Cell. 16 (2): 519–31. doi:10.1091/mbc.e04-09-0852. PMC   545888 . PMID   15548592.
  25. 1 2 Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL (July 2008). "NCBI BLAST: a better web interface". Nucleic Acids Research. 36 (Web Server issue): W5-9. doi:10.1093/nar/gkn201. PMC   2447716 . PMID   18440982.
  26. Hedges SB, Dudley J, Kumar S (December 2006). "TimeTree: a public knowledge-base of divergence times among organisms". Bioinformatics. 22 (23): 2971–2. doi: 10.1093/bioinformatics/btl505 . PMID   17021158.
  27. "BLAST Frequently Asked questions". blast.ncbi.nlm.nih.gov. Retrieved 2019-05-04.
  28. 1 2 Zhang Y (January 2008). "I-TASSER server for protein 3D structure prediction". BMC Bioinformatics. 9 (1): 40. doi: 10.1186/1471-2105-9-40 . PMC   2245901 . PMID   18215316.
  29. Pagni M, Ioannidis V, Cerutti L, Zahn-Zabal M, Jongeneel CV, Hau J, Martin O, Kuznetsov D, Falquet L (July 2007). "MyHits: improvements to an interactive resource for analyzing protein sequences". Nucleic Acids Research. 35 (Web Server issue): W433-7. doi:10.1093/nar/gkm352. PMC   1933190 . PMID   17545200.
  30. van Dam S, Cordeiro R, Craig T, van Dam J, Wood SH, de Magalhães JP (October 2012). "GeneFriends: an online co-expression analysis tool to identify novel gene targets for aging and complex diseases". BMC Genomics. 13 (1): 535. doi: 10.1186/1471-2164-13-535 . PMC   3495651 . PMID   23039964.

Further reading