C3orf67

CFAP20DC
Identifiers
Aliases CFAP20DC , chromosome 3 open reading frame 67, C3orf67, CFAP20 domain containing
External IDs MGI: 1926154 HomoloGene: 18873 GeneCards: CFAP20DC
Orthologs
Species: Human, Mouse
RefSeq (mRNA): NM_028934
RefSeq (protein): NP_083210
Location (UCSC): Chr 3: 58.72 – 59.05 Mb (human); Chr 14: 13.8 – 14.04 Mb (mouse)
PubMed search [3] [4]

Chromosome 3 open reading frame 67 or C3orf67 is a protein that in humans is encoded by the gene C3orf67. [5] [6] The function of C3orf67 is not yet fully understood.

Gene

The C3orf67 gene is located at 3p14.2 on the reverse strand, spanning base pairs 58,716,417 to 59,050,045. [7] [5] The accession number of the encoded protein is NP_001338459.1. [8]
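
The coordinates above imply the size of the locus. The following is a minimal sketch of that arithmetic, assuming the two positions are 1-based and inclusive (the source does not state the coordinate convention); the function and variable names are illustrative only.

```python
# Genomic coordinates of C3orf67 on chromosome 3, as quoted in the text.
# Assumption: both positions are 1-based and inclusive.
GENE_START = 58_716_417
GENE_END = 59_050_045


def span_length(start: int, end: int) -> int:
    """Return the number of bases covered by an inclusive interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


gene_length_bp = span_length(GENE_START, GENE_END)
print(f"C3orf67 spans {gene_length_bp:,} bp (about {gene_length_bp / 1e6:.2f} Mb)")
```

Under that assumption the result agrees with the 58.72 – 59.05 Mb interval listed for chromosome 3.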

Protein

Map of C3orf67 with exon/exon boundaries, conserved domains, and highly conserved regions.

Primary sequence and isoforms

The coding sequence spans nucleotides 402 to 2681 of the 3,135-base-pair mRNA, [7] encoding a protein of 759 amino acids. [5] [8] C3orf67 has six validated isoforms. [5] Isoform one is the most complete, with 16 exons. [7] The protein has a predicted molecular weight of 84.35 kilodaltons. [9]
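
The figures in this paragraph can be cross-checked against one another. The sketch below assumes the coding-sequence positions are 1-based, inclusive, and include the stop codon (assumptions not stated in the source); all names are illustrative.

```python
# Values quoted in the text for isoform one of C3orf67.
# Assumptions: positions are 1-based and inclusive, and the annotated
# coding sequence ends with a stop codon that is not translated.
MRNA_LENGTH_NT = 3135
CDS_START = 402
CDS_END = 2681
REPORTED_MASS_KDA = 84.35

cds_length_nt = CDS_END - CDS_START + 1
if cds_length_nt % 3 != 0:
    raise ValueError("coding sequence length is not a multiple of three")

codons = cds_length_nt // 3
protein_length_aa = codons - 1          # drop the stop codon

five_prime_utr_nt = CDS_START - 1       # bases upstream of the start codon
three_prime_utr_nt = MRNA_LENGTH_NT - CDS_END

# Average mass per residue implied by the reported molecular weight.
mean_residue_mass_da = REPORTED_MASS_KDA * 1000 / protein_length_aa

print(f"CDS length:      {cds_length_nt} nt ({codons} codons)")
print(f"Protein length:  {protein_length_aa} aa")
print(f"5' UTR / 3' UTR: {five_prime_utr_nt} nt / {three_prime_utr_nt} nt")
print(f"Mean residue mass: {mean_residue_mass_da:.1f} Da")
```

With these assumptions the coding sequence yields the 759 amino acids reported above, and the implied mass per residue is close to the roughly 110 Da commonly used as a rule of thumb for proteins.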

Domains and motifs

Three functional domains have been identified for C3orf67: DUF667, CM_mono2, and OCRE. [10]

Post-translational modifications

Map of conserved PTMs for C3orf67.

Several post-translational modifications have been predicted in conserved regions of C3orf67 using a range of bioinformatic prediction tools. [11] [12] [13] [14] [15] [16] [17] [18]

Secondary structure

The N-terminal region of C3orf67 is predicted to consist of a series of beta strands and a couple of alpha helices that coincide with the DUF667 domain. Alpha helices are also predicted in the regions that correspond to the CM_mono2 and OCRE domains. [19] [20] [21]

Tertiary structure

Predicted tertiary structure of DUF667 region of C3orf67.

The DUF667 region is predicted to form a tube-like structure from a series of beta sheets. [21]

Homology and Evolution

Paralogs

There are no known paralogs of C3orf67.

Orthologs

Orthologs of C3orf67 have been identified in a wide range of species, including fungi, plants, hemichordates, parasites, fish, reptiles, birds, invertebrates, and mammals.

Variety of orthologous species of C3orf67.

| Species | Common Name | Date of Divergence (MYA) | Accession Number | Sequence Length (aa) | % Identity |
|---|---|---|---|---|---|
| Orbicella faveolata | Mountainous star coral | 824 | XP_020630732.1 / XP_020630739.1 | 849 | 32.20% |
| Exaiptasia pallida | Pale anemone | 824 | XP_020899564.1 | 797 | 32.00% |
| Acanthaster planci | Crown-of-thorns starfish | 684 | XP_022107809.1 | 976 | 31.60% |
| Stylophora pistillata | Smooth cauliflower coral | 824 | XP_022782397.1 | 825 | 30.80% |
| Crassostrea gigas | Pacific oyster | 797 | XP_011453705.1 | 950 | 29.50% |
| Lingula anatina | Lamp shell | 797 | XP_013404893.1 | 1077 | 29.30% |
| Octopus bimaculoides | California two-spotted octopus | 797 | XP_014778712.1 | 902 | 29.10% |
| Saccoglossus kowalevskii | Acorn worm | 684 | XP_006821003.1 | 596 | 23.30% |
| Amphimedon queenslandica | Sponge | 951.8 | XP_011402616.1 | 508 | 22.70% |

Mutation rate of C3orf67.

Distant homologs

Most distant homologs of C3orf67.

| Species | Common Name | Date of Divergence (MYA) | Accession Number | Sequence Length (aa) | % Identity |
|---|---|---|---|---|---|
| Trichinella spiralis | Trichina worm | 797 | XP_003374081.1 | 393 | 12.60% |
| Spizellomyces punctatus | Unknown | 1105 | XP_016608387.1 | 183 | 8.20% |
| Selaginella moellendorffii | Spikemoss | 1496 | XP_002989784.1 | 209 | 6.00% |
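
The divergence dates and percent identities listed for the orthologs and distant homologs can be combined into a rough view of sequence change over time. The sketch below is illustrative only: it assumes the percent identities can be treated as the fraction of matching aligned residues, and it applies the standard Poisson correction d = -ln(1 - p) as one possible way to account for repeated substitutions. The source does not state how its mutation-rate figure was produced, so this is not a reconstruction of that figure.

```python
import math

# (species, divergence from humans in MYA, percent identity to human C3orf67)
# Values copied from the ortholog and distant-homolog tables in the text.
HOMOLOGS = [
    ("Orbicella faveolata", 824.0, 32.20),
    ("Exaiptasia pallida", 824.0, 32.00),
    ("Acanthaster planci", 684.0, 31.60),
    ("Stylophora pistillata", 824.0, 30.80),
    ("Crassostrea gigas", 797.0, 29.50),
    ("Lingula anatina", 797.0, 29.30),
    ("Octopus bimaculoides", 797.0, 29.10),
    ("Saccoglossus kowalevskii", 684.0, 23.30),
    ("Amphimedon queenslandica", 951.8, 22.70),
    ("Trichinella spiralis", 797.0, 12.60),
    ("Spizellomyces punctatus", 1105.0, 8.20),
    ("Selaginella moellendorffii", 1496.0, 6.00),
]


def poisson_distance(identity_pct: float) -> float:
    """Poisson-corrected distance d = -ln(1 - p), p = fraction of differing sites."""
    p = 1.0 - identity_pct / 100.0
    return -math.log(1.0 - p)


# List homologs from most recent to most ancient divergence.
for species, mya, identity in sorted(HOMOLOGS, key=lambda row: row[1]):
    observed_difference = 100.0 - identity
    corrected = poisson_distance(identity)
    print(f"{species:<28} {mya:>7.1f} MYA  "
          f"{observed_difference:5.1f}% differing  d = {corrected:.2f}")
```

The corrected distance grows faster than the raw percent difference at low identity, which is why a correction of this kind is often preferred when comparing very distant homologs.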

Expression

Promoter

The promoter is well conserved across humans, gibbons, baboons, orangutans, cats, squirrels, alpacas, rabbits, and mice. [22] It contains several high-quality transcription factor binding sites. [23] Several stem-loop structures are also predicted to form in the promoter region, some of which overlap with transcription factor binding sites. [24]

General tissue expression pattern of C3orf67.

Expression

C3orf67 is prominently expressed in the liver, tonsils, trachea, ovaries, testes, placenta, and colon. In other tissues it is expressed at low levels. [25] An increase in expression has been linked to small cell lung cancer. [26]

Function

C3orf67 has been identified as one of 17 genes that may play a novel role at the intersection of tumor promotion and DNA-damaging stress, and it may be linked to carcinogenesis. [27]

Interacting Proteins

Transcription factors

There are three notable transcription factors that are known to be involved in the regulation of cell growth or immune responses:

Other interacting proteins

Several other proteins have been predicted to interact with C3orf67:


References

  1. GRCh38: Ensembl release 89: ENSG00000163689 - Ensembl, May 2017
  2. GRCm38: Ensembl release 89: ENSMUSG00000021747 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "NCBI Gene". National Center for Biotechnology Information.
  6. "C3orf67". GeneCards Human Gene Database.
  7. "NCBI Nucleotide". National Center for Biotechnology Information. May 2019.
  8. "NCBI Protein". National Center for Biotechnology Information.
  9. "Protein Molecular Weight Calculator". www.sciencegateway.org. Retrieved 2018-02-25.
  10. "MOTIF: Searching Protein Sequence Motifs". www.genome.jp. Retrieved 2018-02-25.
  11. "DictyOGlyc 1.1 Server". www.cbs.dtu.dk. Retrieved 2018-04-30.
  12. "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Retrieved 2018-04-30.
  13. "ExPASy - Sulfinator tool". web.expasy.org. Retrieved 2018-04-30.
  14. "SUMOplot™ Analysis Program | Abgent". www.abgent.com. Retrieved 2018-04-30.
  15. "C3orf67 (human)". www.phosphosite.org. Retrieved 2018-04-30.
  16. "NetOGlyc 4.0 Server". www.cbs.dtu.dk. Retrieved 2018-04-30.
  17. "YinOYang 1.2 Server". www.cbs.dtu.dk. Retrieved 2018-04-30.
  18. "NetPhos 3.1 Server". www.cbs.dtu.dk. Retrieved 2018-04-30.
  19. "JPred: A Protein Secondary Structure Prediction Server". www.compbio.dundee.ac.uk. Retrieved 2018-04-24.
  20. Kelley, Lawrence. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2018-04-24.
  21. "SWISS-MODEL | Workspace". swissmodel.expasy.org. Retrieved 2018-04-24.
  22. "Human BLAT Search". genome.ucsc.edu. Retrieved 2018-04-24.
  23. "Genomatix: Matrix Library information". www.genomatix.de. Retrieved 2018-04-24.
  24. "RNA Folding Form | mfold.rit.albany.edu". unafold.rna.albany.edu. Retrieved 2018-05-02.
  25. Dezso Z, Nikolsky Y, Sviridov E, Shi W, Serebriyskaya T, Dosymbekov D, Bugrim A, Rakhmatulin E, Brennan RJ, Guryanov A, Li K, Blake J, Samaha RR, Nikolskaya T (November 2008). "A comprehensive functional analysis of tissue specificity of human gene expression". BMC Biology. 6: 49. doi:10.1186/1741-7007-6-49. PMC 2645369. PMID 19014478.
  26. Sato T, Kaneda A, Tsuji S, Isagawa T, Yamamoto S, Fujita T, Yamanaka R, Tanaka Y, Nukiwa T, Marquez VE, Ishikawa Y, Ichinose M, Aburatani H (2013-05-29). "PRC2 overexpression and PRC2-target gene repression relating to poorer prognosis in small cell lung cancer". Scientific Reports. 3 (1): 1911. Bibcode:2013NatSR...3E1911S. doi:10.1038/srep01911. PMC 3665955. PMID 23714854.
  27. Glover KP, Chen Z, Markell LK, Han X (2 October 2015). "Synergistic Gene Expression Signature Observed in TK6 Cells upon Co-Exposure to UVC-Irradiation and Protein Kinase C-Activating Tumor Promoters". PLOS ONE. 10 (10): e0139850. Bibcode:2015PLoSO..1039850G. doi:10.1371/journal.pone.0139850. PMC 4592187. PMID 26431317.
  28. Zawel L, Dai JL, Buckhaults P, Zhou S, Kinzler KW, Vogelstein B, Kern SE (March 1998). "Human Smad3 and Smad4 are sequence-specific transcription activators". Molecular Cell. 1 (4): 611–7. doi:10.1016/S1097-2765(00)80061-1. PMID 9660945.
  29. "Genomatix: Matrix Library information". www.genomatix.de. Retrieved 2018-04-24.
  30. Treiber T, Mandel EM, Pott S, Györy I, Firner S, Liu ET, Grosschedl R (May 2010). "Early B cell factor 1 regulates B cell gene networks by activation, repression, and transcription-independent poising of chromatin". Immunity. 32 (5): 714–25. doi:10.1016/j.immuni.2010.04.013. PMID 20451411.
  31. "Genomatix: Matrix Library information". www.genomatix.de. Retrieved 2018-04-24.
  32. Molnár A, Georgopoulos K (December 1994). "The Ikaros gene encodes a family of functionally diverse zinc finger DNA-binding proteins". Molecular and Cellular Biology. 14 (12): 8292–303. doi:10.1128/MCB.14.12.8292. PMC 359368. PMID 7969165.
  33. Lipp JJ, Marvin MC, Shokat KM, Guthrie C (August 2015). "SR protein kinases promote splicing of nonconsensus introns". Nature Structural & Molecular Biology. 22 (8): 611–7. doi:10.1038/nsmb.3057. PMID 26167880. S2CID 24363149.
  34. "Antibodypedia - CLK1 antibodies". www.antibodypedia.com. Retrieved 2018-05-01.
  35. Mikolcevic P, Sigl R, Rauch V, Hess MW, Pfaller K, Barisic M, Pelliniemi LJ, Boesl M, Geley S (February 2012). "Cyclin-dependent kinase 16/PCTAIRE kinase 1 is activated by cyclin Y and is essential for spermatogenesis". Molecular and Cellular Biology. 32 (4): 868–79. doi:10.1128/MCB.06261-11. PMC 3272973. PMID 22184064.
  36. "Antibodypedia - CDK16 antibodies". www.antibodypedia.com. Retrieved 2018-05-01.
  37. van Meel E, Wegner DJ, Cliften P, Willing MC, White FV, Kornfeld S, Cole FS (October 2013). "Rare recessive loss-of-function methionyl-tRNA synthetase mutations presenting as a multi-organ phenotype". BMC Medical Genetics. 14: 106. doi:10.1186/1471-2350-14-106. PMC 3852179. PMID 24103465.
  38. "STRING: functional protein association networks". string-db.org. Retrieved 2018-05-01.
  39. Cornen S, Guille A, Adélaïde J, Addou-Klouche L, Finetti P, Saade MR, Manai M, Carbuccia N, Bekhouche I, Letessier A, Raynaud S, Charafe-Jauffret E, Jacquemier J, Spicuglia S, de The H, Viens P, Bertucci F, Birnbaum D, Chaffanet M (2014-01-09). "Candidate luminal B breast cancer genes identified by genome, gene expression and DNA methylation profiling". PLOS ONE. 9 (1): e81843. Bibcode:2014PLoSO...981843C. doi:10.1371/journal.pone.0081843. PMC 3886975. PMID 24416132.