C3orf70

Last updated

C3orf70 also known as Chromosome 3 Open Reading Frame 70, is a 250aa protein in humans that is encoded by the C3orf70 gene. The protein encoded is predicted to be a nuclear protein; however, its exact function is currently unknown. [1] C3orf70 can be identified with known aliases: Chromosome 3 Open Reading Frame 70, AK091454, UPF0524, and LOC285382. [1] [2]

Contents

C3orf70
Identifiers
Aliases C3orf70 , chromosome 3 open reading frame 70
External IDs MGI: 1919440 HomoloGene: 78167 GeneCards: C3orf70
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001025266

NM_001001881

RefSeq (protein)

NP_001020437

NP_001001881

Location (UCSC) Chr 3: 185.08 – 185.15 Mb Chr 16: 21.47 – 21.51 Mb
PubMed search [5] [6]
Wikidata
View/Edit Human View/Edit Mouse

Gene

In humans, C3orf70 is located on the reverse strand of Chromosome 3 at 3q27.2 (see Figure 1). [7] This identifies its location starting 184,795,838 base pairs and ending 184,870,802 base pairs from PTER, the terminus of the short arm, on chromosome 3. C3orf70 spans 74,964 bases containing two exons and two introns.

mRNA

The transcribed mRNA is a 5,901 base pair transcript. C3orf70 consists of one known splice variant with two exons of 388 base pairs and 5,512 base pairs respectively (see Figure 2); location of junction occurs at 67aa[C]. [1] A single 5’ cap and three possible 3’ polyadenylation signals have been identified. [1]

Protein

Composition

The translated protein is a 250 amino acid product. The precursor protein has been predicted with a molecular weight of 27.8kdal and an isoelectric point of 4.67. [8] With 33 serines and 8 glycines, the C3orf70 protein is both Serine rich and Glycine poor. [8]

Domains

C3orf70 protein has no known signal peptides or domains.

Homology

C3orf70 has no known paralogs in humans; however C3orf70 has conserved homologs, see Figure 3. Highly conserved across species excluding invertebrates, plants, fungi, and bacteria, C3orf70 shows a moderate rate of evolution, see Figure 4 and 5. [9]

Regulation

Transcription

Promoter

There is only one known promoter predicted by Genomatix for the C3orf70 protein located on the minus strand of chromosome 3 at location 184870702-184871302bp, therefore identified as 600bp in length. [10] High mammalian conservation was observed for the identified promoter sequence.

Transcription factors

Figure 6: Generated list of a selection of transcription factor binding sites in the C3orf70 promoter Transcription Factor Binding Sites in C3orf70 Promoter.png
Figure 6: Generated list of a selection of transcription factor binding sites in the C3orf70 promoter

Through the use of Genomatix, a table was generated of the top 20 transcription factors and their binding sites in the C3orf70 promoter (see Figure 6). [10]

Translation

Post-translational modifications

Utilizing NetPhos, a total of 25 phosphorylation sites have been predicted (20 Serines, 3 Threonines, and 2 Tyrosines) which occur throughout the protein indicating an intracellular localization. [11] Figure 7 pinpoints the location of the 25 potential phosphorylation sites. Additionally, two N-myrisolation sites were predicted at amino acid position 40-45 and 210-215 indicating a possible N-terminus and C-terminus membrane anchor region. [12] There are also 28 potential missense mutations in the human C3orf70. [13]

Subcellular localization

PSORT II indicates the subcellular localization of C3orf70 is in the nucleus. [14] In addition to this, following SDSC's Biology Workbench's SAPS kNN-Prediction, the C3orf70 protein for humans has a 60.9% likelihood to end up in the nuclear region of a cell, as determined by the amino acid make-up of C3orf70. [8] Homologs including chimp, mouse, alligator, and zebrafish conclude the same nuclear region with a >60% likelihood. [14] A nuclear localization site has not been identified in the C3orf70 sequence.

Expression

From Unigene's EST cDNA tissue abundance display (see Figure 8), C3orf70 is non-ubiquitously expressed and has relatively low expression levels with slightly higher expression levels seen in the brain. [15] Also, microarray data profile GDS426 (see Figure 9) showing the expression of C3orf70 across normal tissues displays a notably high presence in the brain, spinal cord, and prostate tissue. [16]

Function and further reading

The function of C3orf70 is unknown. It is suggested to be a nuclear protein that plays a role in neurological development. Additional avenues of research pertaining to the C3orf70 gene include:

There is a patent that identified genes associated with midbrain dopamine neurons for engraftment by looking at the differentiation of hESC and/or hiPSC in floor plate midbrain progenitor cells. C3orf70 was found to have a fold-change of 2.45, which was not determined significant in experimentation [17]

A publication was discovered through multiple sources that linked the C3orf70 gene to a “Genome-wide association study of major depressive disorder”. [18]

A microdeletion has been identified from 3q26.33-3q27.2. [19] Mandrille et al. associates this discovered microdeletion with a possible clinical syndrome characterized by clinical features related to brain development.

Related Research Articles

<span class="mw-page-title-main">C11orf49</span> Protein-coding gene in the species Homo sapiens

C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.

<span class="mw-page-title-main">C1orf21</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C1orf21, also known as Proliferation-Inducing Protein 13, is a protein that in humans is encoded by the C1orf21 gene. C1orf21 is an intracellular protein that flows between the nucleus and the cytoplasm in the cell. It has been linked with cell growth and reproduction and there has been strong links with various types of cancers. There are no paralogs for this gene, however, many conserved orthologs have been found in all invertebrates. C1orf21 has low to moderate level of expression in most tissues in humans, however, it has the most expression in the skin, lung and prostate.

<span class="mw-page-title-main">TMEM242</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 242 (TMEM242) is a protein that in humans is encoded by the TMEM242 gene. The tmem242 gene is located on chromosome 6, on the long arm, in band 2 section 5.3. This protein is also commonly called C6orf35, BM033, and UPF0463 Transmembrane Protein C6orf35. The tmem242 gene is 35,238 base pairs long, and the protein is 141 amino acids in length. The tmem242 gene contains 4 exons. The function of this protein is not well understood by the scientific community. This protein contains a DUF1358 domain.

<span class="mw-page-title-main">CCDC94</span> Protein-coding gene in the species Homo sapiens

Coiled-coil domain containing 94 (CCDC94), is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.

<span class="mw-page-title-main">CFAP206</span> Protein-coding gene in the species Homo sapiens

Cilia And Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs (trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38

<span class="mw-page-title-main">CXorf66</span> Human protein

CXorf66 also known as Chromosome X Open Reading Frame 66, is a 361aa protein in humans that is encoded by the CXorf66 gene. The protein encoded is predicted to be a type 1 transmembrane protein; however, its exact function is currently unknown. CXorf66 has one alias: RP11-35F15.2.

C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).

<span class="mw-page-title-main">C11orf86</span> Protein-coding gene in the species Homo sapiens

Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C2orf73</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">SMCO3</span> Protein-coding gene in the species Homo sapiens

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

<span class="mw-page-title-main">C1orf185</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.

<span class="mw-page-title-main">C5orf46</span> Protein-coding gene in the species Homo sapiens

C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.

<span class="mw-page-title-main">C16orf90</span> Protein-coding gene in the species Homo sapiens

C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">C6orf136</span> Protein-coding gene in the species Homo sapiens

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

<span class="mw-page-title-main">CCDC190</span> Protein-coding gene in the species Homo sapiens

Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.

<span class="mw-page-title-main">C3orf38</span> An article about the uncharacterized gene C3orf38.

Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.

References

  1. 1 2 3 4 NCBI AceView. "Homo sapiens gene C3orf70, encoding chromosome 3 open reading frame 70" . Retrieved 2015-04-10.
  2. NCBI. "C3orf70 chromosome 3 open reading frame 70 [ Homo sapiens (human) ]" . Retrieved 2015-04-10.
  3. 1 2 3 GRCh38: Ensembl release 89: ENSG00000187068 - Ensembl, May 2017
  4. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000043391 - Ensembl, May 2017
  5. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  6. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  7. "NCBI C3orf70 protein". Conserved Domain Database. National Center for Biotechnology Information. Retrieved 2015-04-18.
  8. 1 2 3 Department of Bioengineering. "SDSC Biology Workbench". University of California, San Diego. Retrieved 2015-04-18.
  9. "BLAST: Basic Local Alignment Search Tool". Conserved Domain Database. National Center for Biotechnology Information. Retrieved 2015-04-18.
  10. 1 2 Genomatix Software. "Genomatix ElDorado" . Retrieved 2015-04-18.
  11. Blom, N.; Gammeltoft, S.; Brunak, S. (1999). "Sequence- and structure-based prediction of eukaryotic protein phosphorylation sites". Journal of Molecular Biology. 294 (5): 1351–62. doi:10.1006/jmbi.1999.3310. PMID   10600390 . Retrieved 2015-04-19.
  12. Sigrist CJ, Cerutti L, de Castro E, Langendijk-Genevaux PS, Bulliard V, Bairoch A, Hulo N. "a protein domain database for functional characterization and annotation". Nucleic Acids Res. Retrieved 2015-04-19.
  13. "dbSNP". Conserved Domain Database. National Center for Biotechnology Information. Retrieved 2015-03-11.
  14. 1 2 Paul Horton. "PSORT II". Psort.hgc.jp. Retrieved 2015-04-19.
  15. "Unigene". Conserved Domain Database. National Center for Biotechnology Information. Retrieved 2015-04-19.
  16. "C3orf70 –Normal human tissue expression profiling". NCBI GEO Profile. National Center for Biotechnology Information. Retrieved 2015-04-19.
  17. Lorenz Studer; Jae-won Shim; Sonja Kriks. "Midbrain Dopamine (DA) Neurons for Engraftment" . Retrieved 2015-04-18.
  18. Wray NR, Pergadia ML, Blackwood DH, Penninx BW, Gordon SD, Nyholt DR, Ripke S, MacIntyre DJ, McGhee KA, Maclean AW, Smit JH, Hottenga JJ, Willemsen G, Middeldorp CM, de Geus EJ, Lewis CM, McGuffin P, Hickie IB, van den Oord EJ, Liu JZ, Macgregor S, McEvoy BP, Byrne EM, Medland SE, Statham DJ, Henders AK, Heath AC, Montgomery GW, Martin NG, Boomsma DI, Madden PA, Sullivan PF (2012). "Genome-wide association study of major depressive disorder: new results, meta-analysis, and lessons learned" (PDF). Mol Psychiatry. Molecular Psychiatry. 17 (1): 36–48. doi:10.1038/mp.2010.109. PMC   3252611 . PMID   21042317.
  19. Mandrile G, Dubois A, Hoffman JD, Uliana V, Di Maria E, Malacarne M, Coviello D, Faravelli F, Zwolinski S, Hellens S, Wright M, Forzano F (2014). "3q26.33-3q27.2 microdeletion: A new microdeletion syndrome?". Eur J Med Genet. European Journal of Medical Genetics. 57 (2–3): 76–80. doi:10.1016/j.ejmg.2013.12.007. PMID   24462885.