This article provides insufficient context for those unfamiliar with the subject.(April 2015) |
C3orf70 also known as Chromosome 3 Open Reading Frame 70, is a 250aa protein in humans that is encoded by the C3orf70 gene. The protein encoded is predicted to be a nuclear protein; however, its exact function is currently unknown. [1] C3orf70 can be identified with known aliases: Chromosome 3 Open Reading Frame 70, AK091454, UPF0524, and LOC285382. [1] [2]
C3orf70 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C3orf70 , chromosome 3 open reading frame 70 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1919440 HomoloGene: 78167 GeneCards: C3orf70 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
In humans, C3orf70 is located on the reverse strand of Chromosome 3 at 3q27.2 (see Figure 1). [7] This identifies its location starting 184,795,838 base pairs and ending 184,870,802 base pairs from PTER, the terminus of the short arm, on chromosome 3. C3orf70 spans 74,964 bases containing two exons and two introns.
The transcribed mRNA is a 5,901 base pair transcript. C3orf70 consists of one known splice variant with two exons of 388 base pairs and 5,512 base pairs respectively (see Figure 2); location of junction occurs at 67aa[C]. [1] A single 5’ cap and three possible 3’ polyadenylation signals have been identified. [1]
The translated protein is a 250 amino acid product. The precursor protein has been predicted with a molecular weight of 27.8kdal and an isoelectric point of 4.67. [8] With 33 serines and 8 glycines, the C3orf70 protein is both Serine rich and Glycine poor. [8]
C3orf70 protein has no known signal peptides or domains.
C3orf70 has no known paralogs in humans; however C3orf70 has conserved homologs, see Figure 3. Highly conserved across species excluding invertebrates, plants, fungi, and bacteria, C3orf70 shows a moderate rate of evolution, see Figure 4 and 5. [9]
There is only one known promoter predicted by Genomatix for the C3orf70 protein located on the minus strand of chromosome 3 at location 184870702-184871302bp, therefore identified as 600bp in length. [10] High mammalian conservation was observed for the identified promoter sequence.
Through the use of Genomatix, a table was generated of the top 20 transcription factors and their binding sites in the C3orf70 promoter (see Figure 6). [10]
Utilizing NetPhos, a total of 25 phosphorylation sites have been predicted (20 Serines, 3 Threonines, and 2 Tyrosines) which occur throughout the protein indicating an intracellular localization. [11] Figure 7 pinpoints the location of the 25 potential phosphorylation sites. Additionally, two N-myrisolation sites were predicted at amino acid position 40-45 and 210-215 indicating a possible N-terminus and C-terminus membrane anchor region. [12] There are also 28 potential missense mutations in the human C3orf70. [13]
PSORT II indicates the subcellular localization of C3orf70 is in the nucleus. [14] In addition to this, following SDSC's Biology Workbench's SAPS kNN-Prediction, the C3orf70 protein for humans has a 60.9% likelihood to end up in the nuclear region of a cell, as determined by the amino acid make-up of C3orf70. [8] Homologs including chimp, mouse, alligator, and zebrafish conclude the same nuclear region with a >60% likelihood. [14] A nuclear localization site has not been identified in the C3orf70 sequence.
From Unigene's EST cDNA tissue abundance display (see Figure 8), C3orf70 is non-ubiquitously expressed and has relatively low expression levels with slightly higher expression levels seen in the brain. [15] Also, microarray data profile GDS426 (see Figure 9) showing the expression of C3orf70 across normal tissues displays a notably high presence in the brain, spinal cord, and prostate tissue. [16]
The function of C3orf70 is unknown. It is suggested to be a nuclear protein that plays a role in neurological development. Additional avenues of research pertaining to the C3orf70 gene include:
There is a patent that identified genes associated with midbrain dopamine neurons for engraftment by looking at the differentiation of hESC and/or hiPSC in floor plate midbrain progenitor cells. C3orf70 was found to have a fold-change of 2.45, which was not determined significant in experimentation [17]
A publication was discovered through multiple sources that linked the C3orf70 gene to a “Genome-wide association study of major depressive disorder”. [18]
A microdeletion has been identified from 3q26.33-3q27.2. [19] Mandrille et al. associates this discovered microdeletion with a possible clinical syndrome characterized by clinical features related to brain development.
C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.
Uncharacterized protein C1orf21, also known as Proliferation-Inducing Protein 13, is a protein that in humans is encoded by the C1orf21 gene. C1orf21 is an intracellular protein that flows between the nucleus and the cytoplasm in the cell. It has been linked with cell growth and reproduction and there has been strong links with various types of cancers. There are no paralogs for this gene, however, many conserved orthologs have been found in all invertebrates. C1orf21 has low to moderate level of expression in most tissues in humans, however, it has the most expression in the skin, lung and prostate.
Transmembrane protein 242 (TMEM242) is a protein that in humans is encoded by the TMEM242 gene. The tmem242 gene is located on chromosome 6, on the long arm, in band 2 section 5.3. This protein is also commonly called C6orf35, BM033, and UPF0463 Transmembrane Protein C6orf35. The tmem242 gene is 35,238 base pairs long, and the protein is 141 amino acids in length. The tmem242 gene contains 4 exons. The function of this protein is not well understood by the scientific community. This protein contains a DUF1358 domain.
Coiled-coil domain containing 94 (CCDC94), is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.
Cilia And Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs (trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38
CXorf66 also known as Chromosome X Open Reading Frame 66, is a 361aa protein in humans that is encoded by the CXorf66 gene. The protein encoded is predicted to be a type 1 transmembrane protein; however, its exact function is currently unknown. CXorf66 has one alias: RP11-35F15.2.
C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.
C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.
Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.