C2orf80

Last updated
C2orf80
Identifiers
Aliases C2orf80 , GONDA1, chromosome 2 open reading frame 80
External IDs OMIM: 615536 MGI: 2138198 HomoloGene: 129674 GeneCards: C2orf80
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001099334

NM_001285881
NM_001285882
NM_175293

RefSeq (protein)

NP_001092804

NP_001272810
NP_001272811
NP_780502

Location (UCSC) Chr 2: 208.17 – 208.19 Mb Chr 1: 65.11 – 65.12 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

C2orf80 (chromosome 2 open reading frame 80) is a protein that, in humans, is encoded by the c2orf80 gene. The gene c2orf80 also goes by the alias GONDA1 (gonad development associated 1). In humans, c2orf80 is exclusively expressed in the brain. While relatively little is known about the function of c2orf80, medical studies have shown a strong association between variations in c2orf80 and IDH-mutant gliomas, 46,XY gonadal dysgenesis, and a possible association with blood pressure.

Contents

Gene

The c2orf80 gene is located on the negative sense strand of chromosome 2 at locus 2q33.3, and has nine exons. [5] It spans from 208165347-2081902581. [5] There are six isoforms of c2orf80. [5]

Location of c2orf80 on chromosome 2 at locus 2q33.3 Location of c2orf80 on chromosome 2.png
Location of c2orf80 on chromosome 2 at locus 2q33.3
Chromosome 2 Ideogram human chromosome 2.svg
Chromosome 2
Isoforms of c2orf80 in Humans [5]
IsoformAccession NumberBase PairsAmino Acids
X1NM_001099334.31195193
X2XM_017004075.11098200
X3XM_017004076.11183193
X4XM_017004077.11043183
X5XM_024452880.11118176
X6XM_017004078.1578131

Neighbors

The loci near c2orf80 are protein encoding, and include genes IDH1 (histone deacetylase inhibitor 1), and CRYGB (crystallin gamma B). [5]

The IDH1 protein is the NADP(+)-dependent isocitrate dehydrogenase found in the cytoplasm and peroxisomes. It is involved in catalyzing the oxidative decarboxylation of isocitrate to 2-oxoglutarate. [6] Mutations in IDH have been associated strongly with gliomas, and it is a strong candidate for therapeutic targeting. [7]

The gene CRYGB, or crystallin gamma B, encodes a protein found in the lens and cornea of the eye, and have been involved in cataract formation. [6]

Gene expression

Within the cell, it is possible that c2orf80 is expressed in the nucleus, or the mitochondria. [8] An analysis via PSORT II concluded that there is a 47.8% probability that c2orf80 is located in the nucleus, and a 39.1% probability it is located in the mitochondria. [8] The c2orf80 protein has two nuclear localization signals that may play a role in directing the protein to the nucleus of the cell. [8]

C2orf80 is tissue specific, and expressed only in the brain. [5] A tumor specific RNA assessment via The Cancer Genome Atlas showed significant expression of c2orf80 in gliomas. [9] This reveals that c2orf80 is likely not functioning in neurons because gliomas are not present in neurons. Gene Paint explored expression of c2orf80 in infant mice, revealing concentrated expression in the frontal lobe. [10]

Regulation of expression

There are many transcription factors involved in the regulation of expression of c2orf80. Due to the tissue specific expression, only transcription factors with brain tissue specificity increase c2orf80 transcription. The promoter of c2orf80 X1 is GXP_9792289. [11]

Select c2orf80 Transcription Factors with Brain Tissue Expression
Transcription Factor [11] Function [11]
NKX Homeodomain transcription factorPlays a role in organ development
CLOX and CLOX homology (CDP) factorPlays a role in nervous system growth
Paralog hox genes 1-8 from the four hox clusters A, B, C, DPlay a role in central nervous system development
SOX/SRY-sex/testis determining and related HMG box factorPlays a role in sex determination

Protein

The protein product of c2orf80 is 193 amino acids in length with a predicted molecular weight of 22kDa and an isoelectric point of 9.5. [12] [13]

Post-translational modifications

Several different post-translation factors regulate the expression of the c2orf80 gene. There is a large disordered region spanning from amino acid 155-193 that is a possible site for phosphorylation from many kinases. [12] The phosphorylations with the highest confidence scores are all modified by the same kinases, AGC, PKC, PKCh, and PRKCE which are involved in cell growth and death, and protein function modification. [14] Other post-transcriptional modifications posited by EML are shown below. [15]

Affinity chromatography verified interaction between Protein Kinase D2 (PRKD2) and c2orf80. [16] The protein PRKD2 can bind to diacylglycerol (DAG) in the trans-Golgi network (TGN), and can be activated by phorbol esters as well as by gastrin via the cholecystokinin B receptor (CCKBR) in gastric cancer cells. [17]

The protein ZNF804A (zinc finger protein 804A) has been shown by the first genome-wide significant association with the broad phenotype of psychosis to be associated with schizophrenia and bipolar disorder. When ZNF804A was knocked out, c2orf80 expression was downregulated, indicating a regulatory relationship between the two. [18]

Select Post-Translational Modification Motifs of c2orf80
Motif [15] Function [15]
MAPK MAPK cascades are central signaling pathways that regulate a wide variety of stimulated cellular processes. Deregulation can lead to cancer.
SUMO-1Causes dramatic rearrangements of subcellular location. Three possible functions of sumoylation: (i) protein targeting, (ii) enhanced protein stability, (iii) transcriptional control
PP2BA Ca2+- and calmodulin-regulated serine/threonine protein that regulates Ca2+-dependent migration of neutrophils, synaptic plasticity, and apoptosis
Canonical Arg-Containing Phosphorylation14-3-3 proteins are involved in cellular processes such as signal transduction, cell-cycle control, apoptosis, stress response and malignant transformation. 14-3-3 proteins possess no catalytic activity so they perform functional tasks only binding and modulating the activity of their partner proteins

Structure

The secondary structure is predicted to consist mostly of alpha helices. [16] The predicted 3D structure via Alphafold is shown. [19] Various alpha helices and a disordered region can be seen.

3D structure of c2orf80 Alphafold 3D c2orf80 protein.png
3D structure of c2orf80

Function

The function of the c2orf80 protein is unknown. There are indications that it could be involved in sex determination, or cell division regulation. [20] [21]

Homology

C2orf80 is found in all vertebrates and was first seen 465 million years ago in sharks. [22] There are no observed paralogs for c2orf80 within the human genome. [23]

Orthologs

Corrected Amino Acid Changes per 100 Residues vs Million Years Since Divergence for Fibrinogen Alpha, c2orf80, and Cytochrome C Mutation rate of c2orf80.png
Corrected Amino Acid Changes per 100 Residues vs Million Years Since Divergence for Fibrinogen Alpha, c2orf80, and Cytochrome C

The similarity to the human sequence generally decreases as the divergence date from humans increases, but there are some fluctuations in this trend possibly due to a functionally unimportant, poorly conserved region. [22]

Orthologs of c2orf80
Genus, SpeciesCommon NameAccession Number [23] Sequence Length (amino acids) [23] Sequence Identity to Human c2orf80 (%) [22] Median Divergence Date (Million Years Ago) [24]
Homo sapiensHumanNP_001092804.21931000
Ictidomys tridecemlineatusThirteen lined ground squirrelXP_040150742.12107189
Mus musculusMouseNP_001272811.119662.989
Orycteropus afer aferAardvarkXP_007934115.120775.4102
Anolis carolinesisAnole lizardXP_003220055.117853318
Chelonia mydasGreen se turtleXP_037769043.121852.7318
Egretta garzettalittle egretXP_009635806.119851.2318
Alligator sinensisalligatorXP_006029205.123349.8318
Nothoprocta perdicariaChilean tinamouXP_025904208.121547.9318
Aptenodytes forsteriEmperor penguinXP_009287939.114240.8318
Bufo bufoCommon toadXP_040297412.118845.5351.7
Xenopus tropicalisWestern clawed frogXP_017953126.221043.5351.7
Rhinatrema bivittatumTwo-lined caecilianXP_029462232.123543.6351.7
Carcharodon carchariasGreat white sharkXP_041056463.120442.7465
Oreochromis aureusBlue tilapiaXP_039455604.121038.3433
Rhincodon typusWhale sharkXP_020371397.121337.1465
Astyanax mexicanusMexican tetraXP_022536111.120136.4433
Unrooted phylogenetic tree showing the evolution of c2orf80. Constricted via Phylogeny.fr. Mammals: Hsu (human), Mmu (mouse), Lca (River Otter), Oaf (Aardvark), Itr (Thirteen lined ground squirrel), Bta (cow) Birds: Ega (little egret), Afo (emperor penguin), Npe (chilean tinamou), Cla (lance tailed manakin), Fch (saker falcon) Reptiles: Aca (anole lizard), Cmy (green sea turtle), Asi (alligator) Amphibians: Rbi (two-lined caecilian), Bbu (common toad), Xtr (western clawed frog) Fish: Oau (blue tilapia), Ame (mexican tetra), Rty (whale shark), Cca (great white shark) Unrooted phylogenetic treen c2orf80.png
Unrooted phylogenetic tree showing the evolution of c2orf80. Constricted via Phylogeny.fr. Mammals: Hsu (human), Mmu (mouse), Lca (River Otter), Oaf (Aardvark), Itr (Thirteen lined ground squirrel), Bta (cow) Birds: Ega (little egret), Afo (emperor penguin), Npe (chilean tinamou), Cla (lance tailed manakin), Fch (saker falcon) Reptiles: Aca (anole lizard), Cmy (green sea turtle), Asi (alligator) Amphibians: Rbi (two-lined caecilian), Bbu (common toad), Xtr (western clawed frog) Fish: Oau (blue tilapia), Ame (mexican tetra), Rty (whale shark), Cca (great white shark)

Clinical significance

Published research studies that mention c2orf80 indicate it could be involved in a number of maladies. Variants in c2orf80 are associated with IDH-mutant gliomas, possibly due to the proximity of c2orf80 to IDH1 on chromosome 2. [20]

Siblings with 46X,Y gonadal dysgenesis, a disorder of sex development, had a deletion of 8 of the 9 c2orf80 exons, and an interstitial duplication of the SUPT3H gene. [21] Both mutations were inherited from the healthy mother. [21] There is no known interaction between these two genes as of now. [21]

Related Research Articles

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C17orf98</span> Protein-coding gene in the species Homo sapiens

C17orf98 is a protein which in humans is coded by the gene c17orf98. The protein is derived from Homo sapiens chromosome 17. The C17orf98 gene consists of a 6,302 base sequence. Its mRNA has three exons and no alternative splice sites. The protein has 154 amino acids, with no abnormal amino acid levels. C17orf98 has a domain of unknown function (DUF4542) and is 17.6kDa in weight. C17orf98 does not belong to any other families nor does it have any isoforms. The protein has orthologs with high percent similarity in mammals and reptiles. The protein has additional distantly related orthologs across the metazoan kingdom, culminating with the sponge family.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">SMCO3</span> Protein-coding gene in the species Homo sapiens

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">C12orf24</span> Protein-coding gene in humans

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

RING Finger Protein 227, also known as RNF227 and LINC02581, is a protein which in humans is encoded by the RNF227 gene. According to DNA microarray data, it is found in at least 15 tissues.

C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C12orf29</span> Protein-coding gene in humans

C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.

<span class="mw-page-title-main">C3orf38</span> Uncharacterized gene

Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.

<span class="mw-page-title-main">TEDDM1</span> Protein-coding gene in the species Homo sapiens

Transmembrane epididymal protein 1 is a transmembrane protein encoded by the TEDDM1 gene. TEDDM1 is also commonly known as TMEM45C and encodes 273 amino acids that contains six alpha-helix transmembrane regions. The protein contains a 118 amino acid length family of unknown function. While the exact function of TEDDM1 is not understood, it is predicted to be an integral component of the plasma membrane.


<span class="mw-page-title-main">TMEM248</span> Transmembrane protein 248/TMEM248 gene

Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.

<span class="mw-page-title-main">FAM131A</span> Information on the FAM131A gene and the protein it encodes

FAM131A is a protein that is encoded by the FAM131A gene in humans. Aliases for FAM131A include C3orf40, FLAT715, and PRO1378.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

<span class="mw-page-title-main">ZFP62</span> Gene in Humans

Zinc Finger Protein 62, also known as "ZNF62," "ZNF755," or "ZET," is a protein that in humans is encoded by the ZFP62 gene. ZFP62 is part of the C2H2 Zinc Finger family of genes.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000188674 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000044816 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 4 5 6 7 "C2orf80 chromosome 2 open reading frame 80 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-18.
  6. 1 2 "CRYGB crystallin gamma B [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-18.
  7. Fang ZH, Pausch H (September 2019). "Multi-trait meta-analyses reveal 25 quantitative trait loci for economically important traits in Brown Swiss cattle". BMC Genomics. 20 (1): 695. doi: 10.1186/s12864-019-6066-6 . PMC   6724290 . PMID   31481029.
  8. 1 2 3 "PSORT II Prediction". psort.hgc.jp. Retrieved 2021-12-18.
  9. "The Cancer Genome Atlas Program - National Cancer Institute". www.cancer.gov. 2018-06-13. Retrieved 2021-12-18.
  10. "Genepaint - Home of High Resolution Gene Expression Data". gp3.mpg.de. Retrieved 2021-12-18.
  11. 1 2 3 "Genomatix". Archived from the original on 2001-02-24.
  12. 1 2 "uncharacterized protein C2orf80 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-18.
  13. "SIB Swiss Institute of Bioinformatics | Expasy". www.expasy.org. Retrieved 2021-12-18.
  14. "GPS 5.0 - Kinase-specific Phosphorylation Site Prediction". gps.biocuckoo.cn. Retrieved 2021-12-18.
  15. 1 2 3 "ELM - Search the ELM resource". elm.eu.org. Retrieved 2021-12-18.
  16. 1 2 "Bioinformatics Toolkit". toolkit.tuebingen.mpg.de. Retrieved 2021-12-18.
  17. "PRKD2 protein kinase D2 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-18.
  18. Hill MJ, Jeffries AR, Dobson RJ, Price J, Bray NJ (March 2012). "Knockdown of the psychosis susceptibility gene ZNF804A alters expression of genes involved in cell adhesion". Human Molecular Genetics. 21 (5): 1018–1024. doi: 10.1093/hmg/ddr532 . PMID   22080834.
  19. "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2021-12-18.
  20. 1 2 Wu WY, Johansson G, Wibom C, Brännström T, Malmström A, Henriksson R, et al. (December 2019). "The Genetic Architecture of Gliomagenesis-Genetic Risk Variants Linked to Specific Molecular Subtypes". Cancers. 11 (12): 2001. doi: 10.3390/cancers11122001 . PMC   6966482 . PMID   31842352.
  21. 1 2 3 4 Norling A, Lindén Hirschberg A, Iwarsson E, Persson B, Wedell A, Barbaro M (December 2013). "Novel candidate genes for 46,XY gonadal dysgenesis identified by a customized 1 M array-CGH platform". European Journal of Medical Genetics. 56 (12): 661–668. doi: 10.1016/j.ejmg.2013.09.003 . PMID   24055526.
  22. 1 2 3 "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2021-12-18.
  23. 1 2 3 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2021-12-18.
  24. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2021-12-18.