C6orf222

Last updated October 31, 2025

C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31).^[1]^[2] C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.^[3]^[4]

Gene

C6orf222 is located on the negative DNA strand of the short arm of chromosome 6 at locus 21.31. The gene is 21,129 base pairs long and extends from base pair 36315757 to 36336977. The gene produces a single transcript that is 3,750 base pairs long.^[5] There is a unique upstream, in-frame stop codon in the 5'UTR region from base pairs 86-88 of the mRNA.^[6]

Gene neighborhood

The gene PNPLA1 is found on the positive strand just upstream of C6orf222 and belongs to the patatin-like phospholipase family. PNPLA1 extends from base pairs 36239766 to 36313955.^[7]ETV7 is found downstream on the negative strand and extends from base pairs 36354221 to 36387800.^[8] There are two genes located upstream (LOC105375036) and downstream (LOC105375037) of C6orf222 on the negative and positive strand, respectively. These genes code for Non-coding RNA.^[9]^[10]

Gene expression

There are several lines of evidence that suggest the C6orf222 protein is expressed non-ubiquitously in select tissue locations at low levels, including the ascites, bladder, intestine, testes, vagina, lymph nodes and skin. The ascites and bladder displayed the strongest expression with 25 and 66 transcripts per million, respectively.^[11]^[12]

Promoter

The promoter region for C6orf222 is predicted to exist between 36304561 and 36305162 and has a length of 602 base pairs.^[13]

Protein

C6orf222 consists of 652 amino acids, has a weight of 71.9 kDa and an isoelectric point of 8.70 in humans.^[1]^[14]^[15]

Function

While the exact function of C6orf222 remains unknown, there has been evidence for it being involved in apoptosis as a pro-apoptotic protein due to the conserved BH3 interacting domain death agonist located at the C-terminus of the protein between amino acid residues 535 to 611 in humans.^[3]^[4]

Structure

C6orf222 contains a single domain of unknown function, DUF3292, found between amino acid residues 38-110. The protein is proline rich but asparagine and tyrosine poor. The charge distribution of the protein is fairly equal in that there are no positive or negative charge clusters sequestered within the protein.^[14]

A schematic representation of the conserved domains, nuclear localization signal, and phosphorylated amino acid residues in human C6orf222. The red diamond markers indicate conserved phosphoserine residues and they grey diamond markers indicate conserved phosphothreonine residues.

The secondary structures of the protein were predicted via multiple bioinformatics servers. All four programs predicted extensive disorder in the protein at the N-terminus, ranging from 79% to 88% of the protein. This extent of disorder is consistent with the amino acid composition of the protein and being proline rich. There were several α-helix structures predicted at the C-terminus of the protein and conserved in orthologs of the human protein.^[14]^[17]^[18]^[19] The protein was predominantly soluble, with a majority of the amino acid residues being exposed.15 The average of hydrophobicity for c6orf222 was -0.968, indicating the soluble nature of the protein.^[20]^[21]

Post-translational modifications

There is extensive, predicted phosphorylation of C6orf222, with 42 phosphoserines and 7 phosphothreonines being conserved in orthologs of the human C6orf222 protein. These results implicate C6orf222 as being a phosphoprotein. The protein contains only one nuclear export signal residue, found at 275-L; however, the NES score was rather low at 0.672. Structural analysis of the protein to be sequestered in the nucleus with a 95% probability.^[22] This prediction is supported by the presence of a nuclear localization sequence (NLS) found between residues 142–151.^[21]^[23]

Protein interactions

Although the bioinformatics programs MINT, STRING and Gene Cards did not reveal any protein interactions with C6orf222.^[24]^[25]^[26] A predicted BH3 domain in the C6orf222 protein was found to interact with both Bcl-2 and Bcl-xL, implicating C6orf222 as being involved in apoptosis.^[4]

Table of Potential Transcription Factor Binding Sites in the Predicted C6orf222 Promoter:^[13]

Transcription factor	Start	End	Strand	Matrix Score	Sequence
Myeloid zinc finger protein MZF1	115	125	-	1.000	gtGGGGaggga
Heat shock factor 2	91	115	-	0.953	aggtacctagaaAGAAaggtcagcg
NF-κβ	106	120	-	0.894	gaGGGAggtacctag
GATA-binding factor 2	133	145	+	0.958	caaaGATAactga
Pleomorphic adenoma gene 1	168	190	-	1.000	taGGGGgagaaagtcgaggtggc
TF-yin yang 2	196	218	+	0.839	catttcCCATtaagtcttgtttt
RBPJ-κ	196	208	-	0.951	ttaaTGGGaaatg
Human acute myelogenous leukemia factors	281	295	-	0.834	actGGGGtttgggt
Smad3 TF involved in TGF-β signaling	244	254	-	0.995	aagGTCTggct

Homology and evolution

Orthologous space

82 organisms have predicted orthologs with C6orf222.^[2] The most distant ortholog is the green sea turtle, which diverged from humans 296 million years ago, indicating C6orf222 developed in reptiles and birds.^[3]^[27]

Table of C6orf222 Orthologs:^[3]

Scientific name	Common name	Divergence Date from Humans (MYA) ^[28]	NCBI Protein Accession	Protein Length (amino acids)	Sequence Similarity (%)
Homo sapiens	Human	0	NP_001010903.3	652	100
Gorilla gorilla	Gorilla	8.8	XP_004043942.1	652	97
Callithrix jacchus	Common marmoset	42.6	XP_002746521.2	678	81
Camelus ferus	Bactrian camel	94.2	XP_006195372	658	62
Physeter catodon	Sperm whale	94.2	XP_007115001.1	638	60
Orcinus orca	Orca	94.2	XP_004267852.1	639	60
Canis lupus familiaris	Dog	94.2	XP_850456	662	58
Mustela putorius furo	Ferret	94.2	XP_004770808	664	57
Oryctolagus cuniculus	Rabbit	92.3	XP_008261475	716	49
Mus musculus	Mouse	92.3	NP_766038.1	669	47
Haliaeetus leucocephalus	Bald eagle	296	XP_010583084.0	407	28
Fulmarus glacialis	Northern fulmar	296	XP_009570924.1	435	28
Chelonia mydas	Green sea turtle	296	XP_007063595.1	387	33

Paralogous space

There are no predicted paralogs for C6orf222 in both humans and mice.^[3]

Conserved regions

Multiple sequence alignments demonstrated amino acid reside conservation throughout the C6orf222 protein in a variety of orthologs, with the most extensive conservation being found at the C-terminus of the protein. The BH3 interacting-domain death agonist (BID) was found to be conserved at the C-terminus in a multiple sequence alignment in both strict and distant orthologs.^[27] The equal distribution of positively and negatively charged amino acid residues is found to be conserved throughout orthologs of the human C6orf222 protein.^[22]

Evolution

C6orf222 is a fast evolving gene, similar in nature to Fibrinogen alpha chain FGA. This evolutionary trend is in contrast to Cytochrome c, which has been shown to be a slow evolving gene.^[28]

References

1 2 NCBI Protein. "uncharacterized protein c6orf222". NCBI. Retrieved 23 January 2015.
1 2 NCBI Gene. "C6orf222 chromosome 6 open reading frame 222 [ Homo sapiens (human) ]". NCBI. Retrieved 23 January 2015.
1 2 3 4 5 NCBI BLAST. "NP_001010903:uncharacterized protein C6orf222". NCBI. Retrieved 30 January 2015.
1 2 3 DeBartolo, Joe; Taipale, Mikko; Keating, Amy (June 26, 2014). "Genome-Wide Prediction and Validation of Peptides That Bind Human Prosurvival Bcl-2 Proteins". PLOS Computational Biology. 10 (6) e1003693. Bibcode:2014PLSCB..10E3693D. doi: 10.1371/journal.pcbi.1003693 . PMC 4072508 . PMID 24967846.
↑ Ensembl. "Gene: C6orf222". Ensembl. Retrieved 10 February 2015.
↑ NCBI Nucleotide. "Homo sapiens chromosome 6 open reading frame 222 (C6orf222), mRNA". NCBI. Retrieved 23 January 2015.
↑ NCBI Gene. "PNPLA1 patatin-like phospholipase domain containing 1 [ Homo sapiens (human) ]". NCBI. Retrieved 30 April 2015.
↑ NCBI Gene. "ETV7 ets variant 7 [ Homo sapiens (human)]". NCBI. Retrieved 30 April 2015.
↑ NCBI Gene. "LOC105375036 uncharacterized LOC105375036 [ Homo sapiens (human) ]". NCBI. Retrieved 30 April 2015.
↑ NCBI Gene. "LOC105375037 uncharacterized LOC105375037 [ Homo sapiens (human) ]". NCBI. Retrieved 30 April 2015.
↑ NCBI UniGene. "EST Profile: C6orf222". NCBI. Retrieved 25 February 2015.^{[ dead link ]}
↑ The Human Protein Atlas. "C6orf222" . Retrieved 28 April 2015.
1 2 El Dorado. "Annotation and Analysis: C6orf222". Genomatix. Retrieved 15 March 2015.
1 2 3 Subramaniam, Shankar. "SAPS". SDSC. Retrieved 11 February 2015.^{[ permanent dead link ]}
↑ ExPASy. "Compute pI/MW". ExPASy. Retrieved 12 March 2015.^{[ permanent dead link ]}
↑ "My Domains - Image Creator". My Domains. Prosite. 9 May 2015. Retrieved 9 May 2015.
↑ Kelley, Lawrence; Sternberg, Michael. "Phyre2". Structural Bioinformatics Group. Retrieved 12 April 2015.
↑ Källberg, Morten; Wang, Haipeng; Wang, Sheng; Peng, Jian; Wang, Zhiyong; Lu, Hui; Xu, Jinbo (2012). "Template-based protein structure modeling using the RaptorX web server". Nature Protocols. 7 (8): 1511–1522. doi:10.1038/nprot.2012.085. PMC 4730388 . PMID 22814390 . Retrieved 13 April 2015.
↑ Rost, B; Yachdav, G; Liu, J (2004). "The PredictProtein Server". Nucleic Acids Research. 32 (Web Server issue): 321–326. doi:10.1093/nar/gkh377. PMC 441515 . PMID 15215403 . Retrieved 12 April 2015.
↑ Gomi, M; Sonoyama, M; Mitaku, S (2004). "High performance system for signal peptide prediction: SOSUIsignal". Chem-Bio Informatics Journal. 4 (4): 142–147. doi: 10.1273/cbij.4.142 . Retrieved 12 April 2015.
1 2 Kosugi, S; Hasebe, M; Matsumura, N; Takashima, H; Miyamoto-Simo, E; Tomita, M; Yanagawa, H (2009). "Six classes of nuclear localization signals specific to different binding grooves of importin α." Journal of Biological Chemistry. 284 (1): 478–485. doi: 10.1074/jbc.M807017200 . PMID 19001369.
1 2 Nakai, Kenta; Horton, Paul. "PSORTII" . Retrieved 2 April 2015.^{[ permanent dead link ]}
↑ cNLS Mapper. "cNLS Mapper Result". Archived from the original on 10 August 2021. Retrieved 29 April 2015.
↑ Gene Cards. "Chromosome 6 Open Reading Frame 222". Gene Cards. Retrieved 10 February 2015.
↑ Licata, L.; Briganti, L.; Peluso, D.; Perfetto, L.; Iannuccelli, M.; Galeota, E.; Sacco, F.; Palma, A.; Nardozza, A. P.; Santonico, E.; Castagnoli, L.; Cesareni, G. (16 November 2011). "MINT, the molecular interaction database: 2012 update". Nucleic Acids Research. 40 (D1): D857 –D861. doi:10.1093/nar/gkr930. PMC 3244991 . PMID 22096227.
↑ Szklarczyk, D.; Franceschini, A.; Wyder, S.; Forslund, K.; Heller, D.; Huerta-Cepas, J.; Simonovic, M.; Roth, A.; Santos, A.; Tsafou, K. P.; Kuhn, M.; Bork, P.; Jensen, L. J.; von Mering, C. (28 October 2014). "STRING v10: protein-protein interaction networks, integrated over the tree of life". Nucleic Acids Research. 43 (D1): D447 –D452. doi:10.1093/nar/gku1003. PMC 4383874 . PMID 25352553.
1 2 Subramaniam, Shankar. "CLUSTALW". SDSC. Retrieved 1 May 2015.^{[ permanent dead link ]}
1 2 Kumar, S; Hedges, S; Dudley, J (2006). "TimeTree: a public knowledge-base of divergence times among organisms". Bioinformatics. 22 (23): 2971–2972. doi: 10.1093/bioinformatics/btl505 . PMID 17021158 . Retrieved 1 May 2015.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:1-1] 1 2 NCBI Protein. "uncharacterized protein c6orf222". NCBI. Retrieved 23 January 2015.

[:2-2] 1 2 NCBI Gene. "C6orf222 chromosome 6 open reading frame 222 [ Homo sapiens (human) ]". NCBI. Retrieved 23 January 2015.

[:3-3] 1 2 3 4 5 NCBI BLAST. "NP_001010903:uncharacterized protein C6orf222". NCBI. Retrieved 30 January 2015.

[:4-4] 1 2 3 DeBartolo, Joe; Taipale, Mikko; Keating, Amy (June 26, 2014). "Genome-Wide Prediction and Validation of Peptides That Bind Human Prosurvival Bcl-2 Proteins". PLOS Computational Biology. 10 (6) e1003693. Bibcode:2014PLSCB..10E3693D. doi: 10.1371/journal.pcbi.1003693 . PMC 4072508 . PMID 24967846.

[5] Ensembl. "Gene: C6orf222". Ensembl. Retrieved 10 February 2015.

[6] NCBI Nucleotide. "Homo sapiens chromosome 6 open reading frame 222 (C6orf222), mRNA". NCBI. Retrieved 23 January 2015.

[7] NCBI Gene. "PNPLA1 patatin-like phospholipase domain containing 1 [ Homo sapiens (human) ]". NCBI. Retrieved 30 April 2015.

[8] NCBI Gene. "ETV7 ets variant 7 [ Homo sapiens (human)]". NCBI. Retrieved 30 April 2015.

[9] NCBI Gene. "LOC105375036 uncharacterized LOC105375036 [ Homo sapiens (human) ]". NCBI. Retrieved 30 April 2015.

[10] NCBI Gene. "LOC105375037 uncharacterized LOC105375037 [ Homo sapiens (human) ]". NCBI. Retrieved 30 April 2015.

[11] NCBI UniGene. "EST Profile: C6orf222". NCBI. Retrieved 25 February 2015.^{[ dead link ]}

[12] The Human Protein Atlas. "C6orf222" . Retrieved 28 April 2015.

[:0-13] 1 2 El Dorado. "Annotation and Analysis: C6orf222". Genomatix. Retrieved 15 March 2015.

[:14-14] 1 2 3 Subramaniam, Shankar. "SAPS". SDSC. Retrieved 11 February 2015.^{[ permanent dead link ]}

[15] ExPASy. "Compute pI/MW". ExPASy. Retrieved 12 March 2015.^{[ permanent dead link ]}

[16] "My Domains - Image Creator". My Domains. Prosite. 9 May 2015. Retrieved 9 May 2015.

[17] Kelley, Lawrence; Sternberg, Michael. "Phyre2". Structural Bioinformatics Group. Retrieved 12 April 2015.

[18] Källberg, Morten; Wang, Haipeng; Wang, Sheng; Peng, Jian; Wang, Zhiyong; Lu, Hui; Xu, Jinbo (2012). "Template-based protein structure modeling using the RaptorX web server". Nature Protocols. 7 (8): 1511–1522. doi:10.1038/nprot.2012.085. PMC 4730388 . PMID 22814390 . Retrieved 13 April 2015.

[19] Rost, B; Yachdav, G; Liu, J (2004). "The PredictProtein Server". Nucleic Acids Research. 32 (Web Server issue): 321–326. doi:10.1093/nar/gkh377. PMC 441515 . PMID 15215403 . Retrieved 12 April 2015.

[20] Gomi, M; Sonoyama, M; Mitaku, S (2004). "High performance system for signal peptide prediction: SOSUIsignal". Chem-Bio Informatics Journal. 4 (4): 142–147. doi: 10.1273/cbij.4.142 . Retrieved 12 April 2015.

[:5-21] 1 2 Kosugi, S; Hasebe, M; Matsumura, N; Takashima, H; Miyamoto-Simo, E; Tomita, M; Yanagawa, H (2009). "Six classes of nuclear localization signals specific to different binding grooves of importin α." Journal of Biological Chemistry. 284 (1): 478–485. doi: 10.1074/jbc.M807017200 . PMID 19001369.

[:20-22] 1 2 Nakai, Kenta; Horton, Paul. "PSORTII" . Retrieved 2 April 2015.^{[ permanent dead link ]}

[23] NLS Mapper. "cNLS Mapper Result". Archived from the original on 10 August 2021. Retrieved 29 April 2015.

[24] Gene Cards. "Chromosome 6 Open Reading Frame 222". Gene Cards. Retrieved 10 February 2015.

[25] Licata, L.; Briganti, L.; Peluso, D.; Perfetto, L.; Iannuccelli, M.; Galeota, E.; Sacco, F.; Palma, A.; Nardozza, A. P.; Santonico, E.; Castagnoli, L.; Cesareni, G. (16 November 2011). "MINT, the molecular interaction database: 2012 update". Nucleic Acids Research. 40 (D1): D857 –D861. doi:10.1093/nar/gkr930. PMC 3244991 . PMID 22096227.

[26] Szklarczyk, D.; Franceschini, A.; Wyder, S.; Forslund, K.; Heller, D.; Huerta-Cepas, J.; Simonovic, M.; Roth, A.; Santos, A.; Tsafou, K. P.; Kuhn, M.; Bork, P.; Jensen, L. J.; von Mering, C. (28 October 2014). "STRING v10: protein-protein interaction networks, integrated over the tree of life". Nucleic Acids Research. 43 (D1): D447 –D452. doi:10.1093/nar/gku1003. PMC 4383874 . PMID 25352553.

[:27-27] 1 2 Subramaniam, Shankar. "CLUSTALW". SDSC. Retrieved 1 May 2015.^{[ permanent dead link ]}

[:28-28] 1 2 Kumar, S; Hedges, S; Dudley, J (2006). "TimeTree: a public knowledge-base of divergence times among organisms". Bioinformatics. 22 (23): 2971–2972. doi: 10.1093/bioinformatics/btl505 . PMID 17021158 . Retrieved 1 May 2015.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]