C22orf15

Last updated
C22orf15
Identifiers
Aliases C22orf15 , N27C7-3, chromosome 22 open reading frame 15
External IDs MGI: 2685713 HomoloGene: 122921 GeneCards: C22orf15
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_182520
NM_001331041
NM_001376903
NM_001376904
NM_001376905

Contents

NM_001037714
NM_001277132

RefSeq (protein)

NP_001317970
NP_872326
NP_001363832
NP_001363833
NP_001363834

NP_001264061

Location (UCSC) Chr 22: 23.76 – 23.77 Mb Chr 10: 75.77 – 75.78 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

C22orf15 (Chromosome 22 Open Reading Frame 15) is a protein which, in humans, is encoded by the C22orf15 gene.

Gene

The locus of C22orf15 in humans is on the long arm (q) of chromosome 22 in region 11, band 23 (22q11.23); it spans 3,340 base pairs on the plus strand from base pair 23,762,523 to 23,765,863 and contains ten introns and six exons. [5] [6]

Overview of C22orf15 gene transcript Overview of C22orf15 gene transcript.png
Overview of C22orf15 gene transcript
C22orf15 Gene Neighborhood C22orf15 Neighboring Genes.gif
C22orf15 Gene Neighborhood

Transcripts

C22orf15 is transcribed into 23 transcripts in Homo sapiens. [7]

C22orf15 Transcripts
TranscriptLength (base pairs)ProteinLength (amino acid)Isoform
XM_024452160.11818XP_024307928.1243X9
XM_011529908.21718XP_011528210.2210X13
XM_024452161.11796XP_024307929.1236X10
XM_017028612.11702XP_016884101.1205X14
NM_001376903.1881NP_001363832.11423
NM_001376904.1800NP_001363833.11154
NM_001331041.2900NP_001317970.11572
NM_001376905.1790NP_001363834.11135
NM_182520.3895NP_872326.21481
XM_017028605.12012XP_016884094.1280X5
XM_017028613.11558XP_016884102.1185X15
XM_017028602.12081XP_016884091.1303X2
XM_017028606.11818XP_016884095.1273X7
XM_017028608.11737XP_016884097.1246X8
XM_017028604.12021XP_016884093.1283X4
XM_024452158.12090XP_024307926.1306X1
XM_017028611.11658XP_016884100.1227X11
XM_011529907.21997XP_011528209.1275X6
XM_017028614.11543XP_016884103.1180X16
EXM_024452162.11537XP_024307930.1178X17
XM_024452159.12069XP_024307927.1299X3
XM_011529912.21838XP_011528214.1222X12
XM_017028618.1684XP_016884107.1105X20

Promoter

C22orf15 has two upstream promoter regions. GXP_9794292 is 1040 base pairs in length spanning from base pair 23,758,601 to 23,759,640 on chromosome 22, while GXP_6747563 is 1740 base pairs spanning from base pair 23,761,523 to 23,763,262. The transcription start region is located at base pair 23,762,523. [8]

Post-translational modifications

The post-translational modifications of C22orf15, which include phosphorylation, O-GlcNAcylation, and SUMOylation, are annotated on the conceptual translation.

Structure

C22orf15 in humans is 148 amino acids in length. The molecular weight of human C22orf15 is 17 kdal, and the isoelectric point is 9.0. C22orf15 does not have any unusually high or low amino acid compositions compared to other proteins in humans. [9]

C22orf15 Tertiary Structure C22orf15 Tertiary Structure.png
C22orf15 Tertiary Structure

The interleukin 2 receptor subunit gamma (II2rg) domain, which spans from amino acids 2-102 in C22orf15, facilitates the creation of the common gamma chain protein. This protein is a component of various receptors that aid in immune system function. These receptors direct and regulate the growth of lymphocytes, T cells, B cells, and natural killer cells, which promote immune response. [11] [12] [13]

Secondary

The protein structure prediction tool I-TASSER predicts C22orf15 to have both an alpha helical and beta strand structure; there are three disordered regions present in the protein. [14]

Tissue distribution

C22orf15 is ubiquitously expressed at a low level with some variability among tissues. C22orf15 is expressed highest in the adrenal glands, salivary glands, trachea, heart, spinal cord, and skeletal muscle tissues. [15] [16]

C22orf15 microarray-assessed tissue expression pattern C22orf15 microarray-assessed tissue expression pattern.png
C22orf15 microarray-assessed tissue expression pattern

In an anti-C22orf15 antibody produced in rabbits, immunochemical staining shows strong cytoplasmic positivity in the glandular cells of the human colon. [18]

Subcellular distribution

C22orf15 is localized in both the nucleus and cytoplasm. Antibody staining of glandular cells in the human colon shows strong cytoplasmic positivity. However, as seen on the conceptual translation, C22orf15 has conserved nuclear localization signals that would suggest nuclear localization as well. [19]

Homo Sapien C22orf15 Conceptual Translation Hsa c22orf15 Conceptual Translation PDF.pdf
Homo Sapien C22orf15 Conceptual Translation

Homology

Orthologous sequences to C22orf15 in humans are found in mammals, aves, reptiles, amphibians, and fishes as seen in the C22orf15 orthologs table. C22orf15 has no paralogs and is not present in invertebrates (Porifera, Platyhelminthes, Nematoda, Annelida, Echinodermata, Mollusca, Arthropoda), fungi, or bacteria; the gene appears to have originated in cartilaginous fishes approximately 473 million years ago. [20]

Table of C22orf15 Orthologs C22orf15 Orthologs Table.png
Table of C22orf15 Orthologs

Evolution

C22orf15 Evolutionary Rate C22orf15 Evolutionary History.png
C22orf15 Evolutionary Rate

The slope of the trendline for C22orf15 more closely resembles the slope of the trendline for fibrinogen alpha chain, which evolves rapidly, as opposed to the slope of the trendline for cytochrome c, which evolves slowly. This signifies that C22orf15 is evolving rapidly, but not as rapidly as fibrinogen alpha chain.

C22orf15 Phylogeny C22orf15 Phylogeny.pdf
C22orf15 Phylogeny
C22orf15 Phylogenetic Tree C22orf15 Phylogenetic Tree.pdf
C22orf15 Phylogenetic Tree

Clinical Significance

There appears to be a linkage between C22orf15 and both gastric and breast cancers. A copy number loss of DNA region 22q11.23, which includes C22orf15, has been linked to chromosomal instability (CIN)-positive aneuploid gastric cancer. [21] DNA region 22q11.23 is also significantly more prevalent in basal-like BRCA1-mutated tumors compared to nonmutated wild-type BRCA1 tumors. [22] Additionally, C22orf15 is highly differentially expressed in breast cancer cells compared to unaffected mammary epithelial cells. [23]

Tetralogy of Fallot, a birth defect that alters normal blood flow through the heart, is a disease associated with C22orf15. [24] [25] Survival rate of Nasopharyngeal carcinoma has been found to negatively correlate with the expression of six specific genes, including C22orf15. [26] C22orf15 has also been identified as one of the 40 novel protein biomarkers that are risk factors for incident atrial fibrillation. [27]

Genetics

Phenotypes of patients with C22orf15 copy-number variants include intellectual disability, micrognathia, ventricular septal defect, and Tetralogy of Fallot. [28]

Related Research Articles

<span class="mw-page-title-main">Chromosome 13</span> Human chromosome

Chromosome 13 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 13 spans about 113 million base pairs and represents between 3.5 and 4% of the total DNA in cells.

<span class="mw-page-title-main">Chromosome 17</span> Human chromosome

Chromosome 17 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 17 spans more than 84 million base pairs and represents between 2.5 and 3% of the total DNA in cells.

<span class="mw-page-title-main">METTL26</span> Protein-coding gene in the species Homo sapiens

METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.

<span class="mw-page-title-main">C14orf80</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C14orf80 is a protein which in humans is encoded by the chromosome 14 open reading frame 80, C14orf80, gene.

<span class="mw-page-title-main">C6orf201</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 201, C6orf201, is a protein that in humans is encoded by the C6orf201 gene. In humans this gene encodes for a nuclear protein that is primarily expressed in the testis.

<span class="mw-page-title-main">C1orf74</span> Protein-coding gene in the species Homo sapiens

UPF0739 protein C1orf74 is a protein that in humans is encoded by the C1orf74 gene.

<span class="mw-page-title-main">C11orf52</span> Protein-coding gene in the species Homo sapiens

C11orf52 is an uncharacterized protein that in homo sapiens is encoded by the C11orf52 gene.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C17orf50</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C17orf50 is a protein which in humans is encoded by the C17orf50 gene.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">C22orf23</span> Protein-coding gene in the species Homo sapiens

C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.

<span class="mw-page-title-main">C11orf53</span> Protein-coding gene in the species Homo sapiens

Chromosome 11 open reading frame 53 is a protein that in humans is encoded by the C11orf53 gene. Reduction in C11orf53 gene expression is associated with increased odds of occurrence of colorectal cancer. Specifically sequence variation (rs3802842) close to the C11orf53 gene locus that lowers the expression of C11orf53 has been observed in the colonic mucosal cells immediately adjacent to colon cancer tumors. C11orf53 downregulation aids in cells' ability to survive in acidic conditions, which are typical of the tumor microenvironment. CRISPR-Cas9 inactivation of C11orf53 in an acute myeloid leukemia cell line made the cells resistant to the BCL2 inhibitor Venetoclax, further supporting a role in cancer predisposition.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">C4orf36</span> Draft for page on C4orf36 gene/protein

C4orf36 is a protein that in humans is encoded by the c4orf36 gene.

<span class="mw-page-title-main">C1orf159</span> Protein encoded on a gene

C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.

<span class="mw-page-title-main">C20orf144</span> Human protein-encoding gene

Chromosome 20 open reading frame 144 (c20orf144) is a human protein-encoding gene. The human c20orf144 protein consists of 153 amino acids, with the first 150 amino acids being characterized as part of the Bcl-2 like protein of testis (Bclt) family.

References

  1. 1 2 3 ENSG00000169314 GRCh38: Ensembl release 89: ENSG00000273579, ENSG00000169314 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000050157 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "C22orf15 Gene - Chromosome 22 Open Reading Frame 15". GeneCards. Weizmann Institute of Science.
  6. "C22orf15 chromosome 22 open reading frame 15 [ Homo sapiens (human) ]". NCBI gene. U.S. National Library of Medicine.
  7. "C22orf15 Transcripts". NCBI. National Library of Medicine.
  8. "Genomatix". Precigen Bioinformatics.
  9. "of C22orf15 Protein Sequence (SAPS)Statistical Analysis". EMBL-EBI.
  10. "Uncharacterized protein C22orf15 AlphaFold structure prediction". AlphaFold Protein Structure Database. EMBL-EBI.
  11. "IL2RG interleukin 2 receptor subunit gamma [Homo sapiens (human)]". NCBI Gene. U.S. National Library of Medicine.
  12. "Uncharacterized protein C22orf15". PhosphoSitePlus. Cell Signaling Technology.
  13. "IL2RG gene". MedlinePlus. National Library of Medicine.
  14. "C22orf15". I-TASSER Protein Structure & Function Positions. University of Michigan.
  15. "C22orf15 chromosome 22 open reading frame 15 [ Homo sapiens (human) ]". NCBI gene. U.S. National Library of Medicine.
  16. "C22orf15 NCBI GEO Profile". NCBI Gene. U.S National Library of Medicine.
  17. "C22orf15 in Various Normal Tissues". NCBI Gene. U.S Library of Medicine.
  18. "Anti-C22orf15 antibody produced in rabbit". Sigma Aldrich Solutions. MilliporeSigma.
  19. "C22orf15 Subcellular Localization". PSORTII.
  20. "Protein BLAST". Basic Local Alignment Search Tool. U.S National Library of Medicine.
  21. Kawauchi S, Furuay T, Uchiyama T, Adachi A, Okada T, Nakao M, et al. (September 2010). "Genomic instability and DNA ploidy are linked to DNA copy number aberrations of 8p23 and 22q11.23 in gastric cancers". International Journal of Molecular Medicine. 26 (3): 333–339. PMID   20664948.
  22. Prat A, Cruz C, Hoadley KA, Díez O, Perou CM, Balmaña J (August 2014). "Molecular features of the basal-like breast cancer subtype based on BRCA1 mutation status". Breast Cancer Research and Treatment. 147 (1): 185–191. doi:10.1007/s10549-014-3056-x. PMC   4131128 . PMID   25048467.
  23. Ramos J, Felty Q, Roy D (2020). "Integrated Chip-Seq and RNA-Seq Data Analysis Coupled with Bioinformatics Approaches to Investigate Regulatory Landscape of Transcription Modulators in Breast Cancer Cells". Molecular Toxicology Protocols. Methods in Molecular Biology. Vol. 2102. pp. 35–59. doi:10.1007/978-1-0716-0223-2_3. ISBN   978-1-0716-0222-5. PMID   31989549. S2CID   210935020.
  24. "C22orf15 Gene". GeneCards. Weizmann Institute of Science.
  25. "C22orf15". Decipher Genomics. Wellcome Sanger Institute.
  26. Dai Y, Chen W, Wang X, Huang J, Yang M, Pan H, et al. (2020). "Screening Genes Crucial for Nasopharyngeal Carcinoma Using Weighted Gene Coexpression Network Analysis Combined with Methylation Data Analysis". SSRN   3566103.
  27. Norby FL. Atrial Fibrillation in Older Adults: Relation to Proteomics, Risk Prediction, and Urban/Rural Disparities in Treatment and Outcomes (Ph.D. thesis). University of Minnesota.
  28. "C22orf15". Decipher Genomics. Wellcome Sanger Institute.