C15orf32 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C15orf32 , chromosome 15 open reading frame 32, chromosome 15 putative open reading frame 32 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | HomoloGene: 89189 GeneCards: C15orf32 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Uncharacterized Protein C15orf32 is a protein which in humans is encoded by the C15orf32 gene and is located on chromosome 15, location 15q26.1. [3] Variants of C15orf32 have been linked to bipolar disorder, [4] [5] alcohol use disorder, [6] and acute myeloid leukemia. [7]
C15orf32, which stands for chromosome 15 open reading frame 32, is a gene on the plus strand of chromosome 15, on the cytogenetic band 15q26.1. C15orf32 is 29,464 bases long; on Genome Reference Consortium Human Build 38, it spans bases 92,471,654-92,501,117. It contains 3 exons. [3]
Two isoforms of C15orf32 exist. The longer transcript, known as transcript variant 2 on NCBI, is 1,764 bases long. [8] The other is transcript 1 and is 1,726 bases long. [9]
The transcript variant 2 of the C15orf32 gene encodes a 178 amino acid protein and has a molecular mass of 20,262 Da. Its basal isoelectric point is 9.34. [11] Transcript variant 1 is missing amino acids 166–178. [3] There is significantly large spacing between the glutamic acid residues at locations 12 and 23. [12]
A transmembrane segment is predicted between amino acids 51 and 71 by Phobius [14] and amino acids 57 and 71 by SAPS. [12] The N-terminus is predicted to be outside of the cytoplasm, and the C-terminus within the cytoplasm. [14] The Chou–Fasman algorithm predicts a beta sheet in this region, as well as much of region between amino acids 114 and 147. [10] I-TASSER was used to predict the tertiary structure. [13] The top model predicted eight alpha helices, including one between amino acids 51 and 71 concurrent with the transmembrane segment predicted earlier, although this structure had low confidence.
The promoter region of C15orf32 is predicted to span base pairs 92,470,677-92,471,777 according to Gene2Promoter tool by Genomatix. [15] The most commonly predicted transcription factor families by the MatInspector tool from Genomatix within this promoter region were SOX, nuclear receptor subfamily 2, and retinoid X receptor. [15] Transcription factor binding sites that have been determined experimentally include STAT1, MAFK, and JUND and are located within the second intron. [16] C15orf32 is expressed most notably in testes, brain, heart, and early in the development of fetuses, although expression is very low. [17] Exposure to some compounds such as bromelain, Bortezomib, estrogen, and 4-hydroxytamoxifen lead to increase in C15orf32 expression in breast cancer cells. [18] [19] [20]
Possible secondary structures of the 5' and 3' untranslated region in C15orf32 mRNA is given to the left and was predicted by mfold. [21] It is mostly linear, with a number of small stem-loops. According to TargetScan, sites targeted by miRNA families miR-193a-5p and miR-365-3p within the 3' UTR are broadly conserved among vertebrates. [22]
Immunohistochemical staining shows that C15orf32 is localized within cells to the cytoplasm and membranes, including the nucleus. [23] Both PSORTII and DeepLoc strongly predict localization to the nucleus. [24] [25] Thr41 has been shown to be phosphorylated post-translation [11] 26 other potential phosphorylation sites were predicted using NetPhos, with the most likely phosphorylation sites being 6S by PKC, 32T by PKG, 83T by PKC, 89S by PKC, and 162S by PKA. [26] A sumoylation site is predicted at amino acids 107–110. [27] 11 mucin type GalNAc O-glycosylation were predicted using NetOGlyc, 9 of which occurred in the first 50 base pairs. [28]
Experimental evidence shows potential interaction between C15orf32 and PKD2, ALG9, DISP1, NPC1, FZD2, FAM69A, ATP6V1G2, ASIC1, DPY19L4, SPPL2B, and HGSNAT. [29]
Variants of C15orf32 has been linked to several traits through genome wide association studies. The rs8040009 SNP in the 3’ UTR had a strong association with bipolar I disorder in a population of Han Chinese. [4] Three SNPs within C15orf32, including rs1455773 in exon 1 which causes a missense mutation from alanine to threonine at position 17, [30] were also associated with bipolar disorder in an Australian cohort. [5] This SNP was also linked to alcohol use disorder and heaviness of drinking. [6] The rs1455774 SNP, located in the 5’ UTR, is located within the target sites of miRNA has-miR-539 and has-let-7i* which affects the expression of these miRNAs, which may increase breast cancer susceptibility. [31] The rs11635085 SNP was linked to increased antibody IgG levels after exposure to casein, a dietary antigen, in Mexican Americans. [32] The rs1455782 SNP was linked to decreased forced vital capacity, which is a measure of pulmonary function. [33] The rs12148722 SNP was mildly associated with velopharyngeal dysfunction. [34] A haplotype block within C15orf32 was associated with acute myeloid leukemia. [7] A deletion in 15q26.1 including genes ST8SIA2, C15orf32, and FAM174B was found in a patient with epilepsy and autism spectrum disorder. [35]
Homologs of C15orf32 have been described in 39 other mammals. [36] No known orthologs exist outside of mammals.
Scientific name | Common name | Order | Date of divergence (MYA, estimated) | Sequence ID | Length | % Identity | % Similarity |
---|---|---|---|---|---|---|---|
Pan paniscus | Bonobo | Primates | 6.7 | XP_003816836.1 | 178 | 100.00 | 100 |
Colobus angolensis palliatus | Tanzanian black-and-white colobus | Primates | 29.44 | XP_011802791.1 | 165 | 81.21 | 83.03 |
Cebus capucinus imitator | Panamanian white-faced capuchin | Primates | 43.2 | XP_017372471.1 | 158 | 63.92 | 72.78 |
Marmota flaviventris | Yellow-bellied marmot | Rodentia | 90 | XP_027811208.1 | 171 | 53.80 | 61.99 |
Cavia porcellus | Guinea pig | Rodentia | 90 | XP_005008701.1 | 91 | 50.55 | 63.74 |
Urocitellus parryii | Arctic ground squirrel | Rodentia | 90 | XP_026249828.1 | 169 | 49.70 | 59.76 |
Heterocephalus glaber | Naked mole-rat | Rodentia | 90 | XP_021120379.1 | 157 | 47.77 | 56.69 |
Octodon degus | Common degu | Rodentia | 90 | XP_012369204.1 | 113 | 38.94 | 54.87 |
Ceratotherium simum simum | Southern white rhinoceros | Perissodactyla | 96 | XP_014644154.1 | 101 | 57.43 | 72.28 |
Tursiops truncatus | Common bottlenose dolphin | Artiodactyla | 96 | XP_019801175.1 | 120 | 53.33 | 67.5 |
Sus scrofa | Wild boar | Artiodactyla | 96 | XP_020955730.1 | 121 | 52.07 | 65.29 |
Balaenoptera acutorostrata scammoni | North Pacific Minke whale | Artiodactyla | 96 | XP_028020695.1 | 161 | 50.93 | 62.73 |
Equus caballus | Horse | Perissodactyla | 96 | XP_023505656.1 | 172 | 50.00 | 61.63 |
Felis catus | Cat | Carnivora | 96 | XP_006944424.1 | 89 | 49.44 | 58.43 |
Odobenus rosmarus divergens | Pacific walrus | Carnivora | 96 | XP_012420660.1 | 77 | 46.75 | 58.44 |
Lagenorhynchus obliquidens | Pacific white-sided dolphin | Artiodactyla | 96 | XP_026938944.1 | 180 | 44.44 | 55.56 |
Lipotes vexillifer | Baiji | Artiodactyla | 96 | XP_007472421.1 | 165 | 43.64 | 54.55 |
Panthera pardus | Leopard | Carnivora | 96 | XP_019315554.1 | 99 | 43.43 | 50.51 |
Orcinus orca | Killer whale | Artiodactyla | 96 | XP_012389158.1 | 162 | 43.21 | 53.7 |
Canis lupus dingo | Dingo | Carnivora | 96 | XP_025294087.1 | 149 | 36.24 | 44.3 |
Protein YIF1A is a Yip1 domain family proteins that in humans is encoded by the YIF1A gene.
Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.
LOC105377021 is a protein which in humans is encoded by the LOC105377021 gene. LOC105377021 exhibits expressional pathology related to breast cancer, specifically triple negative breast cancer. LOC105377021 contains a serine rich region in addition to predicted alpha helix motifs.
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.
Leukocyte Receptor Cluster Member 9 is an uncharacterized protein encoded by the LENG9 gene. In humans, LENG9 is predicted to play a role in fertility and reproductive disorders associated with female endometrium structures.
Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development
C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.
C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of 5 transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.
C4orf36 is a protein that in humans is encoded by the c4orf36 gene.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
{{cite journal}}
: Cite journal requires |journal=
(help){{cite journal}}
: Cite journal requires |journal=
(help)