KIAA0825

Last updated
KIAA0825
Identifiers
Aliases KIAA0825 , C5orf36, PAPA10
External IDs OMIM: 617266; MGI: 1919621; HomoloGene: 89234; GeneCards: KIAA0825; OMA:KIAA0825 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001145678
NM_173665

NM_001081353
NM_001145676
NM_028215

RefSeq (protein)

NP_001139150
NP_775936

NP_001074822
NP_001139148

Location (UCSC) Chr 5: 94.15 – 94.62 Mb Chr 13: 77.28 – 77.76 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

KIAA0825 is a protein that in humans is encoded by the gene of the same name, located on chromosome 5, 5q15. It is a possible risk factor in Type II Diabetes, and associated with high levels of glucose in the blood. It is a relatively fast mutating gene, compared to other coding genes. There is however one region which is highly conserved across the species that have the gene, known as DUF4495. It is predicted to travel between the nucleus and the cytoplasm.

Contents

General information

The Isoforms of C5orf36 C5orf36 isoforms.png
The Isoforms of C5orf36

KIAA0825 is gene that appears to be a genetic factor that increases the risk of Type II Diabetes, possibly by increasing the level of blood glucose levels. [5] It has also been identified as a possible oncogene. [6] C5orf36 has one common alias KIAA0825. The gene is about 478 kb long and contains 22 exons. It produces 10 different variants: 9 alternatively spliced, and one un-spliced version. The longest experimentally confirmed mRNA is 7240 bp long and produces a protein 1275 amino acids long. [7] The protein is predicted to weigh about 147.8kDal. It has orthologs in most animals including Aplysia californica , but is not found outside animals with the possible exception of Plasmodiophora brassicae.

Protein information

The protein has a predicted weight of 147.8 kDal. [8] [9] It does not contain a known nuclear localization signal but does contain a nuclear export signal. [10] The subcellular localization for the protein is predicted to be the nucleus and the cytoplasm. [11] This suggests that the protein might shuttle back and forth across the nuclear membrane.

Secondary structure

This is a 3-D Prediction created by I-TASSER. The green indicates the conserved DUF4495. C5orf36 Predicted Tertiary Structure.png
This is a 3-D Prediction created by I-TASSER. The green indicates the conserved DUF4495.

Several programs suggest that the secondary structure of the protein is mainly helices with only a few beta sheets. [12] [13] [14] [15] Analysis of protein composition also suggests that the protein has relatively low levels of glycine. [16] This could suggest a fairly rigid structure relative to other proteins. The tertiary structure is harder to predict due to the size of the protein, partially due to its size. The 3-D structure shown shows a prediction made by I-TASSER. This is a possible structure with a C-score of -1.06 on a scale from -5 to 1 (in which the higher the number the greater the confidence). [17] [18] [19] This predicted structure indicates there are two main parts, and it is possible they interact depending on the state of the protein (e.g. whether or not it's phosphorylated).

Expression

mRNA expression data from the Human Protein Atlas, calculated as transcripts per million (TPM). C5orf36 mRNA expression data.png
mRNA expression data from the Human Protein Atlas, calculated as transcripts per million (TPM).
This shows the expression levels of C5orf36 in human tissue. It is provided by the Human Protein Atlas. C5orf36 Protein expression.png
This shows the expression levels of C5orf36 in human tissue. It is provided by the Human Protein Atlas.

The mRNA for KIAA0825 is expressed at relatively low rates in comparison to other mRNAs. [20] The protein however is expressed at relatively high rates, especially in parts of the brain as well as adrenal glands and the thyroid. [21] This would suggest that the protein is not readily degraded and remains in the cell for long periods of time, such that continuous transcription of the DNA into mRNA is unnecessary. No current finding suggest that there is alternative expression of different isoforms in different tissues.

Regulation

Analysis of the promoter offers some insight into the expression of KIAA0825. [22] One possible regulator found is the NeuroD1 transcription factor. This factor is an important regulator for the insulin gene, and a mutation in this gene can lead to Type II diabetes. [23] This could explain why KIAA0825 is expressed at lower levels in patients with Type II diabetes. Another possible transcription factor is the Myeloid zinc finger 1 factor, which is tied to myeloid leukemia, because it delays apoptosis of cells in the presence of retinoic acid. [24] There are also several places where Vertebrate SMAD family transcription factors can bind. These transcription factors are thought to be responsible for nucleocytoplasmic dynamics. [25] This means that these SMAD transcription factors could affect KIAA0825, because subcellular localization suggests it shuttles across the nuclear envelope.

Function

There are two proteins found to interact with KIAA0825. One is Interleukin enhancer-binding factor 3. [26] ILF3 is a factor that complexes with other proteins and regulates gene expression and stabilizes mRNAs. [27] The other is the Amyloid-beta precursor protein. [28] This protein is an integral membrane protein found most commonly in the synapses of neurons. Neither of these proteins is well enough understood to indicate for certain the role of C5orf36 in human cells. They however suggest that KIAA0825 could serve a variety of roles in different parts of the cell.

Orthology

KIAA0825 orthologs can be found in virtually all animals, but cannot be found in plants, bacteria, or protozoa. It is mostly highly conserved in vertebrates especially mammals, but genes that contain region similar to DUF4495 region can be found in California sea hare, generally one of the most simple animal. The size especially in mammals is well conserved sticking very close to between 1250 and 1300 amino acids long. This suggests that the protein wraps around on itself forming important structures for its function.

There were no paralogs found of the gene KIAA0825 in humans or in any other species.

Related Research Articles

<span class="mw-page-title-main">YIF1A</span> Protein-coding gene in the species Homo sapiens

Protein YIF1A is a Yip1 domain family proteins that in humans is encoded by the YIF1A gene.

<span class="mw-page-title-main">C20orf27</span> Protein-coding gene in the species Homo sapiens

UPF0687 protein C20orf27 is a protein that in humans is encoded by the C20orf27 gene. It is expressed in the majority of the human tissues. One study on this protein revealed its role in regulating cell cycle, apoptosis, and tumorigenesis via promoting the activation of NFĸB pathway.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

<span class="mw-page-title-main">C2orf73</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">Transmembrane protein 255A</span> Mammalian protein found in Homo sapiens

Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.

<span class="mw-page-title-main">Proline-rich protein 30</span>

Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.

<span class="mw-page-title-main">C16orf82</span> Protein-coding gene in the species Homo sapiens

C16orf82 is a protein that, in humans, is encoded by the C16orf82 gene. C16orf82 encodes a 2285 nucleotide mRNA transcript which is translated into a 154 amino acid protein using a non-AUG (CUG) start codon. The gene has been shown to be largely expressed in the testis, tibial nerve, and the pituitary gland, although expression has been seen throughout a majority of tissue types. The function of C16orf82 is not fully understood by the scientific community.

<span class="mw-page-title-main">SHLD1</span> Protein-coding gene in the species Homo sapiens

SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.

<span class="mw-page-title-main">C18orf63</span> Protein-coding gene in the species Homo sapiens

Chromosome 18 open reading frame 63 is a protein which in humans is encoded by the C18orf63 gene. This protein is not yet well understood by the scientific community. Research has been conducted suggesting that C18orf63 could be a potential biomarker for early stage pancreatic cancer and breast cancer.

<span class="mw-page-title-main">C1orf112</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.

<span class="mw-page-title-main">CFAP299</span> Protein found in humans

Cilia- and flagella-associated protein 299 (CFAP299) is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

<span class="mw-page-title-main">C2orf16</span> Protein-coding gene in the species Homo sapiens

C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.

<span class="mw-page-title-main">C7orf50</span> Mammalian protein found in Homo sapiens

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">ISLR</span> Protein-coding gene in the species Homo sapiens

In humans, the immunoglobulin super family containing leucine-rich repeat (ISLR) protein is encoded by the ISLR gene. Current RNA-seq studies show that the protein is highly expressed in the endometrium and ovary and shows expression among 25 other tissues. The protein is seen localized in the cytoplasm, plasma membrane, extracellular exosome, and platelet alpha granule lumen. Furthermore, the protein is known to play a role in platelet degranulation, cell adhesion, and response to elevated platelet cytosolic Ca2+.

<span class="mw-page-title-main">SAAL1</span> Protein-coding gene in the species Homo sapiens

Serum amyloid A-like 1 is a protein in humans encoded by the SAAL1 gene.

<span class="mw-page-title-main">OCEL1</span> Protein-coding gene in the species Homo sapiens

OCEL1, also called Occludin//ELL Domain Containing 1, is a protein encoding gene located at chromosome 19p13.11 in the human genome. Other aliases for the gene include FLJ22709, FWP009, and S863-9. The function of OCEL1 has not yet been identified.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">SLC66A3</span> Entry on the gene SLC66A3

Solute carrier family 66 member 3 is a gene in humans that encodes the protein SLC66A3. The function of the SLC66A3 protein is not yet well understood but belongs to a family of five evolutionarily related proteins, the SLC66 lysosomal amino acid transporters. SLC66A3 is localized to the endoplasmic reticulum and has four transmembrane domains.

<span class="mw-page-title-main">CCDC188</span> Protein found in humans

CCDC188 or coiled-coil domain containing protein is a protein that in humans is encoded by the CCDC188 gene.

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000185261 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000071252 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Li J, Wei J, Xu P, Yan M, Li J, Chen Z, Jin T (December 2016). "Impact of diabetes-related gene polymorphisms on the clinical characteristics of type 2 diabetes Chinese Han population". Oncotarget. 7 (51): 85464–85471. doi:10.18632/oncotarget.13399. PMC   5356749 . PMID   27863428.
  6. Delgado AP, Brandao P, Chapado MJ, Hamid S, Narayanan R (July–August 2014). "Open reading frames associated with cancer in the dark matter of the human genome". Cancer Genomics & Proteomics. 11 (4): 201–13. PMID   25048349.
  7. NCBI Resource Coordinators (January 2017). "Database Resources of the National Center for Biotechnology Information". Nucleic Acids Research. 45 (D1): D12–D17. doi:10.1093/nar/gkw1071. PMC   5210554 . PMID   27899561.
  8. Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–6. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  9. Brendel V. "SDSC Biology Workbench". workbench.sdsc.edu. Department of Mathematics, Stanford University, CA. Retrieved 17 April 2017.
  10. la Cour T, Kiemer L, Mølgaard A, Gupta R, Skriver K, Brunak S (June 2004). "Analysis and prediction of leucine-rich nuclear export signals". Protein Engineering, Design & Selection. 17 (6): 527–36. doi: 10.1093/protein/gzh062 . PMID   15314210.
  11. Nakai K, Horton P (January 1999). "PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization". Trends in Biochemical Sciences. 24 (1): 34–6. doi:10.1016/s0968-0004(98)01336-x. PMID   10087920.
  12. Bigelow HR, Petrey DS, Liu J, Przybylski D, Rost B (28 April 2004). "Predicting transmembrane beta-barrels in proteomes". Nucleic Acids Research. 32 (8): 2566–77. doi:10.1093/nar/gkh580. PMC   419468 . PMID   15141026.
  13. Rost B, Yachdav G, Liu J (July 2004). "The PredictProtein server". Nucleic Acids Research. 32 (Web Server issue): W321–6. doi:10.1093/nar/gkh377. PMC   441515 . PMID   15215403.
  14. Garnier J, Osguthorpe DJ, Robson B (March 1978). "Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins". Journal of Molecular Biology. 120 (1): 97–120. doi:10.1016/0022-2836(78)90297-8. PMID   642007.
  15. Burgess AW, Ponnuswamy PK, Scheraga HA (1974). "Analysis of Conformations of Amino Acid Residues and Prediction of Backbone Topography in Proteins". Israel Journal of Chemistry. 12 (1–2): 239–286. doi:10.1002/ijch.197400022.
  16. Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–6. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  17. Zhang Y (January 2008). "I-TASSER server for protein 3D structure prediction". BMC Bioinformatics. 9 (1): 40. doi: 10.1186/1471-2105-9-40 . PMC   2245901 . PMID   18215316.
  18. Roy A, Kucukural A, Zhang Y (April 2010). "I-TASSER: a unified platform for automated protein structure and function prediction". Nature Protocols. 5 (4): 725–38. doi:10.1038/nprot.2010.5. PMC   2849174 . PMID   20360767.
  19. Yang J, Yan R, Roy A, Xu D, Poisson J, Zhang Y (January 2015). "The I-TASSER Suite: protein structure and function prediction". Nature Methods. 12 (1): 7–8. doi:10.1038/nmeth.3213. PMC   4428668 . PMID   25549265.
  20. Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, et al. (January 2015). "Proteomics. Tissue-based map of the human proteome". Science. 347 (6220): 1260419. doi:10.1126/science.1260419. PMID   25613900. S2CID   802377.
  21. Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, et al. (January 2015). "Proteomics. Tissue-based map of the human proteome". Science. 347 (6220): 1260419. doi:10.1126/science.1260419. PMID   25613900. S2CID   802377.
  22. "Genomatix". Genomatix. Archived from the original on 19 August 2021. Retrieved 7 May 2017.
  23. Prantera G, Pimpinelli S, Rocchi A (1 January 1999). "Effects of distamycin A on human leukocytes in vitro". Cytogenetics and Cell Genetics. 23 (1–2): 103–7. doi: 10.1128/MCB.19.1.704 . PMC   83927 . PMID   83927.
  24. Robertson KA, Hill DP, Kelley MR, Tritt R, Crum B, Van Epps S, Srour E, Rice S, Hromas R (May 1998). "The myeloid zinc finger gene (MZF-1) delays retinoic acid-induced apoptosis and differentiation in myeloid leukemia cells". Leukemia. 12 (5): 690–8. doi: 10.1038/sj.leu.2401005 . PMID   9593266.
  25. Massagué J, Seoane J, Wotton D (December 2005). "Smad transcription factors". Genes & Development. 19 (23): 2783–810. doi: 10.1101/gad.1350705 . PMID   16322555.
  26. Chu L, Su MY, Maggi LB, Lu L, Mullins C, Crosby S, Huang G, Chng WJ, Vij R, Tomasson MH (August 2012). "Multiple myeloma-associated chromosomal translocation activates orphan snoRNA ACA11 to suppress oxidative stress". The Journal of Clinical Investigation. 122 (8): 2793–806. doi:10.1172/JCI63051. PMC   3408744 . PMID   22751105.
  27. Chaumet A, Castella S, Gasmi L, Fradin A, Clodic G, Bolbach G, Poulhe R, Denoulet P, Larcher JC (June 2013). "Proteomic analysis of interleukin enhancer binding factor 3 (Ilf3) and nuclear factor 90 (NF90) interactome". Biochimie. 95 (6): 1146–57. doi:10.1016/j.biochi.2013.01.004. PMID   23321469.
  28. Oláh J, Vincze O, Virók D, Simon D, Bozsó Z, Tõkési N, et al. (September 2011). "Interactions of pathological hallmark proteins: tubulin polymerization promoting protein/p25, beta-amyloid, and alpha-synuclein". The Journal of Biological Chemistry. 286 (39): 34088–100. doi: 10.1074/jbc.M111.243907 . PMC   3190826 . PMID   21832049.