KIAA2012

KIAA2012
Identifiers
Aliases KIAA2012
External IDs MGI: 2685819 HomoloGene: 124277 GeneCards: KIAA2012
Orthologs
Species Human Mouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001080445
NM_001277372
NM_001367720

NM_001013771
NM_001357842

RefSeq (protein)

NP_001264301
NP_001354649

n/a

Location (UCSC) Chr 2: 202.07 – 202.21 Mb Chr 1: 59.56 – 59.68 Mb
PubMed search [3] [4]

KIAA2012 is a protein that, in humans, is encoded by the KIAA2012 gene. KIAA2012 is expressed at very low levels throughout the body, with the highest expression in the ovary, lungs, and brain. [5]

Gene

Ideogram of human chromosome 2 showing the location of KIAA2012 (Image generated using BioRender).

KIAA2012 is located on the positive sense strand at position 2q33.1. [6] KIAA2012 has 24 exons, and it spans 131,934 bases including introns. No aliases or common names are used in addition to KIAA2012.

Gene level regulation

Within the promoter region of KIAA2012, there is a highly conserved transcription factor binding site that has no common SNPs. [7] The RFX transcription factors, more specifically RFX1-6, bind to this highly conserved region and regulate cellular specialization and differentiation. [8] The image below shows the promoter region of KIAA2012 with the highly conserved RFX1-6 binding site. [7]

Human KIAA2012 promoter sequence. In red are common SNPs, highlighted in yellow is the conserved RFX1-6 transcription factor binding site, and the bolded letters show the transcription start site.

mRNA

KIAA2012 is expressed differentially in the body at low levels. Of this overall low expression, KIAA2012 is expressed most highly in the brain, lungs, and ovary. [5] [9] KIAA2012 is expressed at lower levels in the liver, trachea, and testes. [10] [11] [12]

Protein

Unmodified KIAA2012 is 1,181 amino acids in length, has a molecular weight of 136 kDa, and has an isoelectric point (pI) of around 8. [13] [14]
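As a rough plausibility check on the figures above, the sketch below compares the reported 136 kDa with a rule-of-thumb estimate. It assumes an average residue mass of about 110 Da, a common textbook approximation that does not come from the cited sources; the function and variable names are illustrative only.

```python
# Rule-of-thumb estimate: protein mass ~ residue count x average residue mass.
AVERAGE_RESIDUE_MASS_DA = 110.0  # assumption: generic textbook approximation


def estimate_mass_kda(length_aa: int, avg_residue_da: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Return an approximate molecular weight in kilodaltons."""
    return length_aa * avg_residue_da / 1000.0


kiaa2012_length = 1181     # residues, as stated in the article
reported_mass_kda = 136.0  # as stated in the article

estimate = estimate_mass_kda(kiaa2012_length)
# Average mass per residue that the reported 136 kDa would imply.
implied_avg_residue_da = reported_mass_kda * 1000.0 / kiaa2012_length

print(f"Rule-of-thumb estimate: {estimate:.1f} kDa")
print(f"Average residue mass implied by the reported value: {implied_avg_residue_da:.1f} Da")
```

The reported value sits somewhat above the generic estimate, which is at least consistent with a composition rich in relatively heavy residues such as glutamic acid and glutamine.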

Internal features

KIAA2012 is rich in glutamic acid and glutamine and poor in valine. [13] There is one mixed charge cluster spanning amino acids 951–1118. [15] There is also one domain of unknown function (DUF 4670) within KIAA2012, spanning amino acids 635 to 1137. [6] In contrast to the protein as a whole, DUF 4670 is also rich in arginine and poor in glycine and phenylalanine. [13]

Schematic of KIAA2012 with annotations showing the glutamic acid (E) and glutamine (Q) rich regions, DUF 4670, and the mixed charge cluster (MCC). (Made using IBS Online Drawing Tool).
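The residue coordinates given above can be checked with simple interval arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates; the `Region` helper is hypothetical and is not part of any annotation tool used by the sources.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """A protein feature with 1-based, inclusive residue coordinates (assumed)."""
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        return self.end - self.start + 1

    def contains(self, other: "Region") -> bool:
        return self.start <= other.start and other.end <= self.end


PROTEIN_LENGTH = 1181  # residues, as stated in the article
duf4670 = Region("DUF 4670", 635, 1137)
mcc = Region("mixed charge cluster", 951, 1118)

duf_fraction = duf4670.length / PROTEIN_LENGTH

print(f"{duf4670.name}: {duf4670.length} residues ({duf_fraction:.1%} of the protein)")
print(f"{mcc.name}: {mcc.length} residues")
print(f"Cluster lies inside {duf4670.name}: {duf4670.contains(mcc)}")
```

Under these assumptions the mixed charge cluster falls entirely within DUF 4670, and the domain covers a little over two fifths of the protein.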

Structure

The secondary structure of KIAA2012 is predicted to consist primarily of alpha helices. The two images below show a high-confidence prediction of the secondary structure and a predicted 3-D structure of the entire protein, which shows how the alpha helices fold to form KIAA2012.

Predicted folding pattern and 3-D structure of KIAA2012. (Image generated by I-Tasser Online Tool).
Predicted KIAA2012 secondary structure with a 91.2% confidence level. (Image generated by Phyre 2.0)

Post-translational modification

KIAA2012 has a highly conserved cGMP-dependent protein kinase binding domain. These cGMP-dependent protein kinases (PRKG) are a part of the NO/cGMP signaling pathway, and they are important factors in many signal transduction processes. [16] Additionally, there are many potential sites for phosphorylation, SUMOylation, and myristoylation. In instances where KIAA2012 is post-translationally modified in these ways, the resulting charge, structure, function, and sub-cellular localization can be altered. [17] [18]

Sub-cellular localization

Proteins tagged with localization signals are transported to specific regions of the cell. KIAA2012 contains nuclear localization signal sequences, which are short stretches of amino acids that mediate the transport of nuclear proteins into the nucleus. [19] The table below lists the predicted localization of human KIAA2012 and two orthologs, with a confidence value for each cellular compartment. [20]

KIAA2012 Localization with Confidence Percentages
Species              Nuclear  Plasma Membrane  Cytoskeletal  Mitochondrial  Cytoplasmic  Secretory Vesicles
Human                82%      4%               9%            4%             ---          ---
Sardinian Tree Frog  78%      9%               9%            4%             ---          ---
Zebrafish            74%      9%               4%            ---            9%           4%
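The localization values above can be summarized programmatically. The sketch below transcribes the table into a dictionary (cells shown as "---" are omitted) and reports the top-scoring compartment for each species; the data structure and variable names are illustrative and are not produced by the prediction tool itself.

```python
# Confidence percentages transcribed from the localization table.
# Compartments shown as "---" in the table are left out.
localization = {
    "Human": {
        "Nuclear": 82, "Plasma Membrane": 4, "Cytoskeletal": 9, "Mitochondrial": 4,
    },
    "Sardinian Tree Frog": {
        "Nuclear": 78, "Plasma Membrane": 9, "Cytoskeletal": 9, "Mitochondrial": 4,
    },
    "Zebrafish": {
        "Nuclear": 74, "Plasma Membrane": 9, "Cytoskeletal": 4,
        "Cytoplasmic": 9, "Secretory Vesicles": 4,
    },
}

# Highest-scoring compartment and the sum of listed percentages per species.
top_compartment = {sp: max(scores, key=scores.get) for sp, scores in localization.items()}
totals = {sp: sum(scores.values()) for sp, scores in localization.items()}

for species in localization:
    print(f"{species}: top = {top_compartment[species]}, listed total = {totals[species]}%")
```

The human row sums to 99% rather than 100%, which is presumably a rounding effect in the reported values.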

Function

KIAA2012 has predicted protein interactions with STAG2 and SMC1A. [21] STAG2 encodes a subunit of the cohesin complex, which regulates sister chromatid separation during cell division. [22] SMC1A is an important part of functional kinetochores due to its role in the multiprotein cohesin complex required for sister chromatid cohesion. [23] Because KIAA2012 is predicted to localize to the nucleus and to interact with STAG2 and SMC1A, its function may involve DNA manipulation or cell division.

Predicted Proteins that Interact with KIAA2012
Protein Name  Aliases                                                            Location
SMC1A         SMC1, SMCB, CDLS2, SB1.8, SMC1L1, DXS423E, SMC1alpha, RP6-29D12.1  Xp11.22 [23]
STAG2         SA2, SA-2, SCC3B, bA517O1.1, RP11-517O1.1                          Xq25 [22]

Homology and evolution

Twenty organisms with a KIAA2012 ortholog are shown below, sorted by date of divergence and then by sequence identity. No orthologs were found in birds, but orthologs of KIAA2012 exist in mammals, reptiles, amphibians, and fish. An unrooted phylogenetic tree showing each taxonomic group and its divergence pattern can be found below the ortholog table.

Genus & Species             Common Name            Date of Divergence (MYA)  Accession #     Sequence Length  % Identity  % Similarity
Homo sapiens                Human                  0                         NM_001277372.4  1181             100         100
Hylobates moloch            Silvery Gibbon         19.5                      XP_032610815    1181             94.2        96.4
Sciurus carolinensis        Gray Squirrel          87                        XP_047398902    1130             64.1        72.3
Mus caroli                  Mouse                  87                        XP_029333762    1160             61.3        72
Panthera uncia              Snow Leopard           94                        XP_049471125    1180             75.3        82.7
Orcinus orca                Killer Whale           94                        XP_033285753    1172             74.5        82.7
Bubalus bubalis             Water Buffalo          94                        XP_006080602    1185             72.9        81.1
Alligator mississippiensis  American Alligator     319                       XP_059583055    1325             37.1        49.7
Caretta caretta             Loggerhead Turtle      319                       XP_048725054    1329             37          51.4
Chelonia mydas              Green Sea Turtle       319                       XP_037768210    1325             36.8        50.9
Crotalus tigris             Tiger Rattlesnake      319                       XP_039210533    1220             32.4        46.9
Xenopus tropicalis          Western Clawed Frog    352                       XP_031749269    1339             31.9        45.8
Rhinatrema bivittatum       Two-Lined Caecilian    352                       XP_029462137    1499             30.6        44.8
Spea bombifrons             Plains Spadefoot Toad  352                       XP_053326593    1436             29.6        44
Hyla sarda                  Sardinian Tree Frog    352                       XP_056391303    1428             29.5        44.9
Protopterus annectens       West African Lungfish  408                       XP_043931036    1412             29.8        45
Takifugu rubripes           Japanese Puffer        429                       XP_029701411    1129             25.3        39.4
Danio rerio                 Zebrafish              429                       XP_009302807    1484             24.9        37.1
Anarrhichthys ocellatus     Wolf Eel               429                       XP_031729884    1204             23.9        37.3
Amblyraja radiata           Thorny Skate           462                       XP_032880336    1392             25.7        40.5
Unrooted phylogenetic tree of the KIAA2012 orthologs found in Table 1.  Mammals (yellow), reptiles (red), amphibians (green), and fish (blue) are circled to differentiate between them, and images of each species are provided next to the three-letter code
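To make the trend in the ortholog table easier to see, the sketch below averages percent identity to the human protein for each divergence date. The values are transcribed from the table; grouping by divergence date is an illustrative choice rather than an analysis reported by the sources, and the variable names are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# (common name, divergence from human in MYA, % identity), transcribed from the ortholog table.
orthologs = [
    ("Human", 0, 100.0), ("Silvery Gibbon", 19.5, 94.2),
    ("Gray Squirrel", 87, 64.1), ("Mouse", 87, 61.3),
    ("Snow Leopard", 94, 75.3), ("Killer Whale", 94, 74.5), ("Water Buffalo", 94, 72.9),
    ("American Alligator", 319, 37.1), ("Loggerhead Turtle", 319, 37.0),
    ("Green Sea Turtle", 319, 36.8), ("Tiger Rattlesnake", 319, 32.4),
    ("Western Clawed Frog", 352, 31.9), ("Two-Lined Caecilian", 352, 30.6),
    ("Plains Spadefoot Toad", 352, 29.6), ("Sardinian Tree Frog", 352, 29.5),
    ("West African Lungfish", 408, 29.8),
    ("Japanese Puffer", 429, 25.3), ("Zebrafish", 429, 24.9), ("Wolf Eel", 429, 23.9),
    ("Thorny Skate", 462, 25.7),
]

# Collect identities under each divergence date.
identities_by_date = defaultdict(list)
for _name, mya, identity in orthologs:
    identities_by_date[mya].append(identity)

# Mean identity per divergence date, ordered from most recent to most ancient.
mean_identity_by_date = {
    mya: mean(values) for mya, values in sorted(identities_by_date.items())
}

for mya, avg in mean_identity_by_date.items():
    count = len(identities_by_date[mya])
    print(f"{mya:>6} MYA: mean identity {avg:.1f}% (n={count})")
```

In this grouping, identity generally falls as divergence time increases, although the two rodents listed at 87 MYA show lower identity than the mammals listed at 94 MYA.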

Clinical significance

Several genome-wide association studies report trait-associated variants in KIAA2012. The reported traits with the highest number of associations are heel bone mineral density, taste liking measurement, educational attainment, lung function, and height. [24] Additionally, KIAA2012 is downregulated in women with polycystic ovary syndrome (PCOS) compared to women without PCOS. [25]

References

  1. GRCh38: Ensembl release 89: ENSG00000182329 - Ensembl, May 2017
  2. GRCm38: Ensembl release 89: ENSMUSG00000047361 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Fagerberg L, Hallström BM, Oksvold P, Kampf C, Djureinovic D, Odeberg J, et al. (February 2014). "Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics". Molecular & Cellular Proteomics. 13 (2): 397–406. doi:10.1074/mcp.M113.035600. PMC 3916642. PMID 24309898.
  6. "KIAA2012 [Homo sapiens (human)]". NCBI. National Library of Medicine. Retrieved 1 Dec 2023.
  7. "UCSC Genomics Institute". Genome Browser. Retrieved 30 Nov 2023.
  8. Sugiaman-Trapman D, Vitezic M, Jouhilahti EM, Mathelier A, Lauter G, Misra S, et al. (March 2018). "Characterization of the human RFX transcription factor family by regulatory and target gene analysis". BMC Genomics. 19 (1): 181. doi:10.1186/s12864-018-4564-6. PMC 5838959. PMID 29510665.
  9. "Illumina bodyMap2 Transcriptome". NCBI. BioProject. Retrieved 10 Dec 2023.
  10. Szabo L, Morey R, Palpant NJ, Wang PL, Afari N, Jiang C, et al. (June 2015). "Statistically based splicing detection reveals neural enrichment and tissue-specific induction of circular RNA during human fetal development". Genome Biology. 16 (1): 126. doi:10.1186/s13059-015-0690-5. PMC 4506483. PMID 26076956.
  11. Duff MO, Olson S, Wei X, Garrett SC, Osman A, Bolisetty M, et al. (May 2015). "Genome-wide identification of zero nucleotide recursive splicing in Drosophila". Nature. 521 (7552): 376–379. Bibcode:2015Natur.521..376D. doi:10.1038/nature14475. PMC 4529404. PMID 25970244.
  12. "Tissue Expression Type -- KIAA2012". The Human Protein Atlas. Retrieved 8 Nov 2023.
  13. "SAPS Results". European Bioinformatics Institute. Retrieved 29 Nov 2023.
  14. Tokmakov AA, Kurotani A, Sato KI (2021). "Protein pI and Intracellular Localization". Frontiers in Molecular Biosciences. 8: 775736. doi:10.3389/fmolb.2021.775736. PMC 8667598. PMID 34912847.
  15. Zhu ZY, Karlin S (August 1996). "Clusters of charged residues in protein three-dimensional structures". Proceedings of the National Academy of Sciences of the United States of America. 93 (16): 8350–8355. Bibcode:1996PNAS...93.8350Z. doi:10.1073/pnas.93.16.8350. PMC 38674. PMID 8710874.
  16. Wolfertstetter S, Huettner JP, Schlossmann J (February 2013). "cGMP-Dependent Protein Kinase Inhibitors in Health and Disease". Pharmaceuticals. 6 (2): 269–286. doi:10.3390/ph6020269. PMC 3816681. PMID 24275951.
  17. Maejima Y, Sadoshima J (September 2014). "SUMOylation: a novel protein quality control modifier in the heart". Circulation Research. 115 (8): 686–689. doi:10.1161/CIRCRESAHA.114.304989. PMC 4181369. PMID 25258400.
  18. Nestler EJ, Greengard P (1999). "Protein Phosphorylation is of Fundamental Importance in Biological Regulation". Basic Neurochemistry: Molecular, Cellular and Medical Aspects. 6. Retrieved 10 Dec 2023.
  19. Cokol M, Nair R, Rost B (November 2000). "Finding nuclear localization signals". EMBO Reports. 1 (5): 411–415. doi:10.1093/embo-reports/kvd092. PMC 1083765. PMID 11258480.
  20. "YLoc". Interpretable Subcellular Localization Prediction. Retrieved 2 Dec 2023.
  21. "KIAA2012 Results Summary". BioGRID. Retrieved 30 Nov 2023.
  22. "STAG2 cohesion complex component". Gene -- NCBI. National Library of Medicine. Retrieved 3 Dec 2023.
  23. "SMC1A - structural maintenance of chromosome 1A (human)". PubChem. National Library of Medicine. Retrieved 3 Dec 2023.
  24. "The NHGRI-EBI Catalog of human genome-wide association studies". GWAS Catalog. Retrieved 10 Dec 2023.
  25. Hiam D, Simar D, Laker R, Altıntaş A, Gibson-Helm M, Fletcher E, et al. (December 2019). "Epigenetic Reprogramming of Immune Cells in Women With PCOS Impact Genes Controlling Reproductive Function". The Journal of Clinical Endocrinology and Metabolism. 104 (12): 6155–6170. doi:10.1210/jc.2019-01015. hdl:10536/DRO/DU:30130006. PMID 31390009.