KIAA0232

Last updated

KIAA0232 is a nuclear phosphoserine protein which in humans is encoded by the KIAA0232 gene.

Contents

Gene

KIAA0232 is located at 4p16.1 neighboring TBC1 domain family member 14 and an uncharacterized locus. It has 10 exons which comprise its 4 known transcript variants. [1]

Chromosomal neighborhood of gene KIAA0232 KIAA0232 ideogram.gif
Chromosomal neighborhood of gene KIAA0232

Expression

KIAA0232 is expressed fairly ubiquitously, but particularly highly in the brain relative to other tissues according to GEO normal tissue expression profiling. [2] Other notable areas of high expression identified by EST profiling include nerves, umbilical cord, and parathyroid. [3]

Homology

Paralogs

There are no known paralogs of KIAA0232.

Orthologs

KIAA0232 is conserved in most animals, including mammals, reptiles, birds, amphibians, insects, and as far back as Trichnella spiralis, a species of nematode. It is not found in fungi, plants, or prokaryotes. [4]

Protein

The KIAA0232 protein is 1395 amino acids in length with a molecular weight of 154.8kDa. It has higher than average frequencies of serine and glutamic acid residues as well as several multi-serine runs that are evolutionarily conserved. It has an isoelectric point of 4.52. [5] [6]

Domains

KIAA0232 is largely composed of DUF4603. [7]

Graphic alignment of KIAA0232 and DUF4603 from NCBI BLAST DUF4603.PNG
Graphic alignment of KIAA0232 and DUF4603 from NCBI BLAST

Subcellular localization

KIAA0232 is predicted to have nuclear localization. [5]

Post-translational modification

KIAA0232 is predicted by the tools at ExPASy to be highly phosphorylated, with 15 tyrosine specific sites, 25 threonine specific sites, and 103 serine specific sites. [8]

Interacting proteins

KIAA0232 is experimentally known to interact with YWHAZ, [9] DYRK1A, and DYRK1B. [10] via two hybrid interaction experiments. YWHAZ is an apoptotic path regulator known to bind to phosphoserine proteins. DYRK1A/B are dual-specificity tyrosine kinases. This corroborates the post-translational modification predictions.

Related Research Articles

<span class="mw-page-title-main">TSR3</span> Hypothetical human protein

TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16. Its protein is 312 amino acids long and its cDNA has 1214 base pairs. It was previously designated C16orf42.

<span class="mw-page-title-main">Transmembrane protein 53</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 53, or TMEM53, is a protein that is encoded on chromosome 1 in humans. It has no paralogs but is predicted to have many orthologs across eukaryotes.

<span class="mw-page-title-main">C2CD4D</span> Mammalian protein found in Homo sapiens

C2CD4D, or C2 calcium-dependent domain-containing protein 4D is a protein product of the human genome. The gene that codes for this protein is found on chromosome 1, from 150,076,963 to 150,079,657. The gene contains 2 exons and encodes 353 amino acids. Synonyms for C2CD4D are "FAM148D" and NP_001129475. C2CD4D contains a conserved metal binding domain that is a known as Protein kinase C conserved region 2, subgroup 1. This motif is known to be a member of the C2 superfamily, which is present in phospholipases, protein kinases C, and synaptotagmins. The amino acid sequence of C2CD4D can be accessed at Prior to any post translational modification, C2CD4D has a molecular weight of 37.6 kdal. Although scientists have not yet determined where C2CD4D functions within the cell, C2CD4D has a predicted isoelectric point of 11.636 which severely limits the places in which it can be effective. In addition, C2CD4D does not contain any predicted transmembrane domains or any predicted signal peptides.

<span class="mw-page-title-main">WWC2</span> Protein-coding gene in the species Homo sapiens

WW and C2 domain containing 2 (WWC2) is a protein that in humans is encoded by the WWC2 gene (4q35.1). Though function of WWC2 remains unknown, it has been predicted that WWC2 may play a role in cancer.

<span class="mw-page-title-main">FAM63A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 63, member A is a protein that, is encoded by the FAM63A gene in humans,. It is located on the minus strand of chromosome 1 at locus 1q21.3.

C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

Niban-like protein 2.(NLP2) is a protein that in humans is encoded by the FAM129C gene. Paralogs of this gene include FAM129A, and FAM129B. Its aliases include B-Cell Novel Protein 1 (BCNP1), and Family with Sequence Similarity 129 Member C (FAM129C).

<span class="mw-page-title-main">Transmembrane protein 268</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 268 is a protein that in humans is encoded by TMEM268 gene. The protein is a transmembrane protein of 342 amino acids long with eight alternative splice variants. The protein has been identified in organisms from the common fruit fly to primates. To date, there has been no protein expression found in organisms simpler than insects.

<span class="mw-page-title-main">ANKRD24</span> Protein-coding gene in the species Homo sapiens

Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

<span class="mw-page-title-main">C9orf135</span> Mammalian protein found in Homo sapiens

C9orf135 is a gene that encodes a 229 amino acid protein. It is located on Chromosome 9 of the Homo sapiens genome at 9q12.21. The protein has a transmembrane domain from amino acids 124-140 and a glycosylation site at amino acid 75. C9orf135 is part of the GRCh37 gene on Chromosome 9 and is contained within the domain of unknown function superfamily 4572. Also, c9orf135 is known by the name of LOC138255 which is a description of the gene location on Chromosome 9.1.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.

Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.

Leukocyte Receptor Cluster Member 9 is an uncharacterized protein encoded by the LENG9 gene. In humans, LENG9 is predicted to play a role in fertility and reproductive disorders associated with female endometrium structures.

<span class="mw-page-title-main">CRACD-like protein</span>

CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.

The Family with sequence similarity 149 member B1 is an uncharacterized protein encoded by the human FAM149B1 gene, with one alias KIAA0974. The protein resides in the nucleus of the cell. The predicted secondary structure of the gene contains multiple alpha-helices, with a few beta-sheet structures. The gene is conserved in mammals, birds, reptiles, fish, and some invertebrates. The protein encoded by this gene contains a DUF3719 protein domain, which is conserved across its orthologues. The protein is expressed at slightly below average levels in most human tissue types, with high expression in brain, kidney, and testes tissues, while showing relatively low expression levels in pancreas tissues.

References

  1. "KIAA0232 KIAA0232 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-04-28.
  2. "NCBI GEO Expression Profiles".
  3. "EST Profile - Hs.79276". www.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  4. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  5. 1 2 "SAPS, Statistical Analysis of Protein Sequence". SDSC Biology Workbench.
  6. "PI, Isoelectric point determination". SDSC Biology Workbench. Archived from the original on 2003-08-11.
  7. "RCSB Protein Data Bank". Archived from the original on 2012-12-27.
  8. "NetPhos 2.0 Server". www.cbs.dtu.dk. Retrieved 2016-05-09.
  9. Bandyopadhyay, Sourav; Chiang, Chih-yuan; Srivastava, Jyoti; Gersten, Merril; White, Suhaila; Bell, Russell; Kurschner, Cornelia; Martin, Christopher H; Smoot, Mike (2010). "A human MAP kinase interactome". Nature Methods. 7 (10): 801–805. doi:10.1038/nmeth.1506. PMC   2967489 . PMID   20936779.
  10. Varjosalo, Markku; Keskitalo, Salla; Van Drogen, Audrey; Nurkkala, Helka; Vichalkovski, Anton; Aebersold, Ruedi; Gstaiger, Matthias (2013-04-25). "The Protein Interaction Landscape of the Human CMGC Kinase Group". Cell Reports. 3 (4): 1306–1320. doi: 10.1016/j.celrep.2013.03.027 . hdl: 20.500.11850/70524 . ISSN   2211-1247. PMID   23602568.