KIAA1211L

CRACDL
Identifiers
Aliases: CRACDL, C2orf55, KIAA1211-like, KIAA1211 like, KIAA1211L, CRACD like
External IDs: MGI: 1919347; HomoloGene: 19208; GeneCards: CRACDL
Orthologs (human; mouse)
Ensembl: ENSG00000196872 [1]; ENSMUSG00000026090 [2]
RefSeq (mRNA): NM_207362; NM_028096
RefSeq (protein): NP_997245; n/a
Location (UCSC): Chr 2: 98.79 – 98.94 Mb; Chr 1: 37.61 – 37.72 Mb
PubMed search: [3]; [4]

KIAA1211L is a protein that in humans is encoded by the KIAA1211L gene. It is highly expressed in the brain (cerebral cortex). [5] It localizes to the microtubules and centrosomes and is predicted to reside in the nucleus. [6] [7] KIAA1211L is also associated with certain mental disorders and various cancers. [8] [9]


Gene

Chromosome: 2 (2q11.2) [10]
Location: 98,793,846 bp from pter to 98,936,259 bp from pter [10]
Size: 142,414 bases [10]
Accession Number: NM_207362 [11]
Also Known As: KIAA1211 Like; C2orf55; Chromosome 2 Open Reading Frame 55 [10]

KIAA1211L is a protein-coding gene. [10] The table above presents the gene's alias, location, size and accession number.
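The size in the table follows from the two coordinates when both endpoints are counted. The short Python sketch below shows that arithmetic; the inclusive, 1-based convention is an assumption (it happens to reproduce the stated figure), and the helper name is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based genomic interval (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end must not precede start")
    return end_bp - start_bp + 1


# Coordinates quoted in the gene table (chromosome 2, measured from pter).
KIAA1211L_START = 98_793_846
KIAA1211L_END = 98_936_259

gene_size = span_length(KIAA1211L_START, KIAA1211L_END)
print(f"{gene_size:,} bases")  # 142,414 bases
```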

mRNA

There are 11 splice isoforms of the gene KIAA1211L. [5] The validated isoform has 10 exons. [5]

Protein

Conceptual Translation KIAA1211L Part 1. The upstream stop codon is orange, the exon boundaries are highlighted blue, the start codon is green, the predicted nuclear localization signal is denoted by a purple box, the conserved regions are highlighted orange, the phosphorylation sites are highlighted yellow, the predicted SUMOylation sites are denoted by green boxes, and the DUF4592 motif is highlighted red.
Conceptual Translation KIAA1211L Part 2. The exon boundaries are highlighted blue, the conserved regions are highlighted orange, the predicted SUMOylation sites are denoted by green boxes, the stop codon is red, the predicted miRNA-132 target is underlined in purple, and the polyadenylation signal is highlighted magenta.
Amino Acid Length: 962 [10]
Molecular Weight: 102 kDa [12]
Isoelectric Point: 8 [12]
Accession Number: NP_997245.2 [11]
Also Known As: Uncharacterized Protein KIAA1211-like [10]; Uncharacterized Protein C2orf55 [10]; Hypothetical Protein LOC343990 [13]

The table above presents the protein's aliases, size, and accession number. The KIAA1211L protein is rich in proline and poor in asparagine, isoleucine, phenylalanine, and tyrosine. [12]

Domains and motifs

KIAA1211L Schematic. This figure is a schematic illustration of KIAA1211L. The nuclear localization signal is blue, the phosphorylation sites are red, the SUMOylation sites are grey, and the DUF4592 motif is orange.

The KIAA1211L protein has one domain, the DUF4592 motif, which spans amino acids 131–239. [14] This domain is highly conserved among the KIAA1211L orthologs. The DUF4592 motif is depicted in both the conceptual translation and schematic figures.

Post-translational modifications

KIAA1211L is phosphorylated at Ser92 and Ser490. [15] The protein is also predicted to have five SUMOylation sites, located at Lys134, Lys375, Lys866, Lys874, and Lys914. [16] Both the phosphorylation sites and the SUMOylation sites are depicted in the conceptual translation and schematic figures.
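The coordinates reported in this article can be gathered into a small feature table to see how the modification sites relate to the domain. The sketch below is illustrative only: the data layout and function names are hypothetical, and the positions are simply those quoted in the text (DUF4592 at 131–239, phosphoserines at 92 and 490, predicted SUMOylation lysines at 134, 375, 866, 874, and 914, in a 962-residue protein).

```python
from dataclasses import dataclass

PROTEIN_LENGTH = 962  # amino acids, as stated in the protein table


@dataclass(frozen=True)
class Region:
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    def length(self) -> int:
        return self.end - self.start + 1

    def contains(self, position: int) -> bool:
        return self.start <= position <= self.end


DUF4592 = Region("DUF4592", 131, 239)

# Residue positions quoted in the text, keyed by modification type.
SITES = {
    "phosphorylation": [92, 490],
    "SUMOylation (predicted)": [134, 375, 866, 874, 914],
}


def sites_in_region(region: Region, sites: dict) -> dict:
    """Return, per modification type, the positions that fall inside the region."""
    return {
        kind: [p for p in positions if region.contains(p)]
        for kind, positions in sites.items()
    }


coverage = 100 * DUF4592.length() / PROTEIN_LENGTH
print(f"{DUF4592.name}: {DUF4592.length()} residues ({coverage:.1f}% of the protein)")
for kind, inside in sites_in_region(DUF4592, SITES).items():
    print(f"{kind} sites inside {DUF4592.name}: {inside}")
```

With these coordinates, the domain is 109 residues long, and only the predicted SUMOylation site at Lys134 lies inside it.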

Secondary structure

The predicted secondary structure of the KIAA1211L protein is composed of 50% alpha helices, 8.9% beta sheets, and 17.9% turns. [17] The high proportion of turns is consistent with KIAA1211L being proline rich. [12]

Subcellular location

The KIAA1211L protein is predicted to be located in the nucleus. [7] Its orthologs, including those in the elephant shark, horse, rock dove, and chimpanzee, are also predicted to be nuclear. [7] The nuclear localization signal spans amino acids 25–43 and is depicted in both the conceptual translation and schematic figures. [7] This signal is conserved throughout the orthologs. The region is also positively charged, probably because of its high lysine content. [12] Finally, KIAA1211L is predicted to localize mainly to the microtubules and centrosome and sometimes to the cytokinetic bridge. [6]

Expression

The gene is highly expressed in the brain (cerebral cortex). [5] The KIAA1211L protein is found in many tissue types, including the brain, the hippocampus, the lung, breast carcinoma, the islets of Langerhans, the pancreas, the kidney, and 38 other tissues. [18] Its abundance is about average compared with other human proteins. [19]

Regulation of transcription

The promoter region of KIAA1211L is approximately 1340 base pairs long and has binding sites predicted for various transcription factors. [20] Glial cells missing homolog 1 and the oligodendrocyte lineage transcription factors are notable because KIAA1211L is highly expressed in the brain. [20] [5] Estrogen-related receptor alpha is also notable because KIAA1211L expression is low when estrogen receptors are knocked down. [21] [20] In addition, KIAA1211L is predicted to be SUMOylated. [16] The 3' UTR of KIAA1211L is predicted to be targeted by miRNA-132, which is depicted in the conceptual translation figure. [22]

Function

Interacting proteins

Glycogen Synthase Kinase 3 Beta (GSK3B)

GSK3B is a protein kinase that regulates transcription factors and microtubules. [23] It phosphorylates proteins, decreasing their ability to bind and stabilize microtubules. [23] The proteins it phosphorylates are the principal components of neurofibrillary tangles in Alzheimer's disease. [23] GSK3B is needed for the establishment of neuronal polarity and axon outgrowth, and it phosphorylates proteins in neuroblastoma cells. [23] Furthermore, it is associated with bipolar disorder and is active in breast cancer cells. [23] [24]

The predicted interaction between KIAA1211L and GSK3B is plausible because KIAA1211L is highly expressed in the brain, is associated with bipolar disorder and breast cancer, and localizes to the microtubules. [5] [6] [8] [9] The interaction was predicted from anti-bait coimmunoprecipitation, pull-down, tandem affinity purification, fluorescence polarization spectroscopy, protein kinase assay, two-hybrid, and confocal microscopy experiments. [25]

The KIAA1211L protein is also predicted to interact with Alpha-synuclein (SNCA), E3 Ubiquitin-Protein Ligase Mdm2 (MDM2), Serine/Threonine-Protein Kinase PAK 1 (PAK1), and DNA Replication Factor Cdt1 (CDT1). [25]

Clinical significance

KIAA1211L is associated with depression, bipolar disorder, and schizophrenia. [9] It is also associated with various cancers, including ovarian and breast cancer. [8]

Homology

Paralogs

KIAA1211 is the paralog of KIAA1211L. KIAA1211 is located on chromosome 4 and encodes a protein of 1233 amino acids. [26] Its percent identity to KIAA1211L is 21%. [27] KIAA1211 has an ortholog in the bacterium Proteus vulgaris, suggesting that the paralog diverged about 4290 million years ago, before KIAA1211L. [28] [29]
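A percent identity figure such as the one quoted for the two paralogs is derived from a pairwise alignment. As a rough illustration of the arithmetic only, the sketch below counts identical columns in two already-aligned strings and divides by the alignment length, gaps included. That choice of denominator is an assumption (tools differ on it), the function name is hypothetical, and the toy sequences are made up; they are not KIAA1211L or KIAA1211.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Both inputs must already be aligned (equal length, gaps as `gap`).
    The denominator is the full alignment length, gaps included (assumed).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap
    )
    return 100.0 * matches / len(aligned_a)


# Made-up toy alignment, for illustration only.
toy_a = "MKPLS-PQRT"
toy_b = "MRPLSAPE-T"
print(f"{percent_identity(toy_a, toy_b):.1f}%")  # 60.0%
```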

Orthologs

The table below lists various KIAA1211L orthologs, ranging from closely to distantly related. The most distant ortholog is in the elephant shark, suggesting that KIAA1211L arose about 473 million years ago (MYA). The amino acids conserved among all the KIAA1211L orthologs are depicted in the conceptual translation.

Species [30] | NCBI Accession # [30] | Date of Divergence [31] | Sequence Identity [12] | Sequence Similarity [32]
Pan troglodytes (Chimpanzee) | XP_515643.2 | 6.65 MYA | 99.1% | 99.3%
Octodon degus (Degu) | XP_004633240.1 | 90 MYA | 65.9% | 73.1%
Panthera pardus (Leopard) | XP_019312964.1 | 96 MYA | 67.8% | 73.3%
Anas platyrhynchos (Mallard Duck) | XP_012949224.1 | 312 MYA | 41.2% | 52.4%
Pygoscelis adeliae (Adélie penguin) | XP_009321834.1 | 312 MYA | 38.5% | 51.6%
Python bivittatus (Burmese python) | XP_007428826 | 312 MYA | 34.2% | 46.3%
Nanorana parkeri (High Himalaya frog) | XP_018418330.1 | 352 MYA | 32.1% | 43.7%
Callorhinchus milii (Elephant Shark) | XP_007889338.1 | 473 MYA | 30.5% | 42.4%
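
One way to read the table is to group the orthologs by divergence date and average their identity to the human protein. The sketch below does that with values transcribed from the table; the tuple layout and function name are hypothetical, and the grouping is only a convenience for reading the trend, not an analysis from the cited sources.

```python
from collections import defaultdict
from statistics import mean

# (common name, divergence in MYA, % identity, % similarity), from the table.
ORTHOLOGS = [
    ("Chimpanzee", 6.65, 99.1, 99.3),
    ("Degu", 90, 65.9, 73.1),
    ("Leopard", 96, 67.8, 73.3),
    ("Mallard duck", 312, 41.2, 52.4),
    ("Adélie penguin", 312, 38.5, 51.6),
    ("Burmese python", 312, 34.2, 46.3),
    ("High Himalaya frog", 352, 32.1, 43.7),
    ("Elephant shark", 473, 30.5, 42.4),
]


def mean_identity_by_divergence(rows):
    """Map each divergence date to the mean % identity of its orthologs."""
    groups = defaultdict(list)
    for _name, mya, identity, _similarity in rows:
        groups[mya].append(identity)
    # Sort by divergence date so the output runs from recent to ancient.
    return {mya: round(mean(vals), 1) for mya, vals in sorted(groups.items())}


for mya, identity in mean_identity_by_divergence(ORTHOLOGS).items():
    print(f"{mya:>6} MYA: {identity:.1f}% mean identity")
```

In this table, identity to the human protein falls from 99.1% for the chimpanzee to 30.5% for the elephant shark as the divergence date increases.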

Phylogeny

The KIAA1211L gene is conserved in mammals, birds, reptiles, amphibians, and fish. It is not conserved in bacteria, archaea, protists, plants, fungi, Trichoplax, or invertebrates.

Citations

  1. GRCh38: Ensembl release 89: ENSG00000196872 - Ensembl, May 2017
  2. GRCm38: Ensembl release 89: ENSMUSG00000026090 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "KIAA1211L KIAA1211 like [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-04-23.
  6. "Cell atlas - KIAA1211L - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2017-04-23.
  7. "GenScript Protein Subcellular Location Prediction Tool". [permanent dead link]
  8. Spurrell CH (2013). Identifying New Genes for Inherited Breast Cancer by Exome Sequencing (Doctor of Philosophy thesis). University of Washington.
  9. Iwamoto K, Kakiuchi C, Bundo M, Ikeda K, Kato T (April 2004). "Molecular characterization of bipolar disorder by comparing gene expression profiles of postmortem brains of major mental disorders". Molecular Psychiatry. 9 (4): 406–16. doi:10.1038/sj.mp.4001437. PMID 14743183.
  10. GeneCards Human Gene Database. "KIAA1211L Gene - GeneCards | K121L Protein | K121L Antibody". www.genecards.org. Retrieved 2017-02-24.
  11. "Homo sapiens KIAA1211 like (KIAA1211L), mRNA - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-04-23.
  12. NCSA Biology Workbench. "SDSC Biology Workbench". workbench.sdsc.edu. Retrieved 2017-04-23.
  13. "Genatlas sheet". genatlas.medecine.univ-paris5.fr. Retrieved 2017-04-23.
  14. "Pfam: Family: DUF4592 (PF15262)". pfam.xfam.org. Retrieved 2017-04-23.
  15. "Homo sapiens KIAA1211 like (KIAA1211L), mRNA - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-04-23.
  16. "SUMOplot™ Analysis Program | Abgent". www.abgent.com. Retrieved 2017-04-23.
  17. Kumar, Prof. T. Ashok. "BioGem.Org - Ashok Kumar's Bioinformatics Portal... | Home". www.biogem.org. Retrieved 2017-04-23.
  18. "Tissue expression of KIAA1211L - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2017-04-23.
  19. "KIAA1211L protein abundance in PaxDb". pax-db.org. Retrieved 2017-04-23.
  20. "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Retrieved 2017-05-07.
  21. Al Saleh S, Al Mulla F, Luqmani YA (2011). "Estrogen receptor silencing induces epithelial to mesenchymal transition in human breast cancer cells". PLOS ONE. 6 (6): e20610. Bibcode:2011PLoSO...620610A. doi:10.1371/journal.pone.0020610. PMC 3119661. PMID 21713035.
  22. Alvarez-Saavedra, M (2010). "MicroRNA-132-Dependent Post-Transcriptional Regulation of Clock Entrainment Physiology Via Modulation of Chromatin Remodeling and Translational Control Gene Targets". University of Ottawa.
  23. "GSK3B - Glycogen synthase kinase-3 beta - Homo sapiens (Human) - GSK3B gene & protein". www.uniprot.org. Retrieved 2017-04-23.
  24. GeneCards Human Gene Database. "GSK3B Gene - GeneCards | GSK3B Protein | GSK3B Antibody". www.genecards.org. Retrieved 2017-04-23.
  25. "IntAct". www.ebi.ac.uk. Retrieved 2017-04-23.
  26. GeneCards Human Gene Database. "KIAA1211 Gene - GeneCards | K1211 Protein | K1211 Antibody". www.genecards.org. Retrieved 2017-04-23.
  27. Myers EW, Miller W (March 1988). "Optimal alignments in linear space". Computer Applications in the Biosciences. 4 (1): 11–7. doi:10.1093/bioinformatics/4.1.11. PMID 3382986.
  28. "kiaa1211l KIAA1211 like [Callorhinchus milii (elephant shark)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-02-24.
  29. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2017-02-24.
  30. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2017-04-23.
  31. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2017-04-23.
  32. EMBL-EBI. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2017-04-23.
