CRACDL | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | CRACDL , C2orf55, KIAA1211-like, KIAA1211 like, KIAA1211L, CRACD like | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1919347 HomoloGene: 19208 GeneCards: CRACDL | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. [5] Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. [6] [7] Finally, CRACDL is associated with certain mental disorders and various cancers. [8] [9]
Chromosome | 2 (2q.11.2) [10] |
Location | 98,793,846 bp from pter to 98,936,259 bp from pter [10] |
Size | 142,414 bases [10] |
Accession Number | NM_207362 [11] |
Also Known As | KIAA1211 Like C2orf55 Chromosome 2 Open Reading Frame 55 [10] |
CRACDL is a protein-coding gene. [10] The table above presents the gene's alias, location, size and accession number.
There are 11 splice isoforms of the CRACDL. [5] The validated isoform has 10 exons. [5]
Amino Acid Length | 962 [10] |
Molecular Weight | 102 kda [12] |
Isoelectric Point | 8 [12] |
Accession Number | NP_997245.2 [11] |
Also Known As | Uncharacterized Protein KIAA1211-like [10] Uncharacterized Protein C2orf55 [10] Hypothetical Protein LOC343990 [13] |
The table above presents the protein's alias, size, and accession number. The CRACD-L protein is proline rich and asparagine, isoleucine, phenylalanine, and tyrosine poor. [12]
The CRACD-L protein has one domain called the DUF4592 motif and spans amino acids 131–239. [14] This domain is highly conserved among the CRACDL orthologs. The DUF4592 motif is depicted in both the conceptual translation and schematic figures.
CRACDL is phosphorylated at the Ser92 and Ser490 amino acids. [15] The KIAA1211L protein is also predicted to have five different SUMOylation sites located at Lys134, Lys375, Lys866, Lys874, and Lys914. [16] Both the phosphorylated sites and the SUMOylation sites are depicted in the conceptual translation and schematic figures.
The CRACD-L protein predicted secondary structure is composed of 50% alpha helixes, 8.9% beta sheets, and 17.9% turns. [17] The high number of turns is consistent with the fact that CRACD-L is proline rich. [12]
The CRACD-L protein is predicted to be located in the nucleus. [7] The orthologs, including the elephant shark, horse, rock dove, and chimp, are also predicted to be located in the nucleus. [7] The nuclear location signal is located on amino acids 25-43 which is depicted in both the conceptual translation and schematic figures. . [7] This signal is conserved throughout the orthologs. Additionally, this location (amino acids 24-43) is positively charged, probably due to the high amount of lysine at this location. [12] Finally, it is predicted that CRACD-L is mainly localized to the microtubules and centrosome and sometimes localized to the cytokinetic bridge. [6]
The gene is highly expressed in the cerebral cortex of the brain. [5] The CRACD-L protein is located in many different tissue types, including the brain, the hippocampus, the lung, breast carcinoma, the islets of Langerhans, the pancreas, the kidney, and 38 other tissues. [18] Additionally, it is expressed an average amount compared to other human proteins. [19]
The promoter region of CRACDL is approximately 1340 base pairs with various predicted transcription factors. [20] The glial cells missing homolog 1 and the oligodendrocyte lineage transcription factors are notable because CRACDL is highly expressed in the brain. [20] [5] Furthermore, the Estrogen-related receptor alpha is also a notable transcription factor due to CRACDL's low expression levels when estrogen receptors are knocked down. [21] [20] Furthermore, CRACDL is predicted to be SUMOylated. [16] The 3' UTR of CRACDL is predicted to be a targeted by miRNA-132, which is depicted in the conceptual translation figure. [22]
Glycogen Synthase Kinase 3 Beta (GSK3B)
GSK3B is a protein kinase that regulates transcription factors and microtubules. [23] As such, it phosphorylates proteins, decreasing their ability to bind and stabilize microtubules. [23] The proteins it phosphorylates are the principle components of neurofibrillary tangles in Alzheimer disease. [23] The protein is needed for the establishment of neuronal polarity and axon outgrowth and phosphorylates proteins in neuroblastoma cells. [23] Furthermore, it is associated with bipolar disease and is active in breast cancer cells. [23] [24]
As such, the predicted interaction between CRACDL and GSK3B is likely because CRACDL is highly expressed in the brain, associated with bipolar disorder and breast cancer, and is localized on the microtubules. [5] [6] [8] [9] The interaction between GSK3B and CRACDL was predicted using anti bait coimmunoprecipitation, pull down, tandem affinity purification, fluorescence polarization spectroscopy, protein kinases assay, two hybrid, and confocal microscopy experiments. [25]
CRACD-L protein is also predicted to interact with Alpha-synuclein (SNCA), E3 Ubiquitin-Protein Ligase Mdm2 (MDM2), Serine/Threonine-Protein Kinase PAK 1 (PAK 1), and DNA Replication Factor Cdt1 (CDT1). [25]
CRACDL is associated with depression, bipolar disorder, and schizophrenia. [9] Additionally, CRACDL is associated with various cancers including ovarian, breast, etc. [8]
KIAA1211 is the paralog to KIAA1211L. KIAA1211 is located on chromosome 4 and has 1233 amino acids. [26] Its percent identity to KIAA1211L is 21%. [27] The KIAA1211 has an ortholog in the bacteria Proteus vulgarism, indicating the paralog duplicated 4290 million years ago, before KIAA1211L. [28] [29]
Below is the table of various KIAA1211L orthologs. It includes closely, intermediately, and distantly related orthologs. The most distant ortholog is the elephant shark, indicating KIAA1211L duplicated 473 MYA. The amino acids conserved among all the KIAA1211L orthologs are depicted in the conceptual translation.
Species [30] | NCBI Accession # [30] | Date of Divergence [31] | Sequence Identity [12] | Sequence Similarity [32] |
---|---|---|---|---|
Pan troglodytes (Chimpanzee) | XP_515643.2 | 6.65 MYA | 99.1% | 99.3% |
Octodon negus (Degu) | XP_004633240.1 | 90 MYA | 65.9% | 73.1% |
Panthera pardus (Leopard) | XP_019312964.1 | 96 MYA | 67.8% | 73.3% |
Anas platyrhynchos (Mallard Duck) | XP_012949224.1 | 312 MYA | 41.2% | 52.40% |
Pygoscelis adeliae (Adélie penguin) | XP_009321834.1 | 312 MYA | 38.5% | 51.6% |
Python bivittatus (Burmese python) | XP_007428826 | 312 MYA | 34.2% | 46.3% |
Nanorana parker (High Himalaya frog) | XP_018418330.1 | 352 MYA | 32.1% | 43.7% |
Callorhinchus milii (Elephant Shark) | XP_007889338.1 | 473 MYA | 30.5% | 42.4% |
The CRACDL gene is similar and conserved in mammals, birds, reptiles, amphibians, and fish. It is not conserved in bacteria, archaea, protists, plants, fungus, trichoplax, and invertebrates.
Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Transmembrane protein 44 is a protein that in humans is encoded by the TMEM44 gene.
TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
RING Finger Protein 227, also known as RNF227 and LINC02581, is a protein which in humans is encoded by the RNF227 gene. According to DNA microarray data, it is found in at least 15 tissues.
C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.
C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.
C4orf19 is a protein which in humans is encoded by the C4orf19 gene.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.
Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.