CIMAP1C (ciliary microtubule associated protein 1C) is a gene in humans that encodes the CIMAP1C protein. It is also often referred to as ODF3L1 (outer dense fiber protein 3-like protein 1). CIMAP1C is expressed in low levels throughout the body with high expression levels in the testes. It is highly conserved in mammals and reptiles but not present in birds or amphibians, indicating it arose around 300 million years ago.
The CIMAP1C gene is found on Chromosome 15 at position 15q24.2, spanning 3.72 kb. [1] It contains 4 exons and 1,132 nucleotides. [2]
CIMAP1C is also known as ODF3L1 (outer dense fiber protein 3 like protein 1). [2]
The CIMAP1C protein is 274 amino acids long and its molecular weight is ~31 kDa. [4] Its isoelectric point is 9.6. [5] There are no known isoforms. [2] CIMAP1C is very basic, proline rich, and highly tyrosine rich. [6] There are two structurally significant regions known as sperm tail proline-glycine rich repeat regions. [2] These regions are highly conserved and contain glycosylation and phosphorylation sites. [7]
CIMAP1C is expressed at low levels in most tissues in the body (15.7%), with high expression in the testis. [2] Slightly high levels of expression are seen in the mammary glands. [2]
CIMAP1C is imported into the nucleus via two nuclear localization signals, and can otherwise be found in the cytoskeleton. [2] [8] CIMAP1C contains a variety of phosphorylation sites, specifically protein kinase C and Casein kinase II phosphorylation. [7]
CIMAP1C is highly conserved, and can be found in mammals and reptiles. CIMAP1C is not found in amphibians or birds, therefore it arose 300 million years ago. [9] Based on the calculated protein divergence figure, CIMAP1C evolves rapidly and similarly to Fibrinogen Alpha. The table below lists orthologs and their information.
Genus and Species | Common Name | Taxonomic Group | Mediat Date of Divergence (MYA) [9] | Accession number | Sequence length (aa) | Sequence Identity to human protein (%) | Sequence Similarity to human Protein (%) |
---|---|---|---|---|---|---|---|
Homo sapiens | Human | Primates | 0.0 | NP_787077.1 | 274 | 100.0 | 100.0 |
Saimiri boliviensis | Squirrel Monkey | Primates | 43 | XP_010329148.1 | 274 | 87.6 | 92.7 |
Canis lupus familiaris | Dog | Carnivora | 94 | XP_038297715.1 | 275 | 78.2 | 88.4 |
Mus musculus | House Mouse | Rodentia | 87 | NP_001361610.1 | 306 | 64.4 | 75.5 |
Myotis brandtii | Brandt's Bat | Chiroptera | 94 | XP_005884580.1 | 275 | 74.5 | 85.8 |
Gracilinanus agilis | Mouse Opossum | Didelphimorphia | 160 | XP_044518760.1 | 278 | 58.8 | 73.8 |
Bubalus bubalis | Water buffalo | Artiodactyla | 94 | XP_006080752.3 | 274 | 73.1 | 84.4 |
Pogona vitticeps | Cental Bearded Dragon | Squamata | 319 | XP_020640486.1 | 285 | 40.2 | 54.3 |
Notechis scutatus | Tiger Snake | Squamata | 319 | XP_026540911.1 | 195 | 28.4 | 41.4 |
Dermochelys coriacea | Leatherback Sea Turtle | Testudines | 319 | XP_038274105.1 | 274 | 37.8 | 54.7 |
Alligator mississippiensis | American Alligator | Crocodylia | 319 | XP_019333216.1 | 249 | 37.8 | 52.8 |
Phascolarctos cinereus | Koala | Diprotodontia | 160 | XP_020850090.1 | 278 | 59 | 73.4 |
Phocoena sinus | Vaquita | Artiodactyla | 94 | XP_032479987.1 | 275 | 77.5 | 87.3 |
Eublepharis macularius | Leopard gecko | Squamata | 319 | XP_054858185.1 | 276 | 38.5 | 50.7 |
Ornithorhynchus anatinus | Platypus | Monotremata | 180 | XP_001509549.1 | 269 | 45.8 | 58.8 |
It has been experimentally determined that CIMAP1C interacts with STAMBP (STAM binding protein) through affinity chromatography. [10] STAMBP regulates cell surface receptor-mediated endocytosis and ubiquitin-dependent receptor sorting to lysosomes. [11]
Expression levels of CIMAP1C are positive markers of disease prognosis for those with papillary thyroid carcinoma (PTC) and non-metastatic breast cancer (NMBC). High levels of CIMAP1C in tumor tissues accurately predicted a better prognosis for individuals with PTC. [12] CIMAP1C is also noted as a positive indicator for the effectiveness of NMBC radiotherapy. [13]
Transmembrane protein 131-like, alternatively named uncharacterized protein KIAA0922, is an integral transmembrane protein encoded by the human gene KIAA0922 that is significantly conserved in eukaryotes, at least through protists. Although the function of this gene is not yet fully elucidated, initial microarray evidence suggests that it may be involved in immune responses. Furthermore, its paralog, prolyl endopeptidase (PREP) whose function is known, provides clues as to the function of TMEM131L.
Coiled-coil domain containing 130 is a protein that in humans is encoded by the CCDC130 gene. It is part of the U4/U5/U6 tri-snRNP in the U5 portion. This tri-snRNP comes together with other proteins to form complex B of the mature spliceosome. The mature protein is approximately 45 kilodaltons (kDa) and is extremely hydrophilic due to the abnormally high number of charged and polar amino acids. CCDC130 is a highly conserved protein, it has orthologous genes in some yeasts and plants that were found using nucleotide and protein versions of the basic local alignment search tool (BLAST) from the National Center for Biotechnology Information. GEO profiles for CCDC130 have shown that this protein is ubiquitously expressed, but the highest levels of expression are found in T-lymphocytes.
PRP36 is an extracellular protein in Homo sapiens that is encoded by the PRR36 gene that contains a domain of unknown function, DUF4596, towards the C terminus of the protein. The function of PRP36 is unknown, but high gene expression has been observed in various regions of the brain such as the prefrontal cortex, cerebellum, and the amygdala. PRP36 has one alias: Putative Uncharacterized Protein FLJ22184.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.
Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.
UPF0575 protein C19orf67 is a protein which in humans is encoded by the C19orf67 gene. Orthologs of C19orf67 are found in many mammals, some reptiles, and most jawed fish. The protein is expressed at low levels throughout the body with the exception of the testis and breast tissue. Where it is expressed, the protein is predicted to be localized in the nucleus to carry out a function. The highly conserved and slowly evolving DUFF3314 region is predicted to form numerous alpha helices and may be vital to the function of the protein.
The Family with sequence similarity 149 member B1 is an uncharacterized protein encoded by the human FAM149B1 gene, with one alias KIAA0974. The protein resides in the nucleus of the cell. The predicted secondary structure of the gene contains multiple alpha-helices, with a few beta-sheet structures. The gene is conserved in mammals, birds, reptiles, fish, and some invertebrates. The protein encoded by this gene contains a DUF3719 protein domain, which is conserved across its orthologues. The protein is expressed at slightly below average levels in most human tissue types, with high expression in brain, kidney, and testes tissues, while showing relatively low expression levels in pancreas tissues.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.
Chromosome 1 open reading frame 198 (C1orf198) is a protein that in humans is encoded by the C1orf198 gene. This particular gene does not have any paralogs in Homo sapiens, but many orthologs have been found throughout the Eukarya domain. C1orf198 has high levels of expression in all tissues throughout the human body, but is most highly expressed in lung, brain, and spinal cord tissues. Its function is most likely involved in lung development and hypoxia-associated events in the mitochondria, which are major consumers of oxygen in cells and are severely affected by decreases in available cellular oxygen.
C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.
C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.
WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.
SH3 Domain Binding Kinase Family Member 3 is an enzyme that in humans is encoded by the SBK3 gene. SBK3 is a member of the serine/threonine protein kinase family. The SBK3 protein is known to exhibit transferase activity, especially phosphotransferase activity, and tyrosine kinase activity. It is well-conserved throughout mammalian organisms and has two paralogs: SBK1 and SBK2.
Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well-understood by the scientific community, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
Protein FAM110A, also known as protein family with sequence similarity 110, A, C20orf55 or BA371L19.3 is encoded by the FAM110A gene. FAM110A is located on chromosome 20 and is a part of the greater FAM110 gene family, consisting of FAM110A, FAM110B, and FAM110C.
SPMIP10 is a protein that in Homo sapiens is encoded by the SPMIP10 gene.
{{cite web}}
: Missing or empty |url=
(help){{cite web}}
: Missing or empty |url=
(help)