C11orf52 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C11orf52 , chromosome 11 open reading frame 52 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1914202; HomoloGene: 12059; GeneCards: C11orf52; OMA:C11orf52 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
C11orf52 is an uncharacterized protein that in homo sapiens is encoded by the C11orf52 gene.
C11orf52 is located on chromosome 11 at 11q23.1, starting at 111908620 and ending at 112064278. [5] C11orf52 spans 155658 base pairs and is orientated on the positive strand. Gene C11orf52 has a molecular weight of 14kDa and is a protein coding gene of 7,995 bp containing four exons. The coding region is made up of 1,168 bp. [6]
Genes HSPB2, CRYAB, OLAT, and PPIHP1 neighbor C11orf52 on chromosome 11.
C11orf52 is highly expressed in the thyroid, thalmus, pituitary, placenta, and prostate, kidney, heart, and skeletal muscles. [7] However, in estrogen receptor alpha-silenced MCF-7 breast cancer cells, it is expressed at an extremely low level compared to control tissues. [8]
There is only one variant of C11orf52 RNA. The mRNA sequence is 1,140 base pairs long. [6] There is an upstream stop codon located at nucleotides 65 – 67. The 23rd amino acid varies between threonine and arginine. [6]
The 123 amino acid chain is a domain of unknown function. [9] It has a molecular weight of 13,9 kDal and a predicted Isoelectric Point of 9.74 [6] C11orf52 is predicted to be targeted to the nucleus.
There are no isoforms of the protein encoded by C11orf52.
The LYS19-22 region is an external domain of the protein structure. [6]
There is only one member of the C11orf52 gene family and no splice isoforms can be found going back to Geospiza fortis - the most distantly related to Homo Sapiens C11orf52 sequence. Gene duplication first occurred approximately 324.5 million years ago in reptiles and birds. There are no paralogs for the C11orf52 gene.
Species | Common Name | Accession # | Sequence Length | mRNA Identity % | Sequence Similarity % |
---|---|---|---|---|---|
Homo sapiens | Human | NP_542390.2 | 123 | 100 | 100 |
Saimiri boliviensis boliviensis | Black-Capped Squirrel Monkey | XP_010332875.1 | 126 | 87 | 89 |
Equus caballus | Horse | XP_001916914.1 | 124 | 82 | 82 |
Oryctolagus cuniculus | European Rabbit | XP_002708495.1 | 126 | 75 | 83 |
Erinaceus europaeus | European Hedgehog | XP_007539796.1 | 124 | 59 | 69 |
Leptonychotes weddellii | Weddell Seal | XP_006733365.1 | 130 | 48 | 59 |
Pelodiscus sinensis | Chinese Softshell Turtle | XP_006111270.1 | 123 | 44 | 59 |
Anolis carolinensis | Carolina Anole | XP_008119278.1 | 125 | 42 | 57 |
Alligator mississippiensis | American Alligator | XP_006271409.1 | 118 | 39 | 54 |
Python bivittatus | Burmese Python | XP_007422684.1 | 160 | 38 | 59 |
Haliaeetus albicilla | White-Tailed Eagle | XP_009915555.1 | 118 | 46 | 60 |
Pseudopodoces humilis | Ground Tit | XP_005531117.1 | 118 | 46 | 58 |
Cariama cristata | Red-legged Seriema | XP_009702852.1 | 118 | 44 | 58 |
Apaloderma vittatum | Bar-tailed Trogon | XP_009868605.1 | 118 | 44 | 59 |
Anas platyrhynchos | Mallard | XP_005030930.1 | 139 | 43 | 58 |
Tinamus guttatus | White-throated tinamou | XP_010217725.1 | 118 | 42 | 57 |
Ficedula albicollis | Collared Flycatcher | XP_005058859.1 | 122 | 38 | 52 |
Geospiza fortis | Medium Ground Finch | XP_005427571.1 | 117 | 36 | 52 |
Unusual DNA methylation in the C11orf52 gene in some children can be attributed to prenatal smoke exposure. [10]
C11orf52 may also play a role in lung cancer. C11orf52 is expressed in the lungs and has been associated with increased phosphorylation in cell lung cancer tumors. There is evidence that phosphorylation mechanisms exist which enhance proteins and pathways which should have inhibited phosphorylation in order to prevent extreme proliferation. C11orf52 is one gene where the phosphorylation is significantly different between the cancerous cells and normal tissue. [11]
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.
C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
CAP-Gly Domain Containing Linker Protein Family Member 4 is a protein that in humans is encoded by the CLIP4 gene. In terms of conserved domains, the CLIP4 gene contains primarily ankyrin repeats and the eponymous CAP-Gly domains. The structure of the CLIP4 protein is largely made up of coil, with alpha helices dominating the rest of the protein. CLIP4 mRNA expression occurs largely in the adrenal cortex and atrioventricular node. The literature encompassing CLIP4's conserved domains and paralogs points toward microtubule regulation as a possible function of CLIP4.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.
TBC1D30 is a gene in the human genome that encodes the protein of the same name. This protein has two domains, one of which is involved in the processing of the Rab protein. Much of the function of this gene is not yet known, but it is expressed mostly in the brain and adrenal cortex.
TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene.
THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.
Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.