C11orf54 | |||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| |||||||||||||||||||||||||||||||||||||||||||||||
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C11orf54 , PTD012, PTOD012, chromosome 11 open reading frame 54 | ||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 615810 MGI: 1918234 HomoloGene: 8531 GeneCards: C11orf54 | ||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||
|
Chromosome 11 open reading frame 54 (C11orf54) is a protein that in humans is encoded by the C11orf54 gene. [5] The "Homo sapiens" gene, C11orf54 is also known as PTD012 and PTOD12. C11orf54 exhibits hydrolase activity on p-nitrophenyl acetate and acts on ester bonds, though the overall function is still not fully understood by the scientific community. The protein is highly conserved with the most distant homolog found is in bacteria. [6]
C11orf54 is located on chromosome 11 at 11q21. Common aliases of the gene are PTD012 and PT0D12. The gene consists 13 exons and spans 23730 bp. C11orf54 is flanked by TAF1D and MED17. [6]
The protein ester hydrolase c11orf54 exists as a monomer and is composed of 315 amino acids. There are 6 isoforms for C11orf54. See table 1. [6]
Variant | Isoform | Length (bp) | Accession Number |
---|---|---|---|
1 | ester hydrolase C11orf54 isoform a | 2726 | NM_001286067.1 |
2 | ester hydrolase C11orf54 isoform a | 2589 | NM_001286068.1 |
3 | ester hydrolase C11orf54 isoform a | 2594 | NM_001286069.1 |
4 | ester hydrolase C11orf54 isoform b | 2444 | NM_014039.3 |
5 | ester hydrolase C11orf54 isoform c | 2442 | NM_001286070.1 |
6 | ester hydrolase C11orf54 isoform d | 2417 | NM_001286071.1 |
The amino acid sequence contains the domain of unknown function 1907. Found in this transcript is the HxHxxxxxxxxxH motif which coordinates the zinc ion involved in the hydrolase activity. [7] An LR nest motif is found at lys262 and Arg263. The LR nest motif forms hydrogen bonds between the NH groups and anions; an acetate anion is coordinated with the LR nest. [8]
Table 2 shows the different characteristics of the protein sequence throughout humans and other orthologs. [9]
Organism | Molecular Weight (kiloDalton) | Isoelectric point | High Bias Amino Acids | Repeats |
---|---|---|---|---|
Human | 35.1 | 5.9 | F | AEFS |
Mouse | 35.0 | 5.9 | H | None |
13 Lined Ground Squirrel | 35.1 | 6.0 | F,H | PAEF |
Giant Panda | 35.2 | 6.5 | F | PAEF |
The protein of C11orf54 exists as a monomer in solution. The protein assumes a globular shape of 20 beta strands and 4 alpha helices, containing 9 antiparallel beta strands forming a beta screw region. The β-screw region of C11orf54 has structural similarity to the cyclic adenosine 3′,5′-monophosphate (cAMP) binding domain of the regulatory subunit of protein kinase A. A zinc ion is bound to the HxHxxxxxxxxxH motif found in the sequence. [7]
C11orf54 is predicted to be localized 60.9% in the cytoplasm, 21.7% in the nucleus, 13.0% mitochondrial and 4.3% in the Golgi Apparatus. [10]
See image one. [11] [12] The protein is highly expressed in the kidneys and moderately expressed in the adrenal gland, colon, liver, testis and thyroid gland. [13]
There are no paralogs for C11orf54. [5]
The protein Ester Hydrolase C11orf54 has many orthologs (see table.) It is highly conserved (60-100% identity) in mammals, reptiles, birds, and fish. The protein is moderately conserved (30-59.99% identity) in invertebrates, amphibia, Cnidaria, Mollusca, fungi and bacteria. It is not conserved in archaea. [9] The most distant orthologs are bacteria. Figure 2 shows the unrooted phylogenetic tree of a few of C11orf54’s orthologs. [5]
Species | Common Name | Class | Accession Number | Percent Identity | Divergence (MYA median) |
---|---|---|---|---|---|
Microtus ochrogaster | Prairie Vole | mammalia | XP_005346877.1 | 87.0 | 88 |
Chelonia mydas | Green Sea Turtle | reptilia | XP_007069537.1 | 72.8 | 320 |
Xenopus tropicalis | Burmese Python | reptilia | XP_007434894.1 | 70.9 | 320 |
Python bivittatus | Red Junglefowl | Ave | NP_001264206.1 | 73.4 | 320 |
Gallus gallus | Common Cuckoo | Ave | XP_009564677.1 | 72.5 | 320 |
Cuculus canorus | Southern Platyfish | Actinopterygii | XP_005800827.1 | 65.2 | 432 |
Xiphophorus maculatus | Zebrafish | Actinopterygii | NP_997781.1 | 62.4 | 432 |
Danio rerio | Acorn Worm | Enteropneusta | XP_002738479.1 | 55.6 | 627 |
Saccoglossus kowalevskii | Atlantic Horseshoe Crab | Merostomata | XP_013785734.1 | 56.6 | 758 |
Limulus polyphemus | Western Clawed Frog | Amphibia | XP_012812415.1 | 55.1 | 353 |
Crassostrea gigas | Pacific Oyster | Bivalvia | XP_011412414.1 | 50.0 | 758 |
Tribolium castaneum | Red Flour Beetle | Insecta | XP_968861.1 | 49.0 | 758 |
Drosophila bipectinata | Fruitfly | Insecta | XP_017103988.1 | 46.0 | 758 |
Megachile rotundata | Alfalfa leafcutter bee | Insecta | XP_003702672.1 | 44.8 | 758 |
Zymoseptoria brevis | fungi | Dothideomycetes | KJX93246.1 | 36.5 | 1150 |
Cladophialophora carrionii | fungi | Dothideomycetes | OCT48531.1 | 35.8 | 1150 |
Alternaria alternata | fungi | Dothideomycetes | XP_018384285.1 | 36.2 | 1150 |
Candidatus Pelagibacter ubique | bacteria | Bacteria | WP_075504325.1 | 34.5 | 4090 |
Pelagibacteraceae bacterium | bacteria | Bacteria | OCW82973.1 | 34.1 | 4090 |
C11orf54's coordination with a zinc ion through three histidines and an acetate anion is likely to point to a function of the protein being an enzymatic reaction as an ester hydrolase. The protein has a high turnover number when reacted with p-nitrophenyl acetate (0.042 sec−1) as compared to a 1 sec−1 turnover rate found in another enzyme (bovine carbonic anhydrase II) that reacts with p-nitrophenyl acetate. [7]
Protein Name | Abbreviation |
---|---|
Ubiquitin C | UBC |
Collagen, type IV, alpha 3 | COL4A3 |
Thyroid Hormone Receptor Interactor 13 | TRIP13 |
DEAD (Asp-Glu-Ala-Asp) box polypeptide 60-like | DDX60L |
Glutamine-fructose-6-phosphate transaminase 2 | GFPT2 |
Superkiller viralicidic activity 2-like (S. cerevisiae) | SKIV2L |
OTU domain, ubiquitin aldehyde binding 1 | OTUB1 |
Chloride channel accessory 1 is a protein that in humans is encoded by the CLCA1 gene.
Zinc finger protein 43 is a protein that in humans is encoded by the ZNF43 gene.
DGLUCY is a protein that in humans is encoded by the DGLUCY gene.
MAP11 is a protein that in human is encoded by the gene MAP11. It was previously referred to by the generic name C7orf43. C7orf43 has no other human alias, but in mice can be found as BC037034.
Chromosome 11 open reading frame one, also known as C11orf1, is a protein-coding gene. It has been found by yeast two hybrid screen to bind to SETDB1 a histone protein methyltransferase enzyme. SETDB1 has been implicated in Huntington's disease, a neurodegenerative disorder.
C7orf38 is a gene located on chromosome 7 in the human genome. The gene is expressed in nearly all tissue types at very low levels. Evolutionarily, it can be found throughout the kingdom animalia. While the function of the protein is not fully understood by the scientific community, bioinformatic tools have shown that the protein bares much similarity to zinc finger or transposase proteins. Many of its orthologs, paralogs, and neighboring genes have been shown to possess zinc finger domains. The protein contains a hAT dimerization domain nears its C-terminus. This domain is highly conserved in transposase enzymes.
Polyphosphoinositide phosphatase also known as phosphatidylinositol 3,5-bisphosphate 5-phosphatase or SAC domain-containing protein 3 (Sac3) is an enzyme that in humans is encoded by the FIG4 gene. Fig4 is an abbreviation for Factor-Induced Gene.
Chromosome 20 open reading frame 111, or C20orf111, is the hypothetical protein that in humans is encoded by the C20orf111 gene. C20orf111 is also known as Perit1, HSPC207, and dJ1183I21.1. It was originally located using genomic sequencing of chromosome 20. The National Center for Biotechnology Information, or NCBI, shows that it is located at q13.11 on chromosome 20, however the genome browser at the University of California-Santa Cruz (UCSC) website shows that it is at location q13.12, and within a million base pairs of the adenosine deaminase locus. It was also found to have an increase in expression in cells undergoing hydrogen peroxide(H
2O
2)-induced apoptosis. After analyzing the amino acid content of C20orf111, it was found to be rich in serine residues.
ZGRF1 is a protein in humans that is encoded by the ZGRF1 gene that has a weight of 236.6 kDa. The ZGRF1 gene product localizes to the cell nucleus and promotes DNA repair by stimulating homologous recombination. This gene shows relatively low expression in most human tissues, with increased expression in situations of chemical dependence. ZGRF1 is orthologous to nearly all kingdoms of Eukarya. Functional domains of this protein link it to a series of helicases, most notably the AAA_12 and AAA_11 domains.
Acyl-CoA thioesterase 9 is a protein that is encoded by the human ACOT9 gene. It is a member of the acyl-CoA thioesterase superfamily, which is a group of enzymes that hydrolyze Coenzyme A esters. There is no known function, however it has been shown to act as a long-chain thioesterase at low concentrations, and a short-chain thioesterase at high concentrations.
Family with sequence similarity 167, member A is a protein in humans that is encoded by the FAM167A gene located on chromosome 8. FAM167A and its paralogs are protein encoding genes containing the conserved domain DUF3259, a protein of unknown function. FAM167A has many orthologs in which the domain of unknown function is highly conserved.
ABHD18 is a protein that in Homo sapiens is encoded by the ABHD18 gene.
Uncharacterized protein C14orf80 is a protein which in humans is encoded by the chromosome 14 open reading frame 80, C14orf80, gene.
Uncharacterized protein C1orf131 is a protein that in humans is encoded by the gene C1orf131. The first ortholog of this protein was discovered in humans. Subsequently, through the use of algorithms and bioinformatics, homologs of C1orf131 have been discovered in numerous species, and as a result, the name of the majority of the proteins in this protein family is Uncharacterized protein C1orf131 homolog.
Chromosome 9 open reading frame 152 is a protein that in humans is encoded by the C9orf152 gene. The exact function of the protein is not completely understood.
C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31). C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.
C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.
Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.
C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.
C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.