WDR53

Last updated
WDR53 structure. Zhang Lab, University of Michigan WDR53.gif
WDR53 structure. Zhang Lab, University of Michigan

WD repeat containing protein 53 (WDR53) is a protein encoded by the WDR53 gene that has been identified in the human genome by the Human Genome Project but has, at the moment, lacked experimental procedures to understand the function. It is located on chromosome 3 at location 3q29 in Homo sapiens . It has short up and down stream untranslated regions as well as WD40 repeat regions which have been linked to various functions (signal transduction, transcription regulation, apoptosis etc.).

Contents

Expression

In H. sapiens, it has been shown to be highly expressed in the tissue of the testes with low, almost untraceable, expression in other tissues. [1]

Gene

WDR53 transcribes an mRNA with 1701 base pairs. [2] This gene is on the negative strand of chromosome 3 and has four exons. [3] The mRNA has a promoter labeled GXP_232341 from Genomatix and is 1253 bp long. [4]

mRNA

WDR53 can be alternately spliced into 6 different mRNA products.

Protein

The translated protein from WDR53 contains seven identifiable WD40 regions encoded by 358 amino acids. [5] The protein product is 38.99 kDa. [6]

Subcellular Localization

Cone shape formed by WDR53 WDR53 Cone.jpg
Cone shape formed by WDR53

WDR53 has been predicted to be localized in the nucleus of cells. [7] The protein even possesses one nuclear localization signal.

Structure

The secondary structure of WDR53 has been predicted to be predominantly alternations between loops and strands with little to no helices. [8] These WD40 regions fold into a tertiary propeller like structure that has been conserved in multiple different genes across the human genome as well as other Eukaryotas. [9] These seven repeats form into a cone like shape in which the center depression most likely acts as a binding point for other proteins.

Interacting Proteins

There are two likely proteins that interact with WDR53: WDR5 and MCPH1. Each of these proteins possess regions that have a high likelihood of forming and association with WDR53. Both proteins are expressed in the testes as well which strengthens the likelihood of there being a true association. [10] [11]

Protein map showing post-translational modifications. WDR53 new.jpg
Protein map showing post-translational modifications.

Post-translational Modifications

WDR53 undergoes protein modifications such as phosphorylation, sumoylation, and glycosylation.

Homology

Because WD40 repeats are conserved across many eukaryotes, WDR53 is also conserved among many eukaryotes even into certain members of the plantae kingdom such as the Apple.

Mutation rate of WDR53 in comparison to Fibrinogen and Cytochrome-C across organisms Chart of Divergence.jpg
Mutation rate of WDR53 in comparison to Fibrinogen and Cytochrome-C across organisms
Orthologs
OrganismAccession NumberPercent IdentityDivergence from Homo sapiens (MYA)
Minke Whale XP_0071716658994
Tasmanian Devil XP_00376372675160
Chinese Alligator XP_00603714869320
King Cobra ETE6263462320
American Bullfrog PIO1389454353
Zebrafish XP_001339961.148432
Crown-of-Thorns Starfish XP_02209340135627
Eastern Oyster XP_02228902130794
Apple AFV94635261624

Related Research Articles

<span class="mw-page-title-main">TSBP1</span> Protein found in humans

TSBP1 is a protein that in humans is encoded by the TSBP1 gene. C6orf10 is an open reading frame on chromosome 6 containing a protein that is ubiquitously expressed at low levels in the adult genome and may play a role during fetal development. C6orf10 has been found to be linked to both neurodegenerative and autoimmune diseases in adults. Expression of this gene is highest in the testis but is also seen in other tissue types such as the brain, lens of the eye and the medulla. TSBP1 was previously known as C6orf10.

<span class="mw-page-title-main">Interferon-inducible GTPase 5</span> Protein-coding gene in the species Homo sapiens

Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.

<span class="mw-page-title-main">NBEAL1</span> Protein-coding gene in the species Homo sapiens

NBEAL1 is a protein that in humans is encoded by the NBEAL1 gene. It is found on chromosome 2q33.2 of Homo sapiens.

WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.

The coiled-coil domain containing 142 (CCDC142) is a gene which in humans encodes the CCDC142 protein. The CCDC142 gene is located on chromosome 2, spans 4339 base pairs and contains 9 exons. The gene codes for the coiled-coil domain containing protein 142 (CCDC142), whose function is not yet well understood. There are two known isoforms of CCDC142. CCDC142 proteins produced from these transcripts range in size from 743 to 665 amino acids and contain signals suggesting protein movement between the cytosol and nucleus. Homologous CCDC142 genes are found in many animals including vertebrates and invertebrates but not fungus, plants, protists, archea, or bacteria. Although the function of this protein is not well understood, it contains a coiled-coil domain and a RINT1_TIP1 motif located within the coiled-coil domain.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">FAM71E1</span> Mammalian protein found in Homo sapiens

FAM71E1, also known as Family With Sequence Similarity 71 Member E1, is a protein that in humans is encoded by the FAM71E1 gene. It is thought to be ubiquitously expressed at low levels throughout the body, and it is conserved in vertebrates, particularly mammals and some reptiles. The protein is localized to the nucleus and can be exported to the cytoplasm.

<span class="mw-page-title-main">ZCCHC18</span> Protein-coding gene in the species Homo sapiens

Zinc finger CCHC-type containing 18 (ZCCHC18) is a protein that in humans is encoded by ZCCHC18 gene. It is also known as Smad-interacting zinc finger protein 2 (SIZN2), para-neoplastic Ma antigen family member 7b (PNMA7B), and LOC644353. Other names such as zinc finger, CCHC domain containing 12 pseudogene 1, P0CG32, ZCC18_HUMAN had been used to describe this protein.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">FAM120AOS</span> Protein-coding gene in the species Homo sapiens

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

References

  1. "WDR53 WD repeat domain 53 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. NCBI. Retrieved 5 February 2018.
  2. "WD repeat-containing protein 53 isoform a [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-02-19.
  3. "WD repeat-containing protein 53 isoform a [Homo sapiens] - Protein - NCBI".
  4. El Dorado by Genomatix
  5. "WD repeat-containing protein 53 isoform a [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-02-19.
  6. "WDR53 - WD repeat-containing protein 53 - Homo sapiens (Human) - WDR53 gene & protein". www.uniprot.org. Retrieved 2018-05-06.
  7. "aGE251ffc52e51822bf2e9b6375a5787012 WoLFPSORT output". wolfpsort.hgc.jp. Retrieved 2018-05-06.
  8. "Protein Predictions". predictprotein.org. 2018-05-06. Retrieved 2018-05-11.
  9. Smith, Temple F; Gaitatzes, Chrysanthe; Saxena, Kumkum; Neer, Eva J (1999-05-01). "The WD repeat: a common architecture for diverse functions". Trends in Biochemical Sciences. 24 (5): 181–185. doi: 10.1016/S0968-0004(99)01384-5 . ISSN   0968-0004. PMID   10322433.
  10. "WDR5 WD repeat domain 5 [Homo sapiens (human)] - Gene - NCBI".
  11. "MCPH1 microcephalin 1 [Homo sapiens (human)] - Gene - NCBI".