WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). [1] This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. [2] It contains multiple WD40 repeat domains and one domain of unknown function. [3] This protein is conserved all the way back to invertebrates. [4] Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis. [5]
In the human genome, WDR90 spans from base pairs 649363 to 667833, and has an mRNA sequence that is 5,250 nucleotides in length. This gene consists of 41 exons and encodes for a 1,750 amino acid protein in humans.
This human gene is located in the positive DNA strand on chromosome 16 at the position 16p13.3. [1] Flanking sequences on the positive strand include FAM195A and RHOT2. [3]
Aliases for WDR90 include: C16orf15, C16orf16, C16orf17, C16orf18, C16orf19, and KIAA1924. [1]
One human paralog for WDR90 has been found. This paralog is known as WDR16. According to UniProtKB/Swiss-Prot, WDR16 may play an essential role in growth or survival of hepatocellular carcinoma. This protein in humans if 620 amino acids long. [6] [7]
Orthologs for protein WDR90 are found conserved all the way back to single invertebrates. [8] There were three conserved domains across mammals, DUF667 (location 3-124), and two WD40 repeats (locations 697-991, and 1155-1549). [3]
Human protein WDR90 is 287.7 kDa, with an isoelectric point of 6.584. There is a domain of unknown function called DUF667 that is conserved across its 30 predicted isoforms. [9]
WDR90 is predicted to be a heavily phosphorylated protein. [10]
There are three domains that repeat throughout the human sequence. [11]
The de novo protein structure of WDR90 in humans is predicted to possess beta sheets, beta turns, and random coils. [12]
The human protein WDR90 adopts a 7-bladed beta propeller that is indicative of the WD40 repeat family. This structure was predicted with 100% confidence by the structure predicting program Phyre2.
The most notable proteins that interact with human WDR90 are predicted to be UTY, KDM6B, SETD1B, and ASH2L. [13] These proteins are primarily found as histone modifiers.
The promoter for WDR90 in humans has been predicted to have domains for the transcription factors: ARNT, AP-1, MYB, and Cis2His2. [14]
Expression levels of human WDR90 appear to be relatively constituent in low levels, with its highest levels in the lymph nodes and thymus. [15] [16] The expression of WDR90 in development is the highest in blastocyst and fetal stages. Expression is notably absent from the neonatal and infant stages, then comes back during juvenile and persists through adulthood. [16]
The function of WDR90 in humans in not well known. Given its broad expression in many tissues, its ties to the protein superfamily WD40 domain-containing proteins, and the known protein interactions, it could play a role in histone modifications, vesicular transport, or transcription regulation. [13] [15] [16]
Family with sequence similarity 63, member A is a protein that, in humans, is encoded by the FAM63A gene. It is located on the minus strand of chromosome 1 at locus 1q21.3.
C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).
KIAA0753 is a protein that in humans is encoded by the gene KIAA0753. The gene is located on chromosome 17p13.1, on the reverse strand spanning bases 6578141 to 6641744. The KIAA0753 gene contains 18 exons, 19 introns, and has no known aliases.
C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.
Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.
Leukocyte Receptor Cluster Member 9 is an uncharacterized protein encoded by the LENG9 gene. In humans, LENG9 is predicted to play a role in fertility and reproductive disorders associated with female endometrium structures.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
WD repeat containing protein 53 (WDR53) is a protein encoded by the WDR53 gene that has been identified in the human genome by the Human Genome Project but has, at the moment, lacked experimental procedures to understand the function. It is located on chromosome 3 at location 3q29 in Homo sapiens. It has short up and down stream untranslated regions as well as WD40 repeat regions which have been linked to various functions.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.
Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
FAM71E2, also known as Family With Sequence Similarity 71 Member E2, is a protein that, in humans, is encoded by the FAM71E2 gene. Aliases include C19orf16, Protein FAM71E2, Chromosome 19 open reading frame 16, and Putative Protein FAM71E2. The gene is primarily conserved in mammals, but it is also conserved in two reptile species.
Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.
C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.
Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.