C21orf58

Identifiers
Aliases: C21orf58, chromosome 21 open reading frame 58
External IDs: HomoloGene: 137684; GeneCards: C21orf58
RefSeq (mRNA): n/a
RefSeq (protein): NP_001273391, NP_001273392, NP_001273405, NP_001273406, NP_478060
Location (UCSC): Chr 21: 46.3 – 46.32 Mb
PubMed search: [2]

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene. [3]

Gene

C21orf58 gene neighborhood

Locus

The gene is located on the minus strand of the distal half of the long arm of Chromosome 21 at 21q22.3. [4] Transcript 1, including UTRs, is 22,740 bp and spans the chromosomal locus 46,301,130-46,323,875. [4]
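The length of a genomic interval follows directly from its coordinates. The following is a minimal sketch, assuming the coordinates quoted above are 1-based and inclusive (the source does not state the convention); the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int, inclusive: bool = True) -> int:
    """Return the length in bp of a genomic interval with start <= end.

    Strand does not affect the length: a minus-strand gene is transcribed
    from the higher coordinate toward the lower one, but spans the same bases.
    """
    if start > end:
        raise ValueError("start must not exceed end")
    return end - start + (1 if inclusive else 0)


# Coordinates quoted in the text for transcript 1.
locus_start, locus_end = 46_301_130, 46_323_875

print(span_length(locus_start, locus_end))         # 22746 (inclusive)
print(span_length(locus_start, locus_end, False))  # 22745 (end-exclusive)
```

Under either convention the computed span is a few bases longer than the 22,740 bp quoted; the source does not explain the difference.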

mRNA

Alternative Splicing

mRNA transcript variants 1-5 encode two validated protein isoforms of C21orf58. [5] [4] Transcript variant 1 encodes the longer, primary isoform, isoform 1 (accession: NP_470860). [3] Transcript variants 2-5 encode the shorter isoform, isoform 2. [4] Isoform 2 has a distinct N-terminus compared with isoform 1, resulting from the use of an alternative start codon. [4] A domain of unknown function, DUF4587, is conserved in all variants. [4]

Transcript [4] | Protein [4] | Length (bp) [4] | Length (aa) [4] | Exons [4] | DUF4587 (aa) [4]
1 | Isoform 1 | 2975 | 322 | 8 | 234-291
2 | Isoform 2 | 1674 | 216 | 9 | 128-185
3 | Isoform 2 | 2900 | 216 | 7 | 128-185
4 | Isoform 2 | 2941 | 216 | 9 | 128-185
5 | Isoform 2 | 2624 | 216 | 9 | 128-185
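The DUF4587 coordinates in the table above can be cross-checked against the isoform lengths. The sketch below assumes the two isoforms are colinear downstream of their distinct N-termini, so that a position in isoform 1 maps to isoform 2 by a constant offset equal to the length difference; that assumption and the helper name `iso1_to_iso2` are illustrative, not taken from the source.

```python
# Values copied from the transcript table.
ISOFORM_LENGTH = {"isoform 1": 322, "isoform 2": 216}
DUF4587 = {"isoform 1": (234, 291), "isoform 2": (128, 185)}

# Assumption: isoform 2 lacks the first OFFSET residues' worth of length,
# and the remaining sequence is shared position-for-position.
OFFSET = ISOFORM_LENGTH["isoform 1"] - ISOFORM_LENGTH["isoform 2"]


def iso1_to_iso2(position: int) -> int:
    """Map a 1-based residue position in isoform 1 onto isoform 2."""
    mapped = position - OFFSET
    if mapped < 1:
        raise ValueError("position falls in the region unique to isoform 1")
    return mapped


start1, end1 = DUF4587["isoform 1"]
print(OFFSET)                                    # 106
print(iso1_to_iso2(start1), iso1_to_iso2(end1))  # 128 185
```

The mapped coordinates match the isoform 2 row of the table, which is consistent with the domain being shared and the isoforms differing only at the N-terminus.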

Protein

General Properties

The primary encoded protein is 322 amino acids long, is encoded by 8 exons, and has a predicted molecular weight of 39.0 kDa. [3] [6] [7] The predicted isoelectric point is 10.06, which is consistent with the predicted nuclear localization. [7] [6]

Composition

Human C21orf58 isoform 1 is rich in proline and glutamine, and poor in cysteine, phenylalanine, and tyrosine. [7] The protein is particularly tyrosine-poor, containing no tyrosine residues. [7] Isoform 1 contains 20 more positively charged residues than negatively charged residues, which further supports the high predicted isoelectric point. [7]
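Composition statistics of this kind can be reproduced from any protein sequence by simple counting. The sketch below is not the SAPS implementation; it assumes lysine and arginine count as positive, aspartate and glutamate as negative, and histidine as uncharged. The sequence `toy` is a hypothetical placeholder, not the C21orf58 sequence.

```python
from collections import Counter

POSITIVE = "KR"  # assumption: histidine is not counted as charged
NEGATIVE = "DE"


def composition(seq: str) -> dict[str, float]:
    """Return the fraction of each amino acid present in the sequence."""
    seq = seq.upper()
    counts = Counter(seq)
    return {aa: counts[aa] / len(seq) for aa in sorted(counts)}


def net_charge_count(seq: str) -> int:
    """Return (number of K + R) minus (number of D + E)."""
    seq = seq.upper()
    positive = sum(seq.count(aa) for aa in POSITIVE)
    negative = sum(seq.count(aa) for aa in NEGATIVE)
    return positive - negative


toy = "MPRQPKRHHQPPEDKR"  # hypothetical 16-residue example
print(composition(toy)["P"])          # 0.25
print(composition(toy).get("Y", 0))   # 0 (no tyrosine)
print(net_charge_count(toy))          # 3
```

Applied to the real isoform 1 sequence, `net_charge_count` would be expected to return the excess of 20 positive residues reported above, provided the same charge convention is used.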

Domains & Motifs

Illustration of C21orf58 annotated with important domains, motifs, and post-translational modifications.

C21orf58 isoform 1 has three conserved domains: a proline-rich domain, a histidine-rich repeat domain, and DUF4587. The proline-rich domain, Pro175-Pro322, is predicted to mediate protein-protein interactions. [8] The histidine-rich repeat domain, His292-His299, is predicted to facilitate localization. [9] [10] The domain of unknown function, DUF4587 (Arg234-His291), is a member of pfam15248, which is found exclusively in eukaryotes. [11]

C21orf58 contains a nuclear localization signal, Thr135-Leu144. [12]
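The relationships among these annotated regions can be made explicit by treating each as a closed interval of residue positions. The following is a minimal sketch using the coordinates given above (nuclear localization signal at residues 135-144); the `Feature` class and helper functions are illustrative names, not part of any annotation tool.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def overlaps(a: Feature, b: Feature) -> bool:
    """True if the two features share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def contains(outer: Feature, inner: Feature) -> bool:
    """True if inner lies entirely within outer."""
    return outer.start <= inner.start and inner.end <= outer.end


nls = Feature("nuclear localization signal", 135, 144)
pro = Feature("proline-rich domain", 175, 322)
duf = Feature("DUF4587", 234, 291)
his = Feature("histidine-rich repeat domain", 292, 299)

for f in (nls, pro, duf, his):
    print(f"{f.name}: {f.start}-{f.end} ({f.length} aa)")

print(contains(pro, duf))  # True: DUF4587 lies inside the proline-rich domain
print(contains(pro, his))  # True
print(overlaps(duf, his))  # False: the His repeat starts right after DUF4587
print(overlaps(nls, pro))  # False
```

With these coordinates, DUF4587 (58 residues) and the histidine-rich repeat (8 residues) are adjacent and both fall within the C-terminal proline-rich domain, while the nuclear localization signal lies upstream of all three.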

Tertiary structure of C21orf58 predicted by Phyre2

Structure

The secondary structure of C21orf58 is predicted to consist primarily of random coil, with four alpha-helical regions distributed along the protein. [14] [15] [16] Secondary structure predictions for C21orf58 orthologs gave similar results: predominantly random coil with four alpha-helical regions, plus beta-sheets throughout. [14] [15] [16]

C21orf58 mRNA transcript variant 1 aligned and conceptually translated with important domains, motifs, and post-translational modifications.

Post-Translational Modifications

C21orf58 is predicted to undergo multiple post-translational modifications, including phosphorylation, O-GlcNAcylation, and SUMOylation. [17] [18] [19] [20]

Subcellular Localization

Immunocytochemistry revealed localization of C21orf58 to nucleoplasm and nuclear bodies. [21] Presence of a nuclear localization sequence provides further evidence for protein import into the cell nucleus. [14]

Subcellular localization predictions for C21orf58 based on the amino acid sequence (PSORTII) suggested nuclear localization. [22] Predictions across orthologs agreed with nuclear localization. [22]

Expression

Tissue Expression Pattern

C21orf58 is constitutively expressed at low levels across various normal tissues (GDS3113), including but not limited to brain, endocrine, bone marrow, lung, and reproductive tissues. [23]

C21orf58 constitutive low level expression across all tissues analyzed (GDS3113)

DNA microarray experimental data

DNA microarray data from several experiments showed that C21orf58 expression varies under particular physiological conditions.

C21orf58 was found to be expressed at similar levels through all stages of development. [29]

Sagittal view of in situ hybridization for the mouse ortholog of C21orf58 (2610028H24Rik) in the mouse brain. Expression is color-coded by intensity, from blue (low) through green to red (high). Allen Brain Atlas

In situ Hybridization

The mouse ortholog of C21orf58, 2610028H24Rik, was found to be ubiquitously expressed at high levels throughout the mouse brain. [30]

Regulation of Expression

Transcriptional

The primary promoter for the longest variant of C21orf58 aligns with the start of the 5' UTR and is 1,143 bp in length. [31] The predicted promoter sequence overlaps with the 5' UTR and coding sequence of Pericentrin (PCNT) on the plus strand of Chromosome 21. Transcription factors predicted to bind the promoter are associated with regulation of the cell cycle, neurogenesis, early development, and sex determination.

Transcription Factor [31] | Function [31]
PLAG1 | Associated with nuclear import; transcriptional activator
WT1 | Role in the development of the urogenital system
ZFX | Implicated in mammalian sex determination
AP-2 | Activation of genes in early development; expression in neural crest cell lineages
E2F4 | Cell cycle control; tumor suppression
c-Myb | Regulation of hematopoiesis
Elk-1 | Transcriptional activator
KLF7 | Cell proliferation, differentiation, and survival; regulates neurogenesis
ZBTB33 | Promotes histone deacetylation and the formation of repressive chromatin structures
Roaz | Involved in olfactory neuronal differentiation

Interacting Proteins

Yeast two-hybrid screening identified protein-protein interactions with PNMA1, MTUS2, and GRB2. [32] Affinity capture-MS indicated interactions with MTA2, ASH2L, and FAM199X. [32] A two-hybrid prey pooling approach followed by a two-hybrid array revealed interactions with Ccdc136, Ccdc125, KRT37, KRT27, KRT35, SPTA1, MKRN3, USHBP1, and KLHL20. [33]

Predicted interactions involved proteins associated with the cytoskeleton, cell migration, histone modification, and signal transduction.

Interactor | Function
PNMA1 | Neuron- and testis-specific protein [34]; associated with paraneoplastic neurological disorders [34]
MTUS2 | Microtubule-associated scaffold protein [35]; role in cell migration and linking of microtubules to the plasma membrane [35]
GRB2 | Signal transduction [36]
MTA2 | Component of NuRD, a nucleosome remodeling deacetylase complex [37]
ASH2L | Component of the Set1/Ash2 histone methyltransferase (HMT) complex [38]
Ccdc136 | Acrosome formation in spermatogenesis [39]
Ccdc125 | Regulation of cell migration [40]
KRT37 | Type I keratin that heterodimerizes with type II keratin to form hair and nails [41]
KRT27 | Member of the type I keratin family; involved in intermediate filament formation [42]
KRT35 | Type I keratin that heterodimerizes with type II keratin to form hair and nails [43]
SPTA1 | Molecular scaffold protein that links the plasma membrane to the actin cytoskeleton [44]
MKRN3 | Plays a role in the onset of puberty; part of the ubiquitin-proteasome system [45]
USHBP1 | Harmonin-binding protein [46]; actin filament binding [46]
KLHL20 | Actin filament binding [47]; adapter of BCR, a negative regulator of apoptosis [47]

Homology

Strict orthologs of C21orf58 by divergence (MYA) and % similarity to human protein C21orf58

Paralogs

No human paralogs for C21orf58 were identified. [49]

Orthologs

C21orf58 orthologs were identified in bony fish but not in cartilaginous fish. [50] The first 35 residues of DUF4587, Arg234-Pro265, were conserved across ortholog sequences. [51] The most distantly related ortholog identified was that of the zebrafish. [50]

Molecular Evolution

The rate of C21orf58 evolution was determined through an application of the Molecular Clock Hypothesis. Through comparison with alpha fibrinogen and cytochrome C, it was determined that C21orf58 has evolved at an intermediate rate.

m vs Divergence from Humans (MYA). C21orf58 compared to a quickly evolving gene (α fibrinogen) and a slowly evolving gene (Cytochrome C) across orthologs.
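The quantity m in a plot of this kind is usually a divergence corrected for multiple substitutions at the same site. The sketch below assumes the commonly used correction m = -100 ln(1 - n/100), where n is the observed percent difference between two aligned sequences; the source does not state which formula was used, and the percent identities shown are hypothetical placeholders rather than values from the ortholog table.

```python
import math


def corrected_divergence(percent_identity: float) -> float:
    """Corrected changes per 100 residues, m = -100 * ln(1 - n/100).

    n is the observed percent difference (100 - percent identity).
    This correction is an assumption, not confirmed by the source.
    """
    n = 100.0 - percent_identity
    if not 0.0 <= n < 100.0:
        raise ValueError("percent identity must be in the range (0, 100]")
    return -100.0 * math.log(1.0 - n / 100.0)


# Hypothetical percent identities to a human protein, for illustration only.
for identity in (95.0, 75.0, 50.0):
    print(f"{identity:5.1f}% identity -> m = {corrected_divergence(identity):.1f}")
```

Plotting m against divergence time for each ortholog gives an approximately straight line whose slope reflects the rate of change, which is how a protein can be placed between a fast-evolving reference such as α fibrinogen and a slow-evolving one such as cytochrome C.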

References

  1. GRCh38: Ensembl release 89: ENSG00000160298 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. "uncharacterized protein C21orf58 isoform 1 [Homo sapiens] - Protein - NCBI". ncbi.nlm.nih.gov. Retrieved 2018-02-04.
  4. "C21orf58 chromosome 21 open reading frame 58 [Homo sapiens (human)] - Gene - NCBI". ncbi.nlm.nih.gov. Retrieved 2018-02-04.
  5. "Gene: C21orf58 (ENSG00000160298) - Splice variants - Homo sapiens - Ensembl genome browser 88". mar2017.archive.ensembl.org. Retrieved 2018-02-18.
  6. "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2018-04-27.
  7. EMBL-EBI. "SAPS Results". ebi.ac.uk. Retrieved 2018-04-27.
  8. Lewitzky M, Kardinal C, Gehring NH, Schmidt EK, Konkol B, Eulitz M, Birchmeier W, Schaeper U, Feller SM (March 2001). "The C-terminal SH3 domain of the adapter protein Grb2 binds with high affinity to sequences in Gab1 and SLP-76 which lack the SH3-typical P-x-x-P core motif". Oncogene. 20 (9): 1052–62. doi:10.1038/sj.onc.1204202. PMID 11314042.
  9. Hernández-Sánchez IE, Maruri-López I, Ferrando A, Carbonell J, Graether SP, Jiménez-Bremont JF (September 2015). "Nuclear localization of the dehydrin OpsDHN1 is determined by histidine-rich motif". Frontiers in Plant Science. 6: 702. doi:10.3389/fpls.2015.00702. PMC 4561349. PMID 26442018.
  10. Seo YA, Lopez V, Kelleher SL (June 2011). "A histidine-rich motif mediates mitochondrial localization of ZnT2 to modulate mitochondrial function". American Journal of Physiology. Cell Physiology. 300 (6): C1479–89. doi:10.1152/ajpcell.00420.2010. PMC 3118624. PMID 21289295.
  11. NIH/NLM/NCBI/IEB/CDD group. "NCBI CDD Conserved Protein Domain DUF4587". ncbi.nlm.nih.gov. Retrieved 2018-04-27.
  12. Kosugi S, Hasebe M, Tomita M, Yanagawa H (June 2009). "Systematic identification of cell cycle-dependent yeast nucleocytoplasmic shuttling proteins by prediction of composite motifs". Proceedings of the National Academy of Sciences of the United States of America. 106 (25): 10171–6. Bibcode:2009PNAS..10610171K. doi:10.1073/pnas.0900604106. PMC 2695404. PMID 19520826.
  13. Kelley, Lawrence. "PHYRE2 Protein Fold Recognition Server". sbg.bio.ic.ac.uk. Retrieved 2018-05-07.
  14. Combet C, Blanchet C, Geourjon C, Deléage G (March 2000). "NPS@: network protein sequence analysis". Trends in Biochemical Sciences. 25 (3): 147–50. doi:10.1016/s0968-0004(99)01540-6. PMID 10694887.
  15. Garnier J, Osguthorpe DJ, Robson B (March 1978). "Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins". Journal of Molecular Biology. 120 (1): 97–120. doi:10.1016/0022-2836(78)90297-8. PMID 642007.
  16. Chou PY, Fasman GD (January 1974). "Prediction of protein conformation". Biochemistry. 13 (2): 222–245. doi:10.1021/bi00699a002. ISSN 0006-2960. PMID 4358940.
  17. "Motif Scan". myhits.isb-sib.ch. Retrieved 2018-04-27.
  18. Basu S, Plewczynski D (April 2010). "AMS 3.0: prediction of post-translational modifications". BMC Bioinformatics. 11: 210. doi:10.1186/1471-2105-11-210. PMC 2874555. PMID 20423529.
  19. Gupta R, Brunak S (2002). "Prediction of glycosylation across the human proteome and the correlation to protein function". Pacific Symposium on Biocomputing: 310–22. doi:10.1142/9789812799623_0029. ISBN 978-981-02-4777-5. PMID 11928486.
  20. Hilgarth RS, Murphy LA, Skaggs HS, Wilkerson DC, Xing H, Sarge KD (December 2004). "Regulation and function of SUMO modification". The Journal of Biological Chemistry. 279 (52): 53899–902. doi:10.1074/jbc.R400021200. PMID 15448161.
  21. "C21orf58 - Antibodies - The Human Protein Atlas". proteinatlas.org. Retrieved 2018-05-01.
  22. "PSORT II Prediction". psort.hgc.jp. Retrieved 2018-05-06.
  23. "49003066 - GEO Profiles - NCBI". ncbi.nlm.nih.gov. Retrieved 2018-05-01.
  24. "GDS3113 / 152620". ncbi.nlm.nih.gov. Retrieved 2018-05-01.
  25. "GDS2919 / 238541_at". ncbi.nlm.nih.gov. Retrieved 2018-05-06.
  26. "GDS3429 / 19723". ncbi.nlm.nih.gov. Retrieved 2018-05-06.
  27. "GDS2697 / 238541_at". ncbi.nlm.nih.gov. Retrieved 2018-05-06.
  28. "What is teratozoospermia?". Teratozoospermia. 2018-04-06. Retrieved 2018-05-06.
  29. Schuler Group. "EST Profile - Hs.236572". ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  30. "Experiment Detail :: Allen Brain Atlas: Mouse Brain". mouse.brain-map.org. Retrieved 2018-05-06.
  31. "Genomatix: ElDorado Result". genomatix.de. Retrieved 2018-05-06.
  32. Mike Tyers Lab. "C21orf58 Result Summary | BioGRID". thebiogrid.org. Retrieved 2018-05-05.
  33. "31 binary interactions found for search term C21orf58". IntAct Molecular Interaction Database. EMBL-EBI. Retrieved 2018-08-25.
  34. GeneCards Human Gene Database. "PNMA1 Gene - GeneCards | PNMA1 Protein | PNMA1 Antibody". genecards.org. Retrieved 2018-05-04.
  35. GeneCards Human Gene Database. "MTUS2 Gene - GeneCards | MTUS2 Protein | MTUS2 Antibody". genecards.org. Retrieved 2018-05-04.
  36. "GRB2". collab.its.virginia.edu. Retrieved 2018-05-05.
  37. GeneCards Human Gene Database. "MTA2 Gene - GeneCards | MTA2 Protein | MTA2 Antibody". genecards.org. Retrieved 2018-05-06.
  38. "Ash2l - Set1/Ash2 histone methyltransferase complex subunit ASH2 - Mus musculus (Mouse) - Ash2l gene & protein". uniprot.org. Retrieved 2018-05-06.
  39. "CCDC136 - Coiled-coil domain-containing protein 136 - Homo sapiens (Human) - CCDC136 gene & protein". uniprot.org. Retrieved 2018-05-06.
  40. GeneCards Human Gene Database. "CCDC125 Gene - GeneCards | CC125 Protein | CC125 Antibody". genecards.org. Retrieved 2018-05-06.
  41. GeneCards Human Gene Database. "KRT37 Gene - GeneCards | KRT37 Protein | KRT37 Antibody". genecards.org. Retrieved 2018-05-06.
  42. GeneCards Human Gene Database. "KRT27 Gene - GeneCards | K1C27 Protein | K1C27 Antibody". genecards.org. Retrieved 2018-05-06.
  43. "KRT35 keratin 35 [Homo sapiens (human)] - Gene - NCBI". ncbi.nlm.nih.gov. Retrieved 2018-05-06.
  44. GeneCards Human Gene Database. "SPTA1 Gene - GeneCards | SPTA1 Protein | SPTA1 Antibody". genecards.org. Retrieved 2018-05-06.
  45. Genetics Home Reference. "MKRN3 gene". Genetics Home Reference. Retrieved 2018-05-06.
  46. GeneCards Human Gene Database. "USHBP1 Gene - GeneCards | USBP1 Protein | USBP1 Antibody". genecards.org. Retrieved 2018-05-06.
  47. "KLHL20 - Kelch-like protein 20 - Homo sapiens (Human) - KLHL20 gene & protein". uniprot.org. Retrieved 2018-05-06.
  48. "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2018-05-04.
  49. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2018-05-04.
  50. "Protein BLAST: search protein databases using a protein query". blast.ncbi.nlm.nih.gov. Retrieved 2018-05-04.
  51. EMBL-EBI. "Bioinformatics Tools for Multiple Sequence Alignment < EMBL-EBI". ebi.ac.uk. Retrieved 2018-05-04.