PANO1 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | PANO1 , PANO, proapoptotic nucleolar protein 1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | GeneCards: PANO1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
PANO1 is a protein which in humans is encoded by the PANO1 gene. PANO1 is an apoptosis inducing protein that is able to regulate the function of tumor suppressor. [3] More specifically, P14ARF is a protein in which in humans is modulated by the PANO1 gene. P14ARF is known to function as a tumor suppressor. [4] When PANO1 is highly expressed in the cells, it is able to modulate p14ARF by stabilizing it and protecting it from degradation. [4] With a confidence level of 5 out of 5, PANO1 has been theorized to be expressed in the nucleolus of the cell. [5] PANO1 is an intron-less gene. [3] Intron-less genes only make up about 3% of the human genome. [6] A functional analysis of these types of genes revealed that they often have tissue-specific expression in tissues such as the nervous system and testis. [6] This kind of expression is commonly associated with neuropathies, disease, and cancer. [6] The tissue types that PANO1 has the highest expression in, are the cerebellum regions of the brain as well as pituitary and testis tissues. [7]
PANO1 is also known as Proapoptotic Nucleolar Protein 1, PANO, and Pre-mRNA-splicing Factor CW22-like. PANO1 is located on human chromosome 11 at positions 797,511-799,190 and is positioned on the + strand. [8] Its protein contains 1 exon and 215 amino acids. [3]
PANO1 has one isoform, isoform 1, located in the PONAB and PANTR species. [3] These isoforms have proteins with 215 and 216 amino acids, respectively. [3] No isoforms for the human PANO1 protein could be identified. Human PANO1 protein has a molecular weight of 22.8 kb and a theoretical, isoelectric point of 12.21. [3] From an analysis of the PANO1 protein, it was observed that the protein contains a low amount of lysine and a very low amount of asparagine when compared to other human proteins. [9] The same analysis indicated that the protein does not contain any hydrophobic or transmembrane regions. [9] PANO1 contains 2 cAMP phosphorylation sites, 6 N-myristoylation sites, 4 protein kinase C phosphorylation sites, 3 bipartite nuclear localization signals as well as arginine-rich and proline-rich regions. Using PSORTII, 3 discrimination of nuclear localization signals were identified. [10] Pat4 (RRRR) at position 200, Pat7 (RRRR) at position 201 and a bipartite (RKGTPTARCLGQRTKEK) at position 35. [10] Nuclear localization signals allow proteins to be able to enter the nucleus, but many nuclear proteins possess their own. [10] PANO1 overlaps solute carrier family 25 member 22 (SLC25A22). [5] A causal link between this solute carrier, when upregulated, has been strongly associated with an osteosarcoma's ability to proliferate. [11] In addition to promoting proliferation, SLC25A22 when expressed in vitro, progressed the cell cycle and inhibited apoptosis. [11] It is curious that PANO1, an apoptosis inducer, and SLC25A22, an apoptosis inhibiter, overlap one another.
The structure of PANO1 is 82% disordered meaning the protein is able to move around easily. [12] The secondary structure reveals a beta strand at positions 3-9 as well as alpha helices incorporated throughout. [12]
One promoter region was identified using Genomatix. It is located on the positive strand and is 1040 bp in length. [13] Its start site is located at 795633 and it ends at 796672. [13] It overlaps with the primary transcript which starts at 796633. [13] 440 transcription factors were identified within the promoter region. [13] To highlight some of relevance and importance to PANO1: Wilms tumor suppressor, spermatogenic zip 1 transcription factor, signal transducer and activator of transcription, pleomorphic adenoma gene, general transcription factor IIIA, stimulating protein 1, CCAAT/enhancer binding protein, GC box elements and HMG box-containing protein 1.
Like previously mentioned, PANO1 is detected to be most expressed in the cerebellum, pituitary and testis tissues. [7] Furthermore, NCBI Geo profiles indicated that PANO1 is highly expressed in B-lymphocytes, liposarcomas, and the testis cell lines. [14] It also appears that PANO1 is highly expressed in the androgen-sensitive LAPC-4 cell line with a rank of 97. PANO1 is biased to being expressed in androgen sensitive cells compared to androgen insensitive cells. [14]
A predicted 3' UTR structure was generated using Unafold and depicts predicted stem loop structures. [15] Two stem loop structures are zoomed in on.
A possible cleavage site was identified between amino acids 33 and 34 as depicted in the PANO1 protein model. [3] As mentioned previously, 6 N-myristoylation sites, 2 cAMP phosphorylation sites, and 4 protein kinase C phosphorylation sites are also present. [3]
Motif | Location (aa) |
---|---|
N-myristoylation site | 23-28 |
N-myristoylation site | 33-38 |
N-myristoylation site | 71-76 |
N-myristoylation site | 111-116 |
N-myristoylation site | 120-125 |
N-myristoylation site | 186-191 |
cAMP phosphorylation site | 35-38 |
cAMP phosphorylation site | 136-139 |
protein kinase C phosphorylation site | 34-36 |
protein kinase C phosphorylation site | 40-42 |
protein kinase C phosphorylation site | 57-59 |
protein kinase C phosphorylation site | 191-193 |
PANO1 orthologs were only able to be traced back in divergence to birds. Much more closely related orthologs include primates, as well as marsupial and placental mammals. [16] Specific examples of orthologs can be seen in the table below.
Genus & species | Common name | Tax. group | Date of divergence (MYA) | Accession # | Seq. length (aa) | Seq. identity to HSA prot. (%) | Seq. identity to HSA prot. (#/100) | Seq. similarity to HSA prot. (%) | n | m |
Homo sapiens | Human | human | 0 | XP_034607700.1 | 215 | 100 | 1 | 100 | 0 | 0 |
Pongo abelii | Sumatran orangutan | primate | 15.76 | XP_024111240.1 | 216 | 96.3 | 0.963 | 100 | 0.037 | 3.770186718 |
Sapajus apella | tufted capuchin | primate | 43.2 | XP_024111240.1 | 216 | 96.3 | 0.963 | 100 | 0.037 | 3.770186718 |
Hylobates moloch | silvery gibbon | primate | 19.8 | XP_032005218.1 | 175 | 95.43 | 0.9543 | 81 | 0.0457 | 4.677719156 |
Rhinopithecus roxellana | golden monkey | primate | 28.21 | XP_030773457.1 | 239 | 86.63 | 0.8663 | 86 | 0.1337 | 14.35240101 |
Callithrix jacchus | white-tufted-eared marmoset | primate | 42.9 | XP_009007032.2 | 268 | 72.5 | 0.725 | 91 | 0.275 | 32.15836241 |
Propithecus coquereli | Coquerel's sifaka | primate | 74.1 | XP_012507388.1 | 183 | 69.28 | 0.6928 | 71 | 0.3072 | 36.70139217 |
Phascolarctos cinereus | koala | marsupial | 160 | XP_020836186.1 | 205 | 28.5 | 0.285 | 0.715 | 125.5266099 | |
Leptonychotes weddellii | Weddell seal | placental | 96 | XP_030893169.1 | 182 | 47.88 | 0.4788 | 70 | 0.5212 | 73.64723053 |
Phoca vitulina | harbor seal | placental | 96 | XP_032270736.1 | 264 | 54.55 | 0.5455 | 67 | 0.4545 | 60.60524737 |
Loxodonta africana | African bush elephant | placental | 105 | XP_010600675.1 | 204 | 39 | 0.39 | 85 | 0.61 | 94.16085399 |
Orcinus orca | killer whale | placental | 96 | XP_012391505.1 | 237 | 41.21 | 0.4121 | 91 | 0.5879 | 88.64892406 |
Felis catus | house cat | placental | 96 | XP_023096045.1 | 217 | 59.86 | 0.5986 | 0.4014 | 51.31616836 | |
Ursus arctos horribilis | grizzly bear | placental | 96 | XP_026346196.1 | 290 | 60.34 | 0.6034 | 53 | 0.3966 | 50.51749523 |
Oxyura jamaicensis | ruddy duck | bird | 312 | XP_035173110.1 | 278 | 29.26 | 0.2926 | 0.7074 | 122.894879 | |
Melopsittacus undulatus | budgerigar | bird | 312 | XP_033919672.1 | 228 | 28.73 | 0.2873 | 0.7127 | 124.7228313 |
PANO1 was compared to two other genes, fibrinogen alpha chain as well as cytochrome C. The date of divergence as well as amino acid changes were tracked over many different species types to generate a divergence date vs. number of amino acids changes as seen to the right. PANO1 appears to diverge much more quickly than fibrinogen alpha and much more quickly than cytochrome c.
p14ARF is a protein that is a known tumor suppressor. [4] It does this by controlling cell proliferation and cell survival, however the mechanism for how this process is controlled/modulated remained unclear. [4] PANO1 has been identified to modulate and stabilize p14ARF by stabilizing it and protecting it from degradation in HeLa cells. [4] When PANO1 is over-expressed, as a direct result, p14ARF expression also increases. [4]
When there is an abnormal expression of PANO1 in HeLa cells, scientists have seen a decrease in tumorigenicity in nude mice. [4] Additionally, other diseases that have been associated with PANO1 include hemochromatosis 2. [5]
Basic Leucine Zipper and W2 Domain-Containing Protein 2 is a protein that is encoded by the BZW2 gene. It is a eukaryotic translation factor found in species up to bacteria. In animals, it is localized in the cytoplasm and expressed ubiquitously throughout the body. The heart, placenta, skeletal muscle, and hippocampus show higher expression. In various cancers, upregulation tends to lead to higher severity and mortality. It has been found to interact with SARS-CoV-2.
E3 ubiquitin-protein ligase RNF128 is an enzyme that in humans is encoded by the RNF128 gene.
Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.
Coiled-Coil Domain Containing protein 82 (CCDC82) is a protein that in humans, is encoded for by the gene of the same name, CCDC82. The CCDC82 gene is expressed in nearly all of human tissues at somewhat low rates. As of today, there are no patents involving CCDC82 and the function remains unknown.
EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
C14orf93 is a protein that is encoded in humans by the C14orf93 gene. It is a globular protein with a conserved C-terminus that is localized to the nucleus. While expressed relatively highly in all tissues except nervous tissue, it is expressed particularly highly in T cells and other immune tissues.
BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.
C17orf98 is a protein which in humans is coded by the gene c17orf98. The protein is derived from Homo sapiens chromosome 17. The C17orf98 gene consists of a 6,302 base sequence. Its mRNA has three exons and no alternative splice sites. The protein has 154 amino acids, with no abnormal amino acid levels. C17orf98 has a domain of unknown function (DUF4542) and is 17.6kDa in weight. C17orf98 does not belong to any other families nor does it have any isoforms. The protein has orthologs with high percent similarity in mammals and reptiles. The protein has additional distantly related orthologs across the metazoan kingdom, culminating with the sponge family.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.
Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development
C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.
C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.
Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well-understood by the scientific community, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.
Major facilitator superfamily domain containing 6 like (MFSD6L) is a protein encoded by the MFSD6L gene in humans. The MFSD6L protein is a transmembrane protein that is part of the major facilitator superfamily (MFS) that uses chemiosmotic gradients to facilitate the transport of small solutes across cell membranes.
Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.
Human protein 53 intron 1 (Hp53int1) is a protein encoded by the Hp53int1 gene in humans.