PRP36

Last updated
LOC105371752
Identifiers
Aliases PRP36
External IDs GeneCards: ; OMA:- orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

n/a

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC)n/an/a
PubMed searchn/a
Wikidata
View/Edit Human

PRP36 (Proline Rich Protein 36) is an extracellular protein in Homo sapiens that is encoded by the PRR36 (Proline Rich Region 36) gene that contains a domain of unknown function, DUF4596, towards the C terminus of the protein. [1] The function of PRP36 is unknown, but high gene expression has been observed in various regions of the brain such as the prefrontal cortex, cerebellum, and the amygdala. [2] [3] PRP36 has one alias: Putative Uncharacterized Protein FLJ22184. [4]

Contents

Gene

The human PRR36 gene consists of 7 exons and is 5723 base pairs long. [5]

Locus

PRR36 is located on the short arm of human chromosome 19 at 19p13.2 (region 1, band 3, and sub-band 2). [5] The gene spans between base pair numbers 7868719 and 7874441 on chromosome 19 and is located between two other genes—LYPLA2P2, a pseudogene, and EVI5L, a gene which produces a protein that regulates Rab GTPase activity. [6] [7]

Relative location of PRR36 on the short arm of Chromosome 19 Location of EVI5L on Chromosome 19.png
Relative location of PRR36 on the short arm of Chromosome 19

mRNA and splice variants

Alternative splicing of the PRR36 gene results in two transcript variants. PRR36 (FLJ22184) Transcript Variant 1, seen in the image below, is 4518 base pairs long and consists of six exons, of which the last five are utilized in protein coding. The protein produced, PRP36, is made up of 1346 amino acids. [8] PRR36 Transcript Variant 2 is 780 base pairs long and consists of five exons. PRR36 Transcript Variant 2 theoretically encodes a protein 260 amino acids in length. However, it is currently suspected that this variant transcript never gets translated. [5]

Splicing of FLJ22184/PRR36 gene for transcript variant 1 AceView FLJ22184 splicing.png
Splicing of FLJ22184/PRR36 gene for transcript variant 1

PRR36 Transcript Variant 1 has only been found to have only one polyadenylation site. [9]

Protein

Domain and motifs

DUF4596 on human PRP36 is 47 amino acids long, has an isoelectric point of 3.77, and is almost completely conserved across mammals. [10] Despite lacking a signal peptide, PRP36 is predicted to be excreted from the cell after it undergoes processing. [11] [12]

A few different tandem repeats, separated repeats, and repeated sequences exist throughout PRP36. These repeats are observable in primate PRP36 orthologs but are absent in PRP36 orthologs from more distantly related species such as the opossum, suggesting that some form of evolution has been occurring throughout the PRP36 sequence in relatively recent history. [10]

Multiple sequence alignment of the DUF4596 Region across 20 PRP36 Orthologs. Green indicates complete conservation, blue identical residues, and magenta chemical similarity. DUF4596 Region from PRP36.png
Multiple sequence alignment of the DUF4596 Region across 20 PRP36 Orthologs. Green indicates complete conservation, blue identical residues, and magenta chemical similarity.

Composition

PRP36 is 1346 amino acids long and is proline rich, meaning that a greater proportion of proline residues exist throughout the protein, including the DUF4596 domain, in comparison with other human proteins. Proline rich proteins are often observed to be intrinsically unstructured and have been connected with protein-protein interactions in signaling pathways. [13] However, it isn't certain whether these traits hold true in PRP36. In PRP36 the amino acids isoleucine, tyrosine, and asparagine are present at a decreased proportion compared to a typical human protein. Two highly positive sequences exist towards the N terminus of PRP36 while a highly negative sequence exists within the DUF4596 domains towards the C terminus. As a whole, however, PRP36 appears to be a slightly basic and overall positively charged protein, as it has a corresponding isoelectric point of 10.98. [10] PRP36 is a polar and soluble protein. [14]

Post-translational modifications

PRP36 is predicted to contain 24 phosphorylation sites in humans, including 14 serine, 9 threonine, and 1 tyrosine site. [15] [16] [17] Additionally, there are 8 predicted N-Acetylglucosamine attachment sites and 2 highly conserved predicted SUMOylation sites. [18] [19]

Secondary structure

PRP36 secondary structure has not been explicitly determined, but predictions based on the PRR36 mRNA give some possibilities. Alpha-helixes, beta sheets, and other structure characteristics fail to be conserved across PRP36 orthologs with the exception of an alpha-helix alpha-helix beta-strand beta-strand motif that was highly conserved across mammals. [10] This motif begins slightly before and carries into the DUF4596 region, suggesting a high importance for this domain in PRP36 function.

Interacting proteins

PRP36 has medium scores for predicted interaction with two other proteins of unknown function, OVCH1 and FAM179A. [20] These predictions, however, have not been experimentally determined, so the confidence of protein-protein interaction with PRP36 isn't very high. [20]

Cellular location

No signal peptide or other marker is predicted to exist with the PRP36 sequence. [21] However, according to Phobius, PRP36 is predicted to be a non-cytoplasmic protein existing the extracellular space. [21] Assuming this prediction is correct, this might indicate that PRP36 undergoes unconventional protein secretion.

Expression

Promoter

A single promoter is predicted to exist by Genomatix for the PRP36 protein. This promoter exists on the negative strand from position 7939226 to 7939826 and is 601 base pairs in length. [22] The PRP36 promoter region contains a number of predicted transcription factors of various types including various zinc fingers, E2F factors, and CDF factors. Of particulate note is the presence of a XGene Promoter Element on the minus strand which is a mediator of RNA polymerase II for promoters lacking a TATA box, as is the case for the PRP36 promoter. [23] The following table gives 12 transcription factors that interact with PRP36 as predicted by the ElDorado tool from Genomatix—all shown factors received a minimum Matrix Sim score of 0.877. [23]

Matrix FamilyDetailed Matrix InformationDetailed Family InformationStart PositionEnd PositionAnchor PositionStrandSequence
O$XCPEX gene core promoter element 1Activator-, mediator- and TBP-dependent core promoter element for RNA polymerase II transcription from TATA-less promoters513523518+ggGCGGgaccg
V$ZF5FZF5 POZ domain zinc finger, zinc finger protein 161ZF5 POZ domain zinc finger477491484+gagcgCGCGcccccg
V$GLIFGLIS family zinc finger 2GLI zinc finger family163224+tcgaCCCCccaaccaga
V$ZF02Zinc finger and BTB domain containing 7A, PokémonC2H2 zinc finger transcription factors 2550572561+gcagcCCCCtcccctcgcctcct
V$E2FFE2F transcription factor 6E2F-myc activator/cell cycle regulator490506498-cgcggGCGGgagagccg
V$E2FFE2F transcription factor 2E2F-myc activator/cell cycle regulator476492484+cgagcGCGCgcccccgg
V$SP1FSp2, member of the Sp/XKLF transcription factors with three C2H2 zinc fingers in a conserved carboxyl-terminal domainGC-Box factors SP1/GC189205197-ccaggaggcgGGACcac
V$MAZFMyc associated zinc finger protein (MAZ)Myc associated zinc fingers557569563-aggcGAGGggagg
V$E2FFE2F transcription factor 2E2F-myc activator/cell cycle regulator411427419+ccaaaGCGCgcttctcc
O$XCPEX gene core promoter element 1Activator-, mediator- and TBP-dependent core promoter element for RNA polymerase II transcription from TATA-less promoters493503498-ggGCGGgagag
V$E2FFE2F transcription factor 3E2F-myc activator/cell cycle regulator475491483-cggggGCGCgcgctcga
O$XCPEX gene core promoter element 1Activator-, mediator- and TBP-dependent core promoter element for RNA polymerase II transcription from TATA-less promoters190200195-agGCGGgacca

Expression

Unigene's EST cDNA Tissue Abundance display and Protein Atlas shows PRP36 as having significant expression levels in the brain, embryonic tissue, eyes, intestines, kidneys, nerves, and ovaries. [24] Additional evidence supports some of these findings, as analysis of normal tissues revealed that over 50% of the cells in the cerebellum, fetal brain, prefrontal cortex, and superior cervical ganglion expressed PRP36. [25] [26] PRP36 appears to be over-expressed in cell samples taken from patients with ductal carcinomas of the mammary gland, suggesting that the disease state and PRP36 expression might be connected. [27]

Unigene EST tissue expression data for human PRP36 protein Unigene EST Tissue Expression for PRP36.png
Unigene EST tissue expression data for human PRP36 protein

Homology

Orthologs

PRP36 has no known paralogs in humans, but a number of orthologs were found to exist in species throughout the mammalian kingdom. [28] PRP36 is highly conserved across primates, but a few short sequences unique to the human version of the gene do exist. [10] Based on the lack of conservation across all mammals a rapid evolution for PRP36 can be suggested. However, the DUF4596 region is highly conserved across mammals, suggesting that the domain is critical to PRP36 function while the rest of protein is more easily manipulated without leading to harm. A list of orthologs for PRP36 can be found below [28]

#Genus and speciesCommon nameDivergence (MYA) [29] Accession numberE-valueLength (aa)Identity (%)Similarity (%)
1Homo sapiens Human 0 NP_001177396 01346100100
2Pan troglodytes Chimpanzee 6.3 XP_009432808 012808788
3Callithrix jacchus Marmoset 42.6 XP_008985368 012437879
4Saimiri boliviensis boliviensis Black-capped squirrel monkey 42.6 XP_010347967.1 011617577
5Otolemur garnetti Northern greater galago 74.0 XP_003793753 5x10-15810845964
6Mesocricetus auratus Golden hamster 92.3 XP_005085339 2x10-10610348286
7Mus musculus House mouse 92.3 XP_006508977 6x10-10410466771
8Nannospalax galili Upper Galilee Mountains blind mole-rat 92.3 XP_008822351.1 1x10-11711676265
9Jaculus jaculus Lesser Egyptian jerboa 92.3 XP_004672230 1x10-12310075559
10Ictidomys tridecemlineatus Thirteen-lined ground squirrel 92.3 XP_005332306.1 1x10-1118265865
11Bubalus bubalis Water buffalo 94.2 XP_006046812 5x10-11910687577
12Felis catus House cat 94.2 XP_011287775 2x10-896687377
13Camelus dromedarius Dromedary 94.2 XP_010976676.1 4x10-1198686570
14Myotis lucifugus Little brown bat 94.2 XP_006101945.1 4x10-1119355863
15Balaenoptera acutorostrata scammoni Common minke whale 94.2 XP_007169287.1 7x10-1326846267
16Bison bison bison American bison 94.2 XP_010826582 8x10-10110545158
17Vicugna pacos Alpaca 94.2 XP_006206574 8x10-646805156
18Echinops telfairi Lesser hedgehog tenrec 98.7 XP_004717416 2x10-968506165
19Trichechus manatus latirostris Florida manatee 98.7 XP_004378653.1 4x10-1148795963
20Monodelphis domestica Gray short-tailed opossum 162.6 XP_007489701.1 4x10-856534854

Evolution

Multiple sequence alignment suggests that PRP36 evolved early in mammalian lineage. [10] Mammals very distantly related to human beings, such as the opossum, have a version of the PRP36, suggesting that the protein came about prior to that evolutionary divergence. However, with exception to the DUF4596 domain, very few areas within the PRP36 sequence are conserved. [10]

Clinical significance

At this time, the function of the PRP36 protein is not known. However, some speculation of the function can be made. In 2009, it was discovered that the source of a patient's phenotypes was a genetic condition involving a 19p13.2 microdeletion—a very small piece of chromosome was missing from the patient (the entire 19p13.2 region was not missing). [30] [31] Additional diagnoses have since been made, and a few patients have been found to have microdeltions that involve the region in which the PRR36 gene is found, meaning the PRP36 protein would not be found in these individuals. However, this region also included other genes whose functions are well known; for example the obesity observed in the patients can be traced to the deletion of the insulin receptor gene. Other symptoms, such a learning disabilities and speech impediments can be tied to similar gene deletions. [30] However, it is possible that PRP36 absence causes a minor disability that is masked by these other symptoms. Additionally, it is possible that PRP36 plays a secondary role with one or more of these other deleted genes. This second option can be slightly supported by noting that other proline rich proteins that have known function, both on human chromosome 19 and other chromosomes, tend to more frequently produce proteins that are involved in protein-protein interactions than many other general types of genes. [32]

Related Research Articles

<span class="mw-page-title-main">QRICH1</span> Protein found in humans

QRICH1, also known as Glutamine-rich protein 1, is a protein that in humans is encoded by the QRICH1 gene. One notable feature of this protein is that it contains a Caspase Activation Recruitment Domain, also known as a CARD domain. As a result of having this domain, QRICH1 is believed to be involved in apoptotic, inflammatory, and host-immune response pathways.

<span class="mw-page-title-main">Proline-rich 12</span> Protein-coding gene in the species Homo sapiens

Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.

<span class="mw-page-title-main">Protein FAM46B</span> Protein-coding gene in the species Homo sapiens

Protein FAM46B also known as family with sequence similarity 46 member B is a protein that in humans is encoded by the FAM46B gene. FAM46B contains one protein domain of unknown function, DUF1693. Yeast two-hybrid screening has identified three proteins that physically interact with FAM46B. These are ATX1, PEPP2 and DAZAP2.

<span class="mw-page-title-main">ARMH3</span> Protein-coding gene in the species Homo sapiens

ARMH3 or Armadillo Like Helical Domain Containing 3, also known as UPF0668 and c10orf76, is a protein that in humans is encoded by the ARMH3 gene. Its function is not currently known, but experimental evidence has suggested that it may be involved in transcriptional regulation. The protein contains a conserved proline-rich motif, suggesting that it may participate in protein-protein interactions via an SH3-binding domain, although no such interactions have been experimentally verified. The well-conserved gene appears to have emerged in Fungi approximately 1.2 billion years ago. The locus is alternatively spliced and predicted to yield five protein variants, three of which contain a protein domain of unknown function, DUF1741.

<span class="mw-page-title-main">TM6SF2</span> Protein-coding gene in the species Homo sapiens

TM6SF2 is the Transmembrane 6 superfamily 2 human gene which codes for a protein by the same name. This gene is otherwise called KIAA1926. Its exact function is currently unknown.

<span class="mw-page-title-main">DEPDC1B</span> Protein-coding gene in the species Homo sapiens

DEP Domain Containing Protein 1B also known as XTP1, XTP8, HBV XAg-Transactivated Protein 8, [formerly referred to as BRCC3] is a human protein encoded by a gene of similar name located on chromosome 5.

Proline-rich protein 21 (PRR21) is a protein of the family of proline-rich proteins. It is encoded by the PRR21 gene, which is found on human chromosome 2, band 2q37.3. The gene exists in several species, both vertebrates and invertebrates, including humans. However, the protein have few conserved regions among species.

Transmembrane protein 251, also known as C14orf109 or UPF0694, is a protein that in humans is encoded by the TMEM251 gene. One notable feature of this protein is the presence of proline residues on one of its predicted transmembrane domains., which is a determinant of the intramitochondrial sorting of inner membrane proteins.

Leucine-Rich Single-Pass Membrane Protein 1 (LSMEM1) is a protein that, in humans, is encoded by the LSMEM1 gene.

<span class="mw-page-title-main">C3orf70</span> Protein-coding gene in the species Homo sapiens

C3orf70 also known as Chromosome 3 Open Reading Frame 70, is a 250aa protein in humans that is encoded by the C3orf70 gene. The protein encoded is predicted to be a nuclear protein; however, its exact function is currently unknown. C3orf70 can be identified with known aliases: Chromosome 3 Open Reading Frame 70, AK091454, UPF0524, and LOC285382.

<span class="mw-page-title-main">TMEM249</span> Protein-coding gene in the species Homo sapiens

TMEM249 is a protein that in humans is encoded by the C8orfk29 gene.

LOC105377021 is a protein which in humans is encoded by the LOC105377021 gene. LOC105377021 exhibits expressional pathology related to breast cancer, specifically triple negative breast cancer. LOC105377021 contains a serine rich region in addition to predicted alpha helix motifs.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

<span class="mw-page-title-main">Proline-rich protein 30</span>

Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.

UPF0575 protein C19orf67 is a protein which in humans is encoded by the C19orf67 gene. Orthologs of C19orf67 are found in many mammals, some reptiles, and most jawed fish. The protein is expressed at low levels throughout the body with the exception of the testis and breast tissue. Where it is expressed, the protein is predicted to be localized in the nucleus to carry out a function. The highly conserved and slowly evolving DUFF3314 region is predicted to form numerous alpha helices and may be vital to the function of the protein.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">TMEM171</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.

<span class="mw-page-title-main">FAM155B</span> Protein-coding gene in humans

Family with Sequence Similarity 155 Member B is a protein in humans that is encoded by the FAM155B gene. It belongs to a family of proteins whose function is not yet well understood by the scientific community. It is a transmembrane protein that is highly expressed in the heart, thyroid, and brain.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

References

  1. "proline-rich protein 36 [Homo sapiens]". NCBI Protein. National Center for Biotechnology Information. Retrieved 1 May 2015.
  2. "FLJ22184 - Normal tissues of various types". GEO Profiles. National Center for Biotechnology Information. Retrieved 1 May 2015.
  3. "FLJ22184 - Large-scale analysis of the human transcriptome (HG-U133A)". GEO Profiles. National Center for Biotechnology Information. Retrieved 1 May 2015.
  4. "Submitted name: Protein FLJ22184". UniProt. UniProt Consortium. Retrieved 1 May 2015.
  5. 1 2 3 "PRR36 proline rich 36 [ Homo sapiens (human) ]". Gene. National Center for Biotechnology Information. Retrieved 1 May 2015.
  6. Itoh T, Satoh M, Kanno E, Fukuda M (Sep 2006). "Screening for target Rabs of TBC (Tre-2/Bub2/Cdc16) domain-containing proteins based on their Rab-binding activity". Genes to Cells. 11 (9): 1023–37. doi:10.1111/j.1365-2443.2006.00997.x. PMID   16923123. S2CID   43042490.
  7. "LYPLA2P2 (lysophospholipase II pseudogene 2)". Atlas of Genetics and Cytogenetics in Oncology and Haematology. Retrieved 1 May 2015.
  8. "Homo sapiens proline rich 36 (PRR36), transcript variant 1, mRNA". Nucleotide. National Center for Biotechnology Information. Retrieved 1 May 2015.
  9. "Gene Finding in Eukaryota". Softberry. Retrieved 1 May 2015.
  10. 1 2 3 4 5 6 7 "SDSC Biology Workbench". Department of Bioengineering. University of California Sand Diego. Retrieved 1 May 2015.
  11. "Transmembrane Topology". Phobius. Stockholm Bioinformatics Centre. Retrieved 1 May 2015.
  12. Petersen TN, Brunak S, von Heijne G, Nielsen H (29 September 2011). "SignalP 4.0: discriminating signal peptides from transmembrane regions". Nature Methods. 8 (10): 785–6. doi: 10.1038/nmeth.1701 . PMID   21959131. S2CID   16509924.
  13. Kay BK, Williamson MP, Sudol M (Feb 2000). "The importance of being proline: the interaction of proline-rich motifs in signaling proteins with their cognate domains". FASEB Journal. 14 (2): 231–41. doi: 10.1096/fasebj.14.2.231 . PMID   10657980. S2CID   10475561.
  14. "SOSUI". Classification and Secondary Structure Prediction of Membrane Proteins. Mitaku Group.
  15. Xue Y, Ren J, Gao X, Jin C, Wen L, Yao X (Sep 2008). "GPS 2.0, a tool to predict kinase-specific phosphorylation sites in hierarchy". Molecular & Cellular Proteomics. 7 (9): 1598–608. doi: 10.1074/mcp.M700574-MCP200 . PMC   2528073 . PMID   18463090.
  16. Blom N, Sicheritz-Pontén T, Gupta R, Gammeltoft S, Brunak S (Jun 2004). "Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence". Proteomics. 4 (6): 1633–49. doi:10.1002/pmic.200300771. PMID   15174133. S2CID   18810164.
  17. Blom N, Gammeltoft S, Brunak S (Dec 1999). "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites". Journal of Molecular Biology. 294 (5): 1351–62. doi:10.1006/jmbi.1999.3310. PMID   10600390.
  18. "SUMOplot™ Analysis Program". ABGENT. WuXi AppTec. Retrieved 1 May 2015.
  19. Gupta R, Brunak S (2002). "Prediction of glycosylation across the human proteome and the correlation to protein function". Pacific Symposium on Biocomputing: 310–22. doi:10.1142/9789812799623_0029. ISBN   978-981-02-4777-5. PMID   11928486.
  20. 1 2 "STRING10 FLJ22184". STRING. Retrieved 9 May 2015.
  21. 1 2 "Phobius PRP36". Phobius. Stockholm Bioinformatics Centre. Retrieved 9 May 2015.
  22. "Gene2Promoter". Genomatix Software Suite. Genomatix. Retrieved 3 May 2015.
  23. 1 2 "ElDorado". Genomatix Software Suite. Genomatix. Retrieved 3 May 2015.
  24. "EST Profile FLJ22184". UniGene. National Center for Biotechnology Information. Retrieved 9 May 2015.
  25. "FLJ22184 - Large-scale analysis of the human transcriptome (HG-U133A)". GEO Profiles. National Center for Biotechnology Information. Retrieved 3 May 2015.
  26. "FLJ22184-Normal tissue of various types". GEO Profiles. National Center for Biotechnology Information. Retrieved 3 May 2015.
  27. "Ductal carcinoma in situ: mammary gland". GEO Profiles. National Center for Biotechnology Information. Retrieved 9 May 2015.
  28. 1 2 "BLAST". Basic Local Alignment Search Tool. National Center for Biotechnology Information. Retrieved 3 May 2015.
  29. Kumar S, Hedges S. "TimeTree: a public knowledge-base of divergence times among organisms". TimeTree :: The Timescale of Life.
  30. 1 2 Wangensteen T, Retterstøl L, Rødningen OK, Hjelmesaeth J, Aukrust P, Halvorsen B (Jun 2013). "De novo 19p13.2 microdeletion encompassing the insulin receptor and resistin genes in a patient with obesity and learning disability". American Journal of Medical Genetics Part A. 161A (6): 1480–6. doi:10.1002/ajmg.a.35927. PMID   23637016. S2CID   43594020.
  31. Lysy PA, Ravoet M, Wustefeld S, Bernard P, Nassogne MC, Wyns E, Sibille C (Nov 2009). "A new case of syndromic craniosynostosis with cryptic 19p13.2-p13.13 deletion". American Journal of Medical Genetics Part A. 149A (11): 2564–8. doi:10.1002/ajmg.a.33056. PMID   19842200. S2CID   34800233.
  32. Johns A, Wooster MJ (Apr 1975). "The inhibitory effects of prostaglandin E1 on guinea-pig ureter". Canadian Journal of Physiology and Pharmacology. 53 (2): 239–47. doi:10.1139/y75-035. PMID   1137821.