Proline-rich protein 30

PRR30
Identifiers
Aliases: PRR30, C2orf53, proline rich 30
External IDs: MGI: 1923877; HomoloGene: 130773; GeneCards: PRR30
Orthologs
Species | Human | Mouse
RefSeq (mRNA) | NM_178553 | NM_029680
RefSeq (protein) | NP_848648 | NP_083956
Location (UCSC) | Chr 2: 27.14 – 27.14 Mb | Chr 14: 101.44 – 101.44 Mb
PubMed search | [3] | [4]

Proline-rich protein 30 (PRR30 or C2orf53) is a protein that in humans is encoded by the PRR30 gene. [5] PRR30 is a member of the family of proline-rich proteins, which are characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk of neurofibromatosis.

Gene

The PRR30 gene is located on the short arm of human chromosome 2 at band 2p23.3. It is flanked by prolactin regulatory element binding (PREB) and transcription factor 23 (TCF23). The gene has three exons in total and spans 2618 base pairs of linear DNA. [6]

PRR30 Gene Neighborhood

Promoter Region

The PRR30 promoter directly flanks the gene and is 1162 base pairs in length. [8]

Transcript

The PRR30 mRNA transcript is 2063 base pairs in length. There are four splice sites in total, all of which are in the 5′ UTR. There are no known isoforms or alternative splice variants of PRR30.

Protein

Human PRR30 consists of 412 amino acid residues. It has a molecular weight of 44.7 kDa and an isoelectric point of 10.7. [9] [10] It is proline-rich and composed primarily of non-essential amino acids. A region of extreme conservation across orthologs spans residues 187 to 321. [11] PRR30 appears to localize to the cell nucleus. [12] NetNES predicts a nuclear export signal at residues 213 to 216. [13] IntAct predicts that PRR30 interacts with human testis protein 37 (TEX37), cysteine rich tail protein 1 (CYSRT1), and keratin associated protein 6-2 (KRTAP6-2). [14] PRR30 is predicted to undergo post-translational modification in the form of glycosylation and phosphorylation. [15] [16] [17]
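As a rough consistency check on the figures above, the reported molecular weight can be divided by the residue count to give a mean mass per residue. The sketch below uses only the two values stated in this section; the function name is illustrative, and the comparison value of roughly 110 Da per residue is a general rule of thumb for proteins, not a figure taken from the cited sources.

```python
# Values stated in this article for human PRR30.
MASS_KDA = 44.7      # reported molecular weight, in kilodaltons
N_RESIDUES = 412     # reported length, in amino acid residues


def mean_residue_mass_da(mass_kda: float, n_residues: int) -> float:
    """Return the mean mass per residue in daltons."""
    return mass_kda * 1000.0 / n_residues


# About 108.5 Da per residue, close to the ~110 Da rule of thumb.
print(round(mean_residue_mass_da(MASS_KDA, N_RESIDUES), 1))
```

A value near the usual average suggests the reported mass and length are mutually consistent; it says nothing about composition beyond that.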

Adapted Prosite figure showing domains, phosphorylation sites (red), glycosylation sites (grey), and nuclear export signal (green).
I-Tasser predicted protein PRR30. Largely unstructured with minimal folding in highly conserved region.

Structure

PRR30 is an intrinsically disordered protein (IDP) and lacks a defined tertiary or quaternary structure. [12] I-Tasser and Phyre predict minimal coiling throughout PRR30 as a whole. In the region of high conservation, alpha helices and beta sheets are predicted. [19] [20]

Function

Unstructured proteins like PRR30 are highly variable in function. [21] Other proline-rich proteins have been shown to have an affinity for binding calcium in different tissues of the human body. [22] [23] COACH predicts several calcium-associated ligand binding domains across PRR30. The highest-confidence predicted calcium binding domain lies in the region of greatest conservation. [24] [25]

Expression

NCBI EST profiles show differential expression across many tissues, with increased levels in the human testes and pharynx. [26]

Homology

PRR30 is exclusive to mammals but is not present in all mammals. It is highly conserved across primates, but the gene has been lost in members of Rodentia and Laurasiatheria. [27] The most distant known ortholog of PRR30 is found in the Tasmanian devil (Sarcophilus harrisii). The PRR30 gene appears to be evolving at a relatively fast rate. [28]

Comparison of evolutionary histories between Cytochrome C (grey), Fibrinogen (orange), and PRR30 (blue).

Paralogs

There are no known paralogs for PRR30. [30]

Orthologs

Genus & Species [31] | Sequence Identity [31] | Date of Divergence (MYA) [31] | Sequence Length (aa) [31]
Homo sapiens / Human | 100% | 0 | 412
Pan paniscus | 99% | 6.4 | 412
Pan troglodytes / Chimpanzee | 99% | 6.4 | 412
Pongo pygmaeus / Bornean orangutan | 93% | 15.2 | 413
Nomascus leucogenys | 94% | 19.43 | 412
Gorilla gorilla / Western gorilla | 96% | 8.61 | 412
Macaca fascicularis | 93% | 28.1 | 412
Papio anubis | 93% | 28.1 | 412
Macaca nemestrina | 93% | 28.1 | 412
Acinonyx jubatus | 66% | 94 | 394
Bos taurus | 65% | 94 | 396
Bos indicus | 65% | 94 | 396
Heterocephalus glaber | 57% | 88 | 373
Cavia porcellus | 54% | 88 | 391
Octodon degus | 61% | 88 | 402
Mus musculus | 52% | 88 | 399
Echinops telfairi | 61% | 102 | 313
Erinaceus europaeus | 57% | 94 | 375
Tupaia chinensis | 68% | 85 | 410
Sorex araneus | 59% | 94 | 298
Elephantulus edwardii | 51% | 102 | 286
Rhinolophus sinicus | 68% | 94 | 359
Miniopterus natalensis | 63% | 94 | 396
Myotis brandtii | 64% | 94 | 239
Sarcophilus harrisii | 57% | 160 | 376
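One way to put a number on "evolving at a relatively fast rate" is to convert percent identity into a corrected divergence and relate it to divergence date. The sketch below is illustrative only: it assumes a Poisson correction, m = -100 ln(identity / 100), and a least-squares line forced through the origin. The article does not state which method was used for its comparison figure, so both choices are assumptions. The data are a subset of rows copied from the ortholog table, and the function names are hypothetical.

```python
import math

# (species, percent identity to human PRR30, divergence date in MYA),
# a subset of rows copied from the ortholog table in this article.
ORTHOLOGS = [
    ("Pan troglodytes", 99, 6.4),
    ("Gorilla gorilla", 96, 8.61),
    ("Pongo pygmaeus", 93, 15.2),
    ("Macaca fascicularis", 93, 28.1),
    ("Tupaia chinensis", 68, 85),
    ("Mus musculus", 52, 88),
    ("Bos taurus", 65, 94),
    ("Echinops telfairi", 61, 102),
    ("Sarcophilus harrisii", 57, 160),
]


def corrected_divergence(identity_pct: float) -> float:
    """Poisson-corrected divergence, in percent, from percent identity.

    Assumption: multiple substitutions at one site are corrected with
    m = -ln(1 - n), where n is the observed fraction of differing sites.
    """
    return -100.0 * math.log(identity_pct / 100.0)


def rate_through_origin(points) -> float:
    """Least-squares slope of y = k * x, with the line forced through (0, 0)."""
    sxy = sum(x * y for x, y in points)
    sxx = sum(x * x for x, _ in points)
    return sxy / sxx


points = [(mya, corrected_divergence(ident)) for _, ident, mya in ORTHOLOGS]
rate = rate_through_origin(points)
print(f"Corrected divergence: about {rate:.2f}% per million years")
```

Running the same calculation on orthologs of a slowly evolving protein and a quickly evolving one would give reference slopes for comparison.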

Clinical significance

From a study on Neurofibromatosis, this graph shows that patients afflicted with Neurofibromatosis Type 1 are likely to have an extra copy of C2orf53.

In a 2015 study, copy number variation of the PRR30 gene was linked to an increased risk of neurofibromatosis: 78% of the patients with neurofibromatosis type 1-associated cutaneous neurofibromas carried an extra copy of the PRR30 gene. The study did not describe a mechanism explaining the correlation. [32]

References

  1. GRCh38: Ensembl release 89: ENSG00000186143 - Ensembl, May 2017
  2. GRCm38: Ensembl release 89: ENSMUSG00000042888 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Lamesch P, Li N, Milstein S, Fan C, Hao T, Szabo G, et al. (March 2007). "hORFeome v3.1: a resource of human open reading frames representing over 10,000 human genes". Genomics. 89 (3): 307–315. doi:10.1016/j.ygeno.2006.11.012. PMC 4647941. PMID 17207965.
  6. NCBI (National Center for Biotechnology Information) entry on PRR30. https://www.ncbi.nlm.nih.gov/nuccore/148236530
  7. "PRR30 proline rich 30 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-04.
  8. "Genomatix: Genomatix Genome Browser". www.genomatix.de. Retrieved 2017-04-27.
  9. Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–2006. Bibcode:1992PNAS...89.2002B. doi:10.1073/pnas.89.6.2002. PMC 48584. PMID 1549558.
  10. Volker Brendel, Department of Mathematics, Stanford University, Stanford CA 94305, U.S.A., modified; any errors are due to the modification.
  11. Thompson JD, Higgins DG, Gibson TJ (November 1994). "CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice". Nucleic Acids Research. 22 (22): 4673–4680. doi:10.1093/nar/22.22.4673. PMC 308517. PMID 7984417.
  12. Rost B. "PredictProtein - Protein Sequence Analysis, Prediction of Structural and Functional Features". www.predictprotein.org. Retrieved 2017-04-28.
  13. la Cour T, Kiemer L, Mølgaard A, Gupta R, Skriver K, Brunak S (June 2004). "Analysis and prediction of leucine-rich nuclear export signals". Protein Engineering, Design & Selection. 17 (6): 527–536. doi:10.1093/protein/gzh062. PMID 15314210.
  14. Orchard S, Ammari M, Aranda B, Breuza L, Briganti L, Broackes-Carter F, et al. (January 2014). "The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases". Nucleic Acids Research. 42 (Database issue): D358–D363. doi:10.1093/nar/gkt1115. PMC 3965093. PMID 24234451.
  15. Blom N, Sicheritz-Pontén T, Gupta R, Gammeltoft S, Brunak S (June 2004). "Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence". Proteomics. 4 (6): 1633–1649. doi:10.1002/pmic.200300771. PMID 15174133. S2CID 18810164.
  16. Blom N, Gammeltoft S, Brunak S (December 1999). "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites". Journal of Molecular Biology. 294 (5): 1351–1362. doi:10.1006/jmbi.1999.3310. PMID 10600390.
  17. Gupta R, Jung E, Brunak S (2004). Prediction of N-glycosylation sites in human proteins (Report).
  18. de Castro E. "PROSITE". prosite.expasy.org. Retrieved 2017-05-04.
  19. Zhang Y (January 2008). "I-TASSER server for protein 3D structure prediction". BMC Bioinformatics. 9: 40. doi:10.1186/1471-2105-9-40. PMC 2245901. PMID 18215316.
  20. Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJ (June 2015). "The Phyre2 web portal for protein modeling, prediction and analysis". Nature Protocols. 10 (6): 845–858. doi:10.1038/nprot.2015.053. PMC 5298202. PMID 25950237.
  21. Dunker AK, Lawson JD, Brown CJ, Williams RM, Romero P, Oh JS, et al. (2001). "Intrinsically disordered protein". Journal of Molecular Graphics & Modelling. 19 (1): 26–59. doi:10.1016/s1093-3263(00)00138-8. PMID 11381529.
  22. Wong RS, Bennick A (June 1980). "The primary structure of a salivary calcium-binding proline-rich phosphoprotein (protein C), a possible precursor of a related salivary protein A". The Journal of Biological Chemistry. 255 (12): 5943–5948. doi:10.1016/S0021-9258(19)70721-2. PMID 7380845.
  23. Bennick A (June 1982). "Salivary proline-rich proteins". Molecular and Cellular Biochemistry. 45 (2): 83–99. doi:10.1007/bf00223503. PMID 6810092. S2CID 31373141.
  24. Yang J, Roy A, Zhang Y (January 2013). "BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions". Nucleic Acids Research. 41 (Database issue): D1096–D1103. doi:10.1093/nar/gks966. PMC 3531193. PMID 23087378.
  25. Yang J, Roy A, Zhang Y (October 2013). "Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment". Bioinformatics. 29 (20): 2588–2595. doi:10.1093/bioinformatics/btt447. PMC 3789548. PMID 23975762.
  26. Group, Schuler. "EST Profile - Hs.136555". www.ncbi.nlm.nih.gov. Retrieved 2017-05-04.
  27. "Gene: PRR30 (ENSG00000186143) - Gene gain/loss tree - Homo sapiens - Ensembl genome browser 88". www.ensembl.org. Retrieved 2017-05-06.
  28. "Ortholog Search | cegg.unige.ch Computational Evolutionary Genomics Group". www.orthodb.org. Retrieved 2017-05-06.
  29. "Gene: PRR30 (ENSG00000186143) - Gene tree - Homo sapiens - Ensembl genome browser 88". www.ensembl.org. Retrieved 2017-05-06.
  30. "PRR30 Gene - GeneCards | PRR30 Protein | PRR30 Antibody". GeneCards Human Gene Database. Retrieved 2017-04-27.
  31. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (October 1990). "Basic local alignment search tool". Journal of Molecular Biology. 215 (3): 403–410. doi:10.1006/jmbi.1990.9999. PMID 2231712.
  32. Asai A, Karnan S, Ota A, Takahashi M, Damdindorj L, Konishi Y, et al. (March 2015). "High-resolution 400K oligonucleotide array comparative genomic hybridization analysis of neurofibromatosis type 1-associated cutaneous neurofibromas". Gene. 558 (2): 220–226. doi:10.1016/j.gene.2014.12.064. PMID 25562418.