PRR29

Last updated
PRR29
Identifiers
Aliases PRR29 , C17orf72, proline rich 29
External IDs MGI: 1922823; HomoloGene: 130357; GeneCards: PRR29; OMA:PRR29 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001164257
NM_001191029
NM_001191030
NM_001191031

Contents

NM_029345

RefSeq (protein)

NP_001157729
NP_001177958
NP_001177959
NP_001177960

NP_083621

Location (UCSC) Chr 17: 64 – 64 Mb Chr 11: 106.26 – 106.27 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse
Proline-Rich Protein 29
Identifiers
SymbolPRR29
Alt. symbolsC17orf72
Alt. namesChromosome 17 Open Reading Frame 72
NCBI gene 92340
RefSeq NM_001164257.2
UniProt P0C7W0
Other data
Locus Chr. 17 q23
Search for
Structures Swiss-model
Domains InterPro

PRR29 (proline-rich protein 29) is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23. [5] [6] [7]

Its function is not fully understood. Its name is derived from the chain of 5 proline amino acids located toward the end of the protein. The primary domain within the sequence of this protein is known as DUF4587. [5] It is reported to have high levels of expression in tissues pertaining to the circulatory system and the immune system. [8] It is hypothesized that PRR29 is a nuclear protein that facilitates communication between the nucleus and the mitochondria.

The gene is also commonly known as C17orf72. The gene has a size of 5961 base pairs and contains five exons. [6]

Gene

Location of PRR29 gene relative to its neighbors PRR29 Location.jpg
Location of PRR29 gene relative to its neighbors

PRR29 is located on the long arm of chromosome 17 (17q23.3), starting at 63998344 and ending at 64004305. [6] The gene spans 5961 base pairs and is oriented on the plus strand. Genes SNHG25 and LOC105371858 neighbor PRR29 on chromosome 17.The gene ICAM2 is located on the negative strand, directly opposite of PRR29.

The human gene for PRR29, also referred to as C17of72, spans 6 exons and is located at the genomic coordinates chr17:63,998,351-64,002,516 on the positive DNA strand (hg38). It is 4,166 base pairs in length including introns. After they have been removed, the length is shortened to 3,611 base pairs. [9]

mRNA

The gene has 12 common splice variants and one unspliced form. [10] The longest transcribed mRNA is made up of 3048 base pairs and the transcribed protein sequence for this mRNA is 189 amino acids. [6]

Locus of PRR29 PRR29 Locus on Chromosome 17.png
Locus of PRR29
Genomic view of PRR29 Genomic view of PRR29.png
Genomic view of PRR29

Protein

General properties

Homo sapiens PRR29 has several protein isoforms, with the longest being 236 amino acids. [6] PRR29 has a predicted Isoelectric point of 5.23 and a predicted Molecular weight of 26.1 kilodaltons. PRR29 is characterized by a larger than average proportion of prolines (19.1%) and a smaller than average amount of asparagines (0.4%) [12]

The PRR29 protein is estimated to have a molecular weight of 20.7 kDa. Its isoelectric point is predicted to lie at 4.83. [13]

Domains

PRR29 contains a proline rich region within its sequence from amino acids 73 to 166. A domain of unknown function, DUF 4587, is also present from amino acids 39 to 112 or 113. [14] [15] DUF 4587 is usually between 64 and 79 amino acids long and contains the two sequence motifs QNAQ and HHH. PRR29 is predicted to contain multiple alpha helix and beta-sheet forming regions. Specifically, the DUF 4587 region is predicted to form an alpha helix. [16]

It contains one proline-rich region motif that extends from amino acid 39 to 107. [17]

Secondary and Tertiary Structure

3D structure of PRR29 protein PRR29 Structure.jpg
3D structure of PRR29 protein

The secondary structure is characterized by high confidence in the presence of an alpha helix from amino acid 43 to 70 with the rest consisting of coils. [18] In terms of tertiary structure, predictive tools returned low confidence in all sections of the protein besides the core alpha helix. [19]

Untranslated Regions

The secondary structure of the 5' UTR of PRR29 consists of a singular stem-loop that is almost wholly conserved between orthologs. The 3' UTR is much longer, containing 41 different stem loops in its secondary structure. Any predicted miRNA binding to PRR29 is believed to be nonfunctional. [20]

Subcellular localization

Using PSORTII, PRR29 is predicted to localize in the nucleus of the cell. [21] PSORTII does not predict any targeting sequences or signal peptides. Data on localization of PRR29 within cells shows that it is primarily found in the nucleus, followed by the mitochondria and cytoplasm. [22]

Post-Translational Modification

PRR29 is predicted to undergo sumoylation, acetylation, and serine, threonine and tyrosine phosphorylation. [23] The most common types of post-translational modification that occur for this protein are phosphorylation, glycosylation, hydroxylation, and sumoylation. [24] They contribute to the stability, structure, and folding of the protein.

Interactions

The interactome of PRR29 is not yet well characterized. One experimental study found that a Sus scrofa PRR29-like protein interacts with the N-terminal protease of classical swine fever virus (CSFV). [25]

3D structure prediction from I-TASSER ITasser PRR29 Structure Prediction.gif
3D structure prediction from I-TASSER

Expression

Expression data for PRR29 based on RNA sequencing of human tissue samples PRR29 Expression 1.jpg
Expression data for PRR29 based on RNA sequencing of human tissue samples
Expression data for PRR29 from Illumina bodyMap 2 transcriptome PRR29 Expression 2.png
Expression data for PRR29 from Illumina bodyMap 2 transcriptome

PRR29 is ubiquitously expressed throughout the body. However, there is particularly high expression in the ovaries, muscle, heart, testes, and thymus. [27] According to PaxDb, PRR29 abundance falls in the bottom 5% relative to other proteins. [28]

Its expression is concentrated in related tissues including the spleen, lungs, white blood cells, and heart. [8] In situ hybridization data reveals that expression in the human brain is brain is relatively low in the forebrain but higher in the basal ganglia, midbrain, and hindbrain with the exception of the cerebellar cortex. For comparison, expression in the mouse brain is focused in the isocortex, olfactory region, hippocampus, cortical subplate, and cerebellum.

Transcript-Level Regulation

Prediction of transcription factors that bind to the promoter region for PRR29 include ones involved with the activation of leukocytes and the formation of blood cells. [29] Abundance of the protein relative to others found in the human body is currently unknown. [30]

Homology

A table comparing the divergence of PRR29 in different species relative to the human gene. PRR29 Orthology.png
A table comparing the divergence of PRR29 in different species relative to the human gene.

PRR29 has a single known paralog, C21orf58. [31] PRR29 is well conserved among chordates and PRR29-like proteins containing the DUF 4587 have been predicted in protostomes, such as Mollusca and Annelida. [32] DUF 4587 is highly conserved in all PRR29 orthologs and is also present in its paralog, C21orf58. [33] This domain has been found in species as distantly related as Capitella teleta, which diverged from humans 847 million years ago. [34]

PRR29 Expression Expression profile of PRR29 in mice.png
PRR29 Expression

This gene has orthologs in other animal species. It has only been found in vertebrates with the oldest being the whitespotted bamboo shark. [35] [36]

PRR29 does not have any known paralogs. [35]

Genus and SpeciesCommon NameDivergence date from human (MYA) [37] Protein accession # [38] Sequence Length (aa)Identity to human sequenceSimilarity to human sequence
Homo sapiensHuman0NP_001177958.123611
Nannospalax galiliBlind mole-rat90.9XP_008844796.11810.620.68
Bubalus bubalisWater buffalo97.5XP_006041674.12360.560.65
Monodelphis domesticaopossum163.7XP_007482631.1|1950.510.62
Aquila chrysaetos canadensisGolden eagle320XP_011593496.11700.360.48
Anolis carolinensisAnole Lizard320.5XP_008111611.11860.390.52
Xenopus laevisAfrican clawed frog355.7NP_001079741.13090.340.49
Esox luciusNorthern Pike429XP_010873077.11970.480.74
Branchiostoma floridaelancelet733XP_002603029.113410.370.71
Ciona intestinalisCiona intestinalis733XP_009857401.12270.430.67
Capitella teletaCapitella teleta847ELU017495580.370.51
Lottia giganteaOwl limpet847XP_009046359.12990.330.54
PRR29 Unrooted Phylogenetic Tree Alignment PRR29.pdf
PRR29 Unrooted Phylogenetic Tree

Evolution

PRR29's rate of mutation compared to that of cytochrome c and fibrinogen alpha PRR29 Evolution.png
PRR29's rate of mutation compared to that of cytochrome c and fibrinogen alpha

This gene evolves at a rate higher than that of cytochrome c but lower than that of fibrinogen alpha chain. [36]

Clinical Significance

It is unknown if PRR29 is directly linked to any diseases.

Experiments have been performed on tissue and cell samples in order to observe any potential changes in expression of the protein under different conditions. Samples inflicted with pulmonary sarcoidosis and small cell lung cancer did not experience and significant changes in expression compared to normal cells. [40] [41] Intracranial artery aneurysm did induce change, leading to heightened expression [42]

Related Research Articles

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

<span class="mw-page-title-main">CRACD-like protein</span>

CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

<span class="mw-page-title-main">SMCO3</span> Protein-coding gene in the species Homo sapiens

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

<span class="mw-page-title-main">PRR23A</span> Protein that is encoded by the Proline-Rich 23A (PRR23A) gene

Proline-Rich Protein 23A is a protein that is encoded by the Proline-Rich 23A (PRR23A) gene.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000224383 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000009210 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 "RecName: Full=Proline-rich protein 29 - Protein - NCBI". www.ncbi.nlm.nih.gov.
  6. 1 2 3 4 5 "PRR29 proline rich 29 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-04-28.
  7. "PRR29_HUMAN". Uniprot. Retrieved 28 April 2016.
  8. 1 2 "PRR29 proline rich 29 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov.
  9. "Human Gene PRR29 (ENST00000425164.7) from GENCODE V38". genome.ucsc.edu.
  10. "NCBI Aceview".
  11. "PRR29 proline rich 29 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  12. "Compute pI/Mw". SIB Swiss Institute of Bioinformatics.
  13. https://web.expasy.org/cgi-bin/compute_pi/pi_tool%5B%5D
  14. "PRR29 - Proline-rich protein 29 - Homo sapiens (Human) - PRR29 gene & protein". www.uniprot.org. Retrieved 2016-05-09.
  15. "PRR29 proline rich 29 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov.
  16. "SDSC Biology Workbench-PELE".
  17. "Motif Scan". myhits.sib.swiss.
  18. "I-TASSER server for protein structure and function prediction". zhanggroup.org.
  19. "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk.
  20. "RNA Folding Form". www.unafold.org.
  21. "PSORT II Prediction". psort.hgc.jp. Retrieved 2016-05-09.
  22. https://psort.hgc.jp/cgi-bin/runpsort.pl%5B%5D
  23. "CBS Prediction Servers". www.cbs.dtu.dk. Retrieved 2016-05-09.
  24. "MusiteDeep". www.musite.net.
  25. Li D, Li S, Sun Y, Dong H, Li Y, Zhao B, Guo D, Weng C, Qiu HJ (February 2013). "Poly(C)-binding protein 1, a novel N(pro)-interacting protein involved in classical swine fever virus growth". Journal of Virology. 87 (4): 2072–80. doi:10.1128/JVI.02807-12. PMC   3571455 . PMID   23221550.
  26. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2016-04-28.
  27. 1 2 "Home - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  28. "H.sapiens - Whole organism (Integrated) in PaxDb". pax-db.org. Retrieved 2016-05-09.
  29. https://www.genomatix.de/cgi-bin/eldorado/eldorado.pl?s=8d52110cfa77f5d0982853f81a3ed821 [ bare URL ]
  30. "PAXdb: Protein Abundance Database". pax-db.org.
  31. "C21orf58 chromosome 21 open reading frame 58 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-04-28.
  32. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2016-04-28.
  33. EMBL-EBI, InterPro. "Domain of unknown function DUF4587 (IPR027904) < InterPro < EMBL-EBI". www.ebi.ac.uk. Retrieved 2016-05-09.
  34. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2016-05-09.
  35. 1 2 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov.
  36. 1 2 "TimeTree :: The Timescale of Life". timetree.org.
  37. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2016-05-09.
  38. "National Center for Biotechnology Information". www.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  39. "SDSC Biology Workbench-ClustalW".[ permanent dead link ]
  40. "GDS3580 / 232972_at". www.ncbi.nlm.nih.gov.
  41. "GDS4794 / 232972_at". www.ncbi.nlm.nih.gov.
  42. "GDS3903 / 232972_at". www.ncbi.nlm.nih.gov.