PRR32

Last updated
PRR32
Identifiers
Aliases PRR32 , CXorf64, proline rich 32, chromosome x open reading frame 64, Cxorf33
External IDs MGI: 1916050 HomoloGene: 90132 GeneCards: PRR32
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001122716

NM_026841

RefSeq (protein)

NP_001116188

NP_081117

Location (UCSC) Chr X: 126.82 – 126.82 Mb Chr X: 44.18 – 44.18 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

PRR32 is a protein that in humans is encoded by the CXorf64 (Chromosome X open reading frame 64) gene. It was also found that the homologs of the PRR32 gene is conserved in chimpanzee, Rhesus monkey, dog, cow, mouse, and rat. It was also found through ncbi that 82 organisms have orthologs with human gene PRR323. [5]

Contents

PRR32 (CXorf64) seems to be involved with a group of genes over-expressed in ALS (Amyotrophic lateral sclerosis), evident from a study aiming to study gene expression patterns in muscles from patients with amyotrophic lateral sclerosis and multifocal motor neuropathy. [6]

Gene

The gene is located on Chromosome X, at position Xq25, and is 2023 bases long (significantly small when compared to other genes). Its location is at the antisense (+) strand. As shown in the graphic below, it is flanked by upstream by the DKAF12l1 shown in green on the left and downstream by LOC10. CXorf64 is the blue column shown in the middle. [7]

Figure 1 - Genomic Context illustration for CXorf64. This segment depicts approximately 1,200,000 base pairs of chromosome 10. The green lines depict the start of transcription while the red diamonds indicate the termination of transcription. CXorf64 is transcribed in the opposite direction of its flanking genes GENE1.png
Figure 1 – Genomic Context illustration for CXorf64. This segment depicts approximately 1,200,000 base pairs of chromosome 10. The green lines depict the start of transcription while the red diamonds indicate the termination of transcription. CXorf64 is transcribed in the opposite direction of its flanking genes

Transcript

CXorf64 has 1 exon which ultimately forms 1 transcript variant. This gene is expressed most in muscle tissue. There are no known isoforms in which CXorf64 is expressed. [8]

Protein

General Properties

PropertyCleaved ProteinMature Protein
Amino Acid Length298298
Isoelectric Point 7.34~7.2-7.4
Molecular Weight31.9 kDa~31.9-33 kDa

Structure

Shown to the right is a predicted tertiary structure of the protein. It is marked by long alpha-helices localized to the end of the protein opposite the N- and C- terminal ends

Predicted tertiary structure of PRR32 generated by software. Based on a protein template covering 80% of the protein sequence with 96% identity. Coilll.png
Predicted tertiary structure of PRR32 generated by software. Based on a protein template covering 80% of the protein sequence with 96% identity.

Expression

According to microarray-assessed tissue expression analysis by NCBI GEO, the gene CXorf64 has average expression levels in most tissues save for, prostate, and heart, fat, and endometrium which have relatively low levels of expression (~25th percentile of tissue gene expression) . Almost every microarray study determined very low levels of gene expression across all tissues assessed . This evidence prompted further analysis with other tools. While no tissue analysis was present on Allen Brain Atlas or genepaint, the Human Protein Atlas suggested that the gene was more highly expressed in heart, prostate, and a and fat tissues .

NCBI Expression data. This microarray study RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes. Expression2.png
NCBI Expression data. This microarray study RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes.
(Embryonic anterior and posterior palate). This study indicated that the posterior palate gene level expression was much higher than that of the anterior palate. Gene expression percentiles in the posterior palate averaged approximately 75th%, while in the anterior palate, gene expression was in the 25th percentile. DISEASE.png
(Embryonic anterior and posterior palate). This study indicated that the posterior palate gene level expression was much higher than that of the anterior palate. Gene expression percentiles in the posterior palate averaged approximately 75th%, while in the anterior palate, gene expression was in the 25th percentile.

Sub-cellular Location

It synthesizes in the cell cytoplasm and is predicted to localize in the mitochondria. It is predicted to localize in the nucleus, plasma membrane, extracellular, mitochondrion, peroxisome, and cytosol.

Subcellular Location of PRR32. CellIMA.png
Subcellular Location of PRR32.

Post-Translational Modification

N-linked glycosylation

It was found that there are several post-translational modifications that are performed on PRR32. These include several N-linked glycosylation sites that were predicted with high confidence. Glycosylation is known to play a part in cell-cell adhesion ( `a mechanism employed by cells of the immune system) via sugar-binding proteins. The graph illustrates predicted N-glyc sites across the protein chain (x-axis represents protein length from N- to C-terminal). A position with a potential (vertical lines) crossing the threshold (horizontal line at 0.5) is predicted glycosylated. [12]

Glycosylation sites of PRR32. PHOOOOO.png
Glycosylation sites of PRR32.

Homology and Evolution

A list was generated of orthologs using the NCBI (protein) blast for PRR32, as well as UCSC’s BLAT (BLAST-Like Alignment Tool. These search and analyzing engines generated orthologs that are 80+long. This list was then narrowed down to a list of 30 different species. From those 30 species, they were divided into certain portions. The first to be selected were the primates since they are the closest related species when it comes to genetics. The similarity generated when compared to humans was approximately 95-99%. Then, using the edit and resubmit option on Blast (proteins), I excluded all primates in order to widen the species selection. The BLAST search again yielded many mammals including Rabbit, Dog, Ferret, Rhino, and many more. These species’ similarity percentage was anywhere between 69%-89%. Next, to further diversify my species selection, again using the same method, I excluded all mammals from the BLAST sequencing. This time, there were absolutely no matches in the inquiry. This action was repeated a few more times and it seems that the PRR32 gene is only relevant in mammals. The google spreadsheet created below lists the selection of species and all important information such as, order, year of divergence, etc. One thing to be noted is that the protein sequences for PRR32 were highly conserved amongst closely related species of Homo sapiens such as chimpanzees, gorillas and orangutans.

Clinical Significance

Amyotrophic lateral sclerosis

An experiment analyzed gene expression pattern in muscles from patients with amyotrophic lateral sclerosis (ALS) and multifocal motor neuropathy (MMN) compared to controls. Biopsied skeletal muscles from three ALS, three MMN and three control subjects had total RNA extracted and subjected to genome-wide gene expression analysis using Affymetrix GeneChip Exon 1.0 ST array. The most significant expression pattern differences were confirmed with RT-PCR in four additional ALS patients. Results showed that over 3000 genes were identified across the groups using q < 10%. Among 50 genes that were overexpressed only in the ALS group were: leucine-rich repeat kinase-2, follistatin, collagen type XIX alpha-1, ceramide kinase-like, sestrin-3 and CXorf64. No genes were significantly overexpressed in MMN alone. Underexpressed genes only in ALS included actinin αα3, fructose-1,6-bisphosphatase-2 and homeobox C10; whereas only in MMN: hemoglobin A1 and CXorf64. Ankyrin repeat domain-1 was overexpressed in both groups. Underexpressed genes in both groups included myosin light chain kinase-2, enolase-3 and 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase-1. Validation analysis using RT-PCR confirmed the data for leucine-rich repeat kinase-2, follistatin, collagen type XIX alpha-1, ceramide kinase-like, sestrin-3 and CXorf64. In conclusion, there is differential tissue-specific gene expression in patients with ALS relative to MMN and controls. Further studies are necessary to evaluate the identified genes in larger patient groups and different tissues.

Related Research Articles

<span class="mw-page-title-main">Peripherin</span>

Peripherin is a type III intermediate filament protein expressed mainly in neurons of the peripheral nervous system. It is also found in neurons of the central nervous system that have projections toward peripheral structures, such as spinal motor neurons. Its size, structure, and sequence/location of protein motifs is similar to other type III intermediate filament proteins such as desmin, vimentin and glial fibrillary acidic protein. Like these proteins, peripherin can self-assemble to form homopolymeric filamentous networks, but it can also heteropolymerize with neurofilaments in several neuronal types. This protein in humans is encoded by the PRPH gene. Peripherin is thought to play a role in neurite elongation during development and axonal regeneration after injury, but its exact function is unknown. It is also associated with some of the major neuropathologies that characterize amyotropic lateral sclerosis (ALS), but despite extensive research into how neurofilaments and peripherin contribute to ALS, their role in this disease is still unidentified.

<span class="mw-page-title-main">ALS2</span> Protein-coding gene in the species Homo sapiens

Alsin is a protein that in humans is encoded by the ALS2 gene. ALS2 orthologs have been identified in all mammals for which complete genome data are available.

<span class="mw-page-title-main">Proline-rich 12</span> Protein-coding gene in the species Homo sapiens

Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.

PRP36 is an extracellular protein in Homo sapiens that is encoded by the PRR36 gene that contains a domain of unknown function, DUF4596, towards the C terminus of the protein. The function of PRP36 is unknown, but high gene expression has been observed in various regions of the brain such as the prefrontal cortex, cerebellum, and the amygdala. PRP36 has one alias: Putative Uncharacterized Protein FLJ22184.

<span class="mw-page-title-main">SHOC1</span>

Shortage In Chiasmata 1, also known as SHOC1, is a protein that in humans is encoded by the SHOC1 gene.

CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).

<span class="mw-page-title-main">FAM76A</span>

FAM76A is a protein that in Homo sapiens is encoded by the FAM76A gene. Notable structural characteristics of FAM76A include an 83 amino acid coiled coil domain as well as a four amino acid poly-serine compositional bias. FAM76A is conserved in most chordates but it is not found in other deuterostrome phlya such as echinodermata, hemichordata, or xenacoelomorpha—suggesting that FAM76A arose sometime after chordates in the evolutionary lineage. Furthermore, FAM76A is not found in fungi, plants, archaea, or bacteria. FAM76A is predicted to localize to the nucleus and may play a role in regulating transcription.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">KIAA1211L</span>

KIAA1211L is a protein that in humans is encoded by the KIAA1211L gene. It is highly expressed in the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, KIAA1211L is associated with certain mental disorders and various cancers.

<span class="mw-page-title-main">C3orf62</span>

Chromosome 3 Open Reading Frame 62 (C3orf62), is a protein that in humans is encoded by the C3orf62 gene. C3orf62 is a glycine depleted protein relative to the amount of glycine in proteins in the rest of the genome. C3orf62 has a KKXX-like motif and is predicted to be localized in the nucleus. Expression of C3orf62 remains highest in whole blood.

<span class="mw-page-title-main">Proline-rich protein 30</span>

Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.

<span class="mw-page-title-main">LOC101059915</span> Protein-coding gene in the species Homo sapiens

LOC101059915 is a protein, which in humans is encoded by the LOC101059915 gene. It is located on the X chromosome and has restricted expression in the testis.

<span class="mw-page-title-main">PROB1</span>

Proline-rich basic protein 1(PROB1) is a protein encoded by the PROB1 gene located on human chromosome 5, open reading frame 65. PROB1 is also known as C5orf65 and weakly similar to basic proline-rich protein.

<span class="mw-page-title-main">TMEM171</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.

<span class="mw-page-title-main">CXorf38 Isoform 1</span> Human protein

Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.

<span class="mw-page-title-main">C7orf57</span> Uncharacterized protein in humans

Chromosome 7 open reading frame 57 is an uncharacterized protein found in humans and several other homologs. It is encoded by the C7orf57 gene. This gene is found to be greatly expressed in the Fallopian tubes, testes, lungs, hippocampus, hypothalamus, and caudate. There are three isoforms of the gene. Within the gene sequence 9 exons are present. C7orf57 has been linked to lupus, pancreatic cancer sporadic amyotrophic lateral sclerosis. and gastrointestinal toxicity

Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.

<span class="mw-page-title-main">TMEM169</span> Gene

Transmembrane protein 169 (TMEM169) in humans is encoded by TMEM169 gene. The aliases of TMEM169 include FLJ34263, DKFZp781L2456, and LOC92691. TMEM169 has the highest expression in the brain, particularly the fetal brain. TMEM169 has homologs mammals, reptiles, amphibians, birds, fish, chordates and invertebrates. The most distantly related homolog of TMEM169 is Anopheles albimanus.

<span class="mw-page-title-main">C12orf29</span>

C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.

<span class="mw-page-title-main">CXorf65</span> Gene on human chromosome X

Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000183631 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000037086 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: PRR32 proline rich 32".
  6. Shtilbans A, Choi SG, Fowkes ME, Khitrov G, Shahbazi M, Ting J, Zhang W, Sun Y, Sealfon SC, Lange DJ (July 2011). "Differential gene expression in patients with amyotrophic lateral sclerosis". Amyotrophic Lateral Sclerosis. 12 (4): 250–6. doi:10.3109/17482968.2011.560946. PMID   21375368. S2CID   11600103.
  7. "PRR32 Gene - GeneCards | PRR32 Protein | PRR32 Antibody".
  8. "PRR32 proline rich 32 [Homo sapiens (human)] - Gene - NCBI".
  9. "PRR32 proline rich 32 [Homo sapiens (human)] - Gene - NCBI".
  10. "Home - GEO - NCBI".
  11. "Compartments - Prr32".
  12. "ExPASy: SIB Bioinformatics Resource Portal - Home".
  13. "ExPASy: SIB Bioinformatics Resource Portal - Home".