PRR32 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | PRR32 , CXorf64, proline rich 32, chromosome x open reading frame 64, Cxorf33 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1916050 HomoloGene: 90132 GeneCards: PRR32 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
PRR32 is a protein that in humans is encoded by the CXorf64 (Chromosome X open reading frame 64) gene. It was also found that the homologs of the PRR32 gene is conserved in chimpanzee, Rhesus monkey, dog, cow, mouse, and rat. It was also found through ncbi that 82 organisms have orthologs with human gene PRR323. [5]
PRR32 (CXorf64) seems to be involved with a group of genes over-expressed in ALS (Amyotrophic lateral sclerosis), evident from a study aiming to study gene expression patterns in muscles from patients with amyotrophic lateral sclerosis and multifocal motor neuropathy. [6]
The gene is located on Chromosome X, at position Xq25, and is 2023 bases long (significantly small when compared to other genes). Its location is at the antisense (+) strand. As shown in the graphic below, it is flanked by upstream by the DKAF12l1 shown in green on the left and downstream by LOC10. CXorf64 is the blue column shown in the middle. [7]
CXorf64 has 1 exon which ultimately forms 1 transcript variant. This gene is expressed most in muscle tissue. There are no known isoforms in which CXorf64 is expressed. [8]
Property | Cleaved Protein | Mature Protein |
---|---|---|
Amino Acid Length | 298 | 298 |
Isoelectric Point | 7.34 | ~7.2-7.4 |
Molecular Weight | 31.9 kDa | ~31.9-33 kDa |
Shown to the right is a predicted tertiary structure of the protein. It is marked by long alpha-helices localized to the end of the protein opposite the N- and C- terminal ends
According to microarray-assessed tissue expression analysis by NCBI GEO, the gene CXorf64 has average expression levels in most tissues save for, prostate, and heart, fat, and endometrium which have relatively low levels of expression (~25th percentile of tissue gene expression) . Almost every microarray study determined very low levels of gene expression across all tissues assessed . This evidence prompted further analysis with other tools. While no tissue analysis was present on Allen Brain Atlas or genepaint, the Human Protein Atlas suggested that the gene was more highly expressed in heart, prostate, and a and fat tissues .
It synthesizes in the cell cytoplasm and is predicted to localize in the mitochondria. It is predicted to localize in the nucleus, plasma membrane, extracellular, mitochondrion, peroxisome, and cytosol.
It was found that there are several post-translational modifications that are performed on PRR32. These include several N-linked glycosylation sites that were predicted with high confidence. Glycosylation is known to play a part in cell-cell adhesion ( `a mechanism employed by cells of the immune system) via sugar-binding proteins. The graph illustrates predicted N-glyc sites across the protein chain (x-axis represents protein length from N- to C-terminal). A position with a potential (vertical lines) crossing the threshold (horizontal line at 0.5) is predicted glycosylated. [12]
A list was generated of orthologs using the NCBI (protein) blast for PRR32, as well as UCSC’s BLAT (BLAST-Like Alignment Tool. These search and analyzing engines generated orthologs that are 80+long. This list was then narrowed down to a list of 30 different species. From those 30 species, they were divided into certain portions. The first to be selected were the primates since they are the closest related species when it comes to genetics. The similarity generated when compared to humans was approximately 95-99%. Then, using the edit and resubmit option on Blast (proteins), I excluded all primates in order to widen the species selection. The BLAST search again yielded many mammals including Rabbit, Dog, Ferret, Rhino, and many more. These species’ similarity percentage was anywhere between 69%-89%. Next, to further diversify my species selection, again using the same method, I excluded all mammals from the BLAST sequencing. This time, there were absolutely no matches in the inquiry. This action was repeated a few more times and it seems that the PRR32 gene is only relevant in mammals. The google spreadsheet created below lists the selection of species and all important information such as, order, year of divergence, etc. One thing to be noted is that the protein sequences for PRR32 were highly conserved amongst closely related species of Homo sapiens such as chimpanzees, gorillas and orangutans.
An experiment analyzed gene expression pattern in muscles from patients with amyotrophic lateral sclerosis (ALS) and multifocal motor neuropathy (MMN) compared to controls. Biopsied skeletal muscles from three ALS, three MMN and three control subjects had total RNA extracted and subjected to genome-wide gene expression analysis using Affymetrix GeneChip Exon 1.0 ST array. The most significant expression pattern differences were confirmed with RT-PCR in four additional ALS patients. Results showed that over 3000 genes were identified across the groups using q < 10%. Among 50 genes that were overexpressed only in the ALS group were: leucine-rich repeat kinase-2, follistatin, collagen type XIX alpha-1, ceramide kinase-like, sestrin-3 and CXorf64. No genes were significantly overexpressed in MMN alone. Underexpressed genes only in ALS included actinin αα3, fructose-1,6-bisphosphatase-2 and homeobox C10; whereas only in MMN: hemoglobin A1 and CXorf64. Ankyrin repeat domain-1 was overexpressed in both groups. Underexpressed genes in both groups included myosin light chain kinase-2, enolase-3 and 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase-1. Validation analysis using RT-PCR confirmed the data for leucine-rich repeat kinase-2, follistatin, collagen type XIX alpha-1, ceramide kinase-like, sestrin-3 and CXorf64. In conclusion, there is differential tissue-specific gene expression in patients with ALS relative to MMN and controls. Further studies are necessary to evaluate the identified genes in larger patient groups and different tissues.
Peripherin is a type III intermediate filament protein expressed mainly in neurons of the peripheral nervous system. It is also found in neurons of the central nervous system that have projections toward peripheral structures, such as spinal motor neurons. Its size, structure, and sequence/location of protein motifs is similar to other type III intermediate filament proteins such as desmin, vimentin and glial fibrillary acidic protein. Like these proteins, peripherin can self-assemble to form homopolymeric filamentous networks, but it can also heteropolymerize with neurofilaments in several neuronal types. This protein in humans is encoded by the PRPH gene. Peripherin is thought to play a role in neurite elongation during development and axonal regeneration after injury, but its exact function is unknown. It is also associated with some of the major neuropathologies that characterize amyotropic lateral sclerosis (ALS), but despite extensive research into how neurofilaments and peripherin contribute to ALS, their role in this disease is still unidentified.
Alsin is a protein that in humans is encoded by the ALS2 gene. ALS2 orthologs have been identified in all mammals for which complete genome data are available.
Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.
PRP36 is an extracellular protein in Homo sapiens that is encoded by the PRR36 gene that contains a domain of unknown function, DUF4596, towards the C terminus of the protein. The function of PRP36 is unknown, but high gene expression has been observed in various regions of the brain such as the prefrontal cortex, cerebellum, and the amygdala. PRP36 has one alias: Putative Uncharacterized Protein FLJ22184.
Shortage In Chiasmata 1, also known as SHOC1, is a protein that in humans is encoded by the SHOC1 gene.
CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).
FAM76A is a protein that in Homo sapiens is encoded by the FAM76A gene. Notable structural characteristics of FAM76A include an 83 amino acid coiled coil domain as well as a four amino acid poly-serine compositional bias. FAM76A is conserved in most chordates but it is not found in other deuterostrome phlya such as echinodermata, hemichordata, or xenacoelomorpha—suggesting that FAM76A arose sometime after chordates in the evolutionary lineage. Furthermore, FAM76A is not found in fungi, plants, archaea, or bacteria. FAM76A is predicted to localize to the nucleus and may play a role in regulating transcription.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
KIAA1211L is a protein that in humans is encoded by the KIAA1211L gene. It is highly expressed in the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, KIAA1211L is associated with certain mental disorders and various cancers.
Chromosome 3 Open Reading Frame 62 (C3orf62), is a protein that in humans is encoded by the C3orf62 gene. C3orf62 is a glycine depleted protein relative to the amount of glycine in proteins in the rest of the genome. C3orf62 has a KKXX-like motif and is predicted to be localized in the nucleus. Expression of C3orf62 remains highest in whole blood.
Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.
LOC101059915 is a protein, which in humans is encoded by the LOC101059915 gene. It is located on the X chromosome and has restricted expression in the testis.
Proline-rich basic protein 1(PROB1) is a protein encoded by the PROB1 gene located on human chromosome 5, open reading frame 65. PROB1 is also known as C5orf65 and weakly similar to basic proline-rich protein.
Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.
Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.
Chromosome 7 open reading frame 57 is an uncharacterized protein found in humans and several other homologs. It is encoded by the C7orf57 gene. This gene is found to be greatly expressed in the Fallopian tubes, testes, lungs, hippocampus, hypothalamus, and caudate. There are three isoforms of the gene. Within the gene sequence 9 exons are present. C7orf57 has been linked to lupus, pancreatic cancer sporadic amyotrophic lateral sclerosis. and gastrointestinal toxicity
Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.
Transmembrane protein 169 (TMEM169) in humans is encoded by TMEM169 gene. The aliases of TMEM169 include FLJ34263, DKFZp781L2456, and LOC92691. TMEM169 has the highest expression in the brain, particularly the fetal brain. TMEM169 has homologs mammals, reptiles, amphibians, birds, fish, chordates and invertebrates. The most distantly related homolog of TMEM169 is Anopheles albimanus.
C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.
Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa