PRR16

Last updated

Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. [1] The protein is known by the alias Largen.

Contents

Gene

Locus

PRR16 is located on the long arm of chromosome 5. It is found at position 5q23.1. It has five known transcripts. [2]

Gene Neighborhood

Cytogenic band: 5q23.1

PRR16 Cytogenic Band.png

[3]

The image above shows chromosome five and the various gene locations on it. PRR16 can be seen at the thin red band that has been placed at q23.

Size

PRR16 spans 330,365 bases and encodes for a mRNA that is 1,707 nucleotides. The resulting protein is 304 amino acids long. [1]

mRNA

Splice Variants

PRR16 has five known splice variants, each with a different processed transcript. PRR16 Splice Variants Ensenbl.png [4]

Isoforms

There are three known isoforms of PRR16. Isoform 2 contains an additional exon in the 5' region, and it thus differs in the 5' UTR and initiates translation at an alternate start codon, compared to variant 1. Isoform 3 has two variants. The first contains an alternate 5' terminal exon, and it thus differs in the 5' UTR and initiates translation at a downstream in-frame start codon, compared to variant 1. The second contains alternate 5' exon structure, and it thus differs in the 5' UTR and initiates translation at a downstream in-frame start codon, compared to variant 1. All three isoforms are shorter at the N-terminus, compared to isoform 1. [1]

Protein

Structure

Primary Structure

The PRR16 protein is 304 amino acids in length. It has a molecular weight of 32.8 kDa and an isoelectric point of 8.09. [5] [6] The protein does not interact with the membrane.

Secondary Structure

a-helix from amino acid 38 to 52 of PRR16 determined by PHYRE2 PRR16 Alpha Helix.jpg
α-helix from amino acid 38 to 52 of PRR16 determined by PHYRE2

The only predictable feature of the PRR16 protein is an α-helix near the N-terminus, spanning about thirty amino acids. The remainder of the protein has a disordered structure. [7]

Tertiary Structure

This structure was predicted by analyzing the amino acid sequence using I-TASSER. The final result can be seen below. [8]

I-TASSER software was used to predict the tertiary structure of PRR16. The blue coils indicate the presence of a-helices, while the remainder is relatively disordered. PRR16 Model.jpg
I-TASSER software was used to predict the tertiary structure of PRR16. The blue coils indicate the presence of α-helices, while the remainder is relatively disordered.

Post-Translational Modifications

Subcellular Location

The k-NN tool suggests the location of PRR16 in the nucleus of the cell with 52.2% certainty. The cytoplasm was predicted with 30.4% certainty, The following locations were predicted with 4.3% certainty: cytoskeleton, plasma membrane, mitochondria, and peroxisome. [14]

Expression

The PRR16 gene is expressed at very low levels throughout the body. It is expressed in the skeletal muscle, heart, lung, skin, portions of the brain, and bone marrow. [15]

Interacting Proteins

ProteinFunction [16] Tool
Abelson-interactor 2 (ABI2)Regulation of cell growth and actin filament reorganizationBiogrid, [17] IntAct, [18] Mentha [19]
Amyloid precursor protein (APP)Caspase activation and degeneration of both neuronal cell bodiesBiogrid, InnateDB, [20] Mentha
Death-associated protein kinase 1 (DAPK1)Positive mediator of programmed cell deathBiogrid, IntAct
Neural precursor cell expressed developmentally downregulated protein 4-like (NEDD4L)E3 ubiquitin-protein ligase Biogrid, InnateDB, Mentha
Neural precursor cell expressed developmentally downregulated protein 4 (NEDD4)E3 ubiquitin-protein ligaseBiogrid, InnateDB
Nucleotide-binding oligomerization domain-containing protein 2 (NOD2)Involved in gastrointestinal immunityBiogrid
Protein phosphatase 1 catalytic subunit alpha (PPP1CA)Cell division, regulation of glycogen metabolism, muscle contractility, and protein synthesisBiogrid, InnateDB, IntAct, Mentha
Protein phosphatase 1 catalytic subunit gamma (PPP1CC)Cell division, regulation of glycogen metabolism, muscle contractility, and protein synthesisBiogrid, InnateDB, Mentha
SMAD ubiquitin regulatory factor 1 (SMURF1)E3 ubiquitin-protein ligaseBiogrid, Mentha

Homology

Paralogs

Unrooted phylogenetic tree of vertebrate and invertebrate PRR16 orthologs. All abbreviations are the first two letters of genus name and first letter of species name. Tree made with a Neighbor-Joining method using a ClustalW-formatted set of sequences as input. PRR16 Tree.png
Unrooted phylogenetic tree of vertebrate and invertebrate PRR16 orthologs. All abbreviations are the first two letters of genus name and first letter of species name. Tree made with a Neighbor-Joining method using a ClustalW-formatted set of sequences as input.

There are two isoforms of Inhibitory synaptic factor 1 that are known paralogs of PRR16.

Orthologs

PRR16 is found in all classes of vertebrates, including mammals, birds, fish, reptiles and amphibians. The most distant ortholog of PRR16 is in Branchiostoma belcheri and Branchiostoma floridae, which diverged an estimated 684 million years ago. [22] The gene has not been found in any plants, fungi or single celled organisms. The table below compares the known orthologs. [23]

OrganismCommon NameClassAccession NumberSequence IdentitySequence Similarity
Homo sapiens Human Mammalia NP_001287712.1100%100%
Pan paniscus BonoboMammaliaXP_003826752.199%100%
Mus musculus House mouseMammaliaNP_001074693.193%95%
Ochotona princeps American pikaMammaliaXP_004586413.191%94%
Vombatus ursinus Common wombatMammaliaXP_027730777.187%81%
Podarcis muralis Common wall lizard Reptilia XP_028604016.188%91%
Alligator sinensis Chinese alligatorReptiliaXP_006033732.186%89%
Chrysemys picta bellii Painted turtleReptiliaXP_005305197.188%91%
Pogona vitticeps Central bearded dragonReptiliaXP_020659290.184%86%
Pseudonaja textilis Eastern brown snakeReptiliaXP_026576168.181%84%
Gallus gallus Chicken Aves XP_001232593.381%85%
Columba livia Rock pigeonAvesXP_005506281.185%89%
Haliaeetus leucocephalus Bald eagleAvesXP_010560635.182%86%
Empidonax traillii Willow flycatcherAvesXP_027735123.182%86%
Nanorana parkeri High Himalaya frog Amphibia XP_018426570.171%79%
Xenopus tropicalis Western clawed frogAmphibiaXP_017946181.172%79%
Lepisosteus oculatus Spotted gar fish Actinopterygii XP_006626913.165%77%
Callorhinchus milii Australian ghostshark Chondrichthyes XP_007897515.164%75%
Branchiostoma belcheri Lancelet Amphioxiformes XP_019614579.147%78%
Branchiostoma floridae LanceletAmphioxiformesXP_002601582.147%78%

Related Research Articles

YIF1A

Protein YIF1A is a protein that in humans is encoded by the YIF1A gene.

Interferon-inducible GTPase 5

Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.

Proline-rich 12

Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.

Transmembrane protein 241 is a ubiquitous sugar transporter protein which in humans is encoded by the TMEM241 gene.

PRR29

PRR29 is a protein located on human chromosome 17 that in humans is encoded by the PRR29 gene.

C16orf46 Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

TMEM44

TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.

C2orf81 is a human gene encoding protein c2orf81, which is predicted to have nuclear localization.

TEX9

Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.

C4orf51

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

CFAP299

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

TMEM155

Transmembrane protein 155 is a protein that in humans is encoded by the TMEM155 gene. It is located on human chromosome 4, spanning 6,497 bases. It is also referred to as FLJ30834 and LOC132332. This protein is known to be expressed mainly in the brain, placenta, and lymph nodes and is conserved throughout most placental mammals. The function and structure of this protein is still not well understood, but its level of expression has been studied pertaining to various pathologies.

C1orf122

C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.

C17orf78

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

LSMEM2

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

FAM120AOS

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

FAM166C

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19-85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

Proline-rich protein 29, encoded by the PRR29 gene in humans, is a protein which is located in the human genome at 17q23. Its function is not fully understood. Its name is derived from the chain of 5 proline amino acids located toward the end of the protein. The primary domain within the sequence of this protein is known as DUF4587. It is reported to have high levels of expression in tissues pertaining to the circulatory system and the immune system. It is hypothesized that PRR29 is a nuclear protein that facilitates communication between the nucleus and the mitochondria.

C5orf22

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

References

  1. 1 2 3 "PRR16 proline rich 16 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-06-17.
  2. "Gene: PRR16 (ENSG00000184838) - Summary - Homo sapiens - Ensembl genome browser 97". useast.ensembl.org. Retrieved 2019-07-30.
  3. "PRR16 Gene". www.genecards.org. Retrieved 2019-07-30.
  4. "Ensembl genome browser 78: Homo sapiens - Summary - Gene: PRR16 (ENSG00000184838)". dec2014.archive.ensembl.org. Retrieved 2019-07-30.
  5. "SAPS Results". www.ebi.ac.uk. Retrieved 2019-07-30.
  6. "CALCULATION OF PROTEIN ISOELECTRIC POINT". isoelectric.org. Retrieved 2019-07-30.
  7. "DisEMBL 1.5 - Predictors of intrinsic protein disorder". dis.embl.de. Retrieved 2019-08-01.
  8. "I-TASSER results". zhanglab.ccmb.med.umich.edu. Retrieved 2019-08-01.
  9. "NetGlycate 1.0 Server - prediction results". www.cbs.dtu.dk. Retrieved 2019-08-04.
  10. "NetNES 1.1 Server - prediction results". www.cbs.dtu.dk. Retrieved 2019-08-04.
  11. "NetOGlyc 4.0 Server - prediction results". www.cbs.dtu.dk. Retrieved 2019-08-04.
  12. "NetPhos 3.1 Server - prediction results". www.cbs.dtu.dk. Retrieved 2019-08-04.
  13. "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Retrieved 2019-08-04.
  14. "PSORT II Prediction". psort.hgc.jp. Retrieved 2019-08-01.
  15. "GDS3113 / 202795". www.ncbi.nlm.nih.gov. Retrieved 2019-08-01.
  16. "STRING: functional protein association networks". string-db.org. Retrieved 2019-08-04.
  17. "PRR16 Result Summary | BioGRID". thebiogrid.org. Retrieved 2019-08-04.
  18. "PRR16 Interactions".
  19. "Results - mentha: the interactome browser". mentha.uniroma2.it. Retrieved 2019-08-04.
  20. al, David Lynn et. "InnateDB: Systems Biology of the Innate Immune Response". www.innatedb.com. Retrieved 2019-08-04.
  21. "Multiple Sequence Alignment - CLUSTALW". www.genome.jp. Retrieved 2019-07-03.
  22. "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2019-07-01.
  23. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2019-08-01.

Attribution: Contains public domain text from https://www.ncbi.nlm.nih.gov/gene/51334