Pentatricopeptide repeat

Last updated
Pentatricopeptide repeat
Identifiers
SymbolPPR
Pfam PF01535
Pfam clan CL0020
InterPro IPR002885
PROSITE PS51375
Available protein structures:
Pfam   structures / ECOD  
PDB RCSB PDB; PDBe; PDBj
PDBsum structure summary

The pentatricopeptide repeat (PPR) is a 35-amino acid sequence motif. Pentatricopeptide-repeat-containing proteins are a family of proteins commonly found in the plant kingdom. They are distinguished by the presence of tandem degenerate PPR motifs [1] and by the relative lack of introns in the genes coding for them. [2]

Approximately 450 such proteins have been identified in the Arabidopsis genome, and another 477 in the rice genome. [3] Despite the large size of the protein family, genetic data suggest that there is little or no redundancy of function between the PPR proteins in Arabidopsis. [2]

The purpose of PPR proteins is currently under dispute. It has been shown that a good deal of those in Arabidopsis interact (often essentially) with mitochondria and other organelles [2] and that they are possibly involved in RNA editing. [4] However many trans proteins are required for this editing to occur and research continues to look at which proteins are needed. [5]

The structure of the PPR has been resolved. It folds into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat. Several repeats of the protein forms a ring around a single-strand RNA molecule in a sequence-sensitive way reminiscent of TAL effectors. [6]

Examples

Human genes encoding proteins containing this repeat include:

Related Research Articles

<span class="mw-page-title-main">Symbiogenesis</span> Evolutionary theory holding that eukaryotic organelles evolved through symbiosis with prokaryotes

Symbiogenesis is the leading evolutionary theory of the origin of eukaryotic cells from prokaryotic organisms. The theory holds that mitochondria, plastids such as chloroplasts, and possibly other organelles of eukaryotic cells are descended from formerly free-living prokaryotes taken one inside the other in endosymbiosis. Mitochondria appear to be phylogenetically related to Rickettsiales bacteria, while chloroplasts are thought to be related to cyanobacteria.

<span class="mw-page-title-main">RNA editing</span> Molecular process

RNA editing is a molecular process through which some cells can make discrete changes to specific nucleotide sequences within an RNA molecule after it has been generated by RNA polymerase. It occurs in all living organisms and is one of the most evolutionarily conserved properties of RNAs. RNA editing may include the insertion, deletion, and base substitution of nucleotides within the RNA molecule. RNA editing is relatively rare, with common forms of RNA processing not usually considered as editing. It can affect the activity, localization as well as stability of RNAs, and has been linked with human diseases.

<span class="mw-page-title-main">F-box protein</span> Protein containing at least one F-box domain

F-box proteins are proteins containing at least one F-box domain. The first identified F-box protein is one of three components of the SCF complex, which mediates ubiquitination of proteins targeted for degradation by the 26S proteasome.

<span class="mw-page-title-main">Small nucleolar RNA SNORD34</span>

In molecular biology, snoRNA U34 is a non-coding RNA (ncRNA) molecule which functions in the modification of other small nuclear RNAs (snRNAs). This type of modifying RNA is usually located in the nucleolus of the eukaryotic cell which is a major site of snRNA biogenesis. It is known as a small nucleolar RNA (snoRNA) and also often referred to as a guide RNA.

<span class="mw-page-title-main">CUGBP1</span> Protein-coding gene in the species Homo sapiens

CUG triplet repeat, RNA binding protein 1, also known as CUGBP1, is a protein which in humans is encoded by the CUGBP1 gene.

<span class="mw-page-title-main">CUGBP2</span> Protein-coding gene in the species Homo sapiens

CUGBP, Elav-like family member 2, also known as Etr-3 is a protein that in humans is encoded by the CELF2 gene.

<span class="mw-page-title-main">Transcription elongation regulator 1</span> Protein-coding gene in the species Homo sapiens

Transcription elongation regulator 1, also known as TCERG1, is a protein which in humans is encoded by the TCERG1 gene.

<span class="mw-page-title-main">40S ribosomal protein S16</span> Protein-coding gene in the species Homo sapiens

40S ribosomal protein S16' is a protein that in humans is encoded by the RPS16 gene.

<span class="mw-page-title-main">60S ribosomal protein L14</span> Protein found in humans

60S ribosomal protein L14 is a protein that in humans is encoded by the RPL14 gene.

<span class="mw-page-title-main">60S ribosomal protein L12</span> Protein found in humans

60S ribosomal protein L12 is a protein that in humans is encoded by the RPL12 gene.

<span class="mw-page-title-main">60S ribosomal protein L28</span> Protein found in humans

60S ribosomal protein L28 is a protein that in humans is encoded by the RPL28 gene.

<span class="mw-page-title-main">60S ribosomal protein L13a</span> Protein found in humans

60S ribosomal protein L13a is a protein that in humans is encoded by the RPL13A gene.

<span class="mw-page-title-main">LRPPRC</span> Protein-coding gene in the species Homo sapiens

Leucine-rich PPR motif-containing protein, mitochondrial is a protein that in humans is encoded by the LRPPRC gene. Transcripts ranging in size from 4.8 to 7.0 kb which result from alternative polyadenylation have been reported for this gene.

<span class="mw-page-title-main">40S ribosomal protein S8</span> Protein-coding gene in the species Homo sapiens

40S ribosomal protein S8 is a protein that in humans is encoded by the RPS8 gene.

<span class="mw-page-title-main">PLRG1</span> Protein-coding gene in the species Homo sapiens

Pleiotropic regulator 1 is a protein that in humans is encoded by the PLRG1 gene.

<span class="mw-page-title-main">60S ribosomal protein L37</span> Protein found in humans

60S ribosomal protein L37 is a protein that in humans is encoded by the RPL37 gene.

<span class="mw-page-title-main">Chloroplast DNA</span> DNA located in cellular organelles called chloroplasts

Chloroplast DNA (cpDNA) is the DNA located in chloroplasts, which are photosynthetic organelles located within the cells of some eukaryotic organisms. Chloroplasts, like other types of plastid, contain a genome separate from that in the cell nucleus. The existence of chloroplast DNA was identified biochemically in 1959, and confirmed by electron microscopy in 1962. The discoveries that the chloroplast contains ribosomes and performs protein synthesis revealed that the chloroplast is genetically semi-autonomous. The first complete chloroplast genome sequences were published in 1986, Nicotiana tabacum (tobacco) by Sugiura and colleagues and Marchantia polymorpha (liverwort) by Ozeki et al. Since then, a great number of chloroplast DNAs from various species have been sequenced.

Alice Barkan is an American molecular biologist and a professor of biology at the University of Oregon. She is known for her work on chloroplast gene regulation and protein synthesis.

Jian-Kang Zhu is a plant scientist, researcher and academic. He is a Senior Principal Investigator in the Shanghai Center for Plant Stress Biology, Chinese Academy of Sciences (CAS). He is also the Academic Director of CAS Center of Excellence in Plant Sciences.

<span class="mw-page-title-main">Maureen Hanson</span> American molecular biologist

Maureen Hanson is an American molecular biologist and Liberty Hyde Bailey Professor in the Department of Molecular Biology and Genetics at Cornell University in Ithaca, New York. She is a joint member of the Section of Plant Biology and Director of the Center for Enervating Neuroimmune Disease. Her research concerns gene expression in chloroplasts and mitochondria, photosynthesis, and the molecular basis of the disease Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS).

References

  1. Mingler MK, Hingst AM, Clement SL, Yu LE, Reifur L, Koslowsky DJ (November 2006). "Identification of pentatricopeptide repeat proteins in Trypanosoma brucei". Mol. Biochem. Parasitol. 150 (1): 37–45. doi:10.1016/j.molbiopara.2006.06.006. PMID   16837079.
  2. 1 2 3 Lurin C, Andrés C, Aubourg S, Bellaoui M, Bitton F, Bruyère C, Caboche M, Debast C, Gualberto J, Hoffmann B, Lecharny A, Le Ret M, Martin-Magniette ML, Mireau H, Peeters N, Renou JP, Szurek B, Taconnat L, Small I (August 2004). "Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis". Plant Cell. 16 (8): 2089–103. doi:10.1105/tpc.104.022236. PMC   519200 . PMID   15269332.
  3. O'Toole N, Hattori M, Andres C, Iida K, Lurin C, Schmitz-Linneweber C, Sugita M, Small I (June 2008). "On the expansion of the pentatricopeptide repeat gene family in plants". Mol. Biol. Evol. 25 (6): 1120–8. doi: 10.1093/molbev/msn057 . PMID   18343892.
  4. Kotera E, Tasaka M, Shikanai T (January 2005). "A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts". Nature. 433 (7023): 326–30. Bibcode:2005Natur.433..326K. doi:10.1038/nature03229. PMID   15662426. S2CID   4416316.
  5. Takenaka M, Verbitskiy D, Zehrmann A, Brennicke A (June 2010). "Reverse genetic screening identifies five E-class PPR-proteins involved in RNA editing in mitochondria of Arabidopsis Thaliana". J Biol Chem. 285 (35): 27122–27129. doi: 10.1074/jbc.M110.128611 . PMC   2930711 . PMID   20566637.
  6. Yin P, Li Q, Yan C, Liu Y, Liu J, Yu F, et al. (December 2013). "Structural basis for the modular recognition of single-stranded RNA by PPR proteins". Nature. 504 (7478): 168–71. Bibcode:2013Natur.504..168Y. doi:10.1038/nature12651. PMID   24162847. S2CID   4471801.