GP20 RNA motif

Last updated
GP20-b
RF03003.svg
Consensus secondary structure and sequence conservation of GP20-b RNA
Identifiers
SymbolGP20-b
Rfam RF03003
Other data
RNA type Cis-reg
SO SO:0005836
PDB structures PDBe

The GP20 RNA motif is a conserved RNA structure that was discovered by bioinformatics. [1] THE GP20 motif is subdivided into two related motifs, called GP20-a and GP20-b. GP20-a RNAs are found exclusively in metagenomic sequences isolated from the gut, while GP20-b RNAs are found in bacteria classified as Clostridia. Both motifs share similar, but not identical secondary structures.

GP20 motif RNAs likely function as cis-regulatory elements, in view of their positions upstream of protein-coding genes. Specifically, they are consistently upstream of GP20 genes, whose function is unknown. GP20 are usually found in phages, suggesting that GP20 RNAs are also part of phages. Indeed, one GP20-b RNA was observed in a sequenced phage: Streptococcus phage EJ-1.

Related Research Articles

<span class="mw-page-title-main">Lambda phage</span> Bacteriophage that infects Escherichia coli

Enterobacteria phage λ is a bacterial virus, or bacteriophage, that infects the bacterial species Escherichia coli. It was discovered by Esther Lederberg in 1950. The wild type of this virus has a temperate life cycle that allows it to either reside within the genome of its host through lysogeny or enter into a lytic phase, during which it kills and lyses the cell to produce offspring. Lambda strains, mutated at specific sites, are unable to lysogenize cells; instead, they grow and enter the lytic cycle after superinfecting an already lysogenized cell.

<span class="mw-page-title-main">Bacteriophage MS2</span> Species of virus

Bacteriophage MS2, commonly called MS2, is an icosahedral, positive-sense single-stranded RNA virus that infects the bacterium Escherichia coli and other members of the Enterobacteriaceae. MS2 is a member of a family of closely related bacterial viruses that includes bacteriophage f2, bacteriophage Qβ, R17, and GA.

Antitermination is the prokaryotic cell's aid to fix premature termination of RNA synthesis during the transcription of RNA. It occurs when the RNA polymerase ignores the termination signal and continues elongating its transcript until a second signal is reached. Antitermination provides a mechanism whereby one or more genes at the end of an operon can be switched either on or off, depending on the polymerase either recognizing or not recognizing the termination signal.

fis E. coli gene

fis is an E. coli gene encoding the Fis protein. The regulation of this gene is more complex than most other genes in the E. coli genome, as Fis is an important protein which regulates expression of other genes. It is supposed that fis is regulated by H-NS, IHF and CRP. It also regulates its own expression (autoregulation). Fis is one of the most abundant DNA binding proteins in Escherichia coli under nutrient-rich growth conditions.

<span class="mw-page-title-main">YkoK leader</span>

The Ykok leader or M-box is a Mg2+-sensing RNA structure that controls the expression of Magnesium ion transport proteins in bacteria. It is a distinct structure to the Magnesium responsive RNA element.

c4 antisense RNA

The c4 antisense RNA is a non-coding RNA used by certain phages that infect bacteria. It was initially identified in the P1 and P7 phages of E. coli. The identification of c4 antisense RNAs solved the mystery of the mechanism for regulation of the ant gene, which is an anti-repressor.

<span class="mw-page-title-main">Downstream-peptide motif</span>

The Downstream-peptide motif refers to a conserved RNA structure identified by bioinformatics in the cyanobacterial genera Synechococcus and Prochlorococcus and one phage that infects such bacteria. It was also detected in marine samples of DNA from uncultivated bacteria, which are presumably other species of cyanobacteria.

The wcaG RNA motif is an RNA structure conserved in some bacteria that was detected by bioinformatics. wcaG RNAs are found in certain phages that infect cyanobacteria. Most known wcaG RNAs were found in sequences of DNA extracted from uncultivated marine bacteria. wcaG RNAs might function as cis-regulatory elements, in view of their consistent location in the possible 5' untranslated regions of genes. It was suggested the wcaG RNAs might further function as riboswitches.

<span class="mw-page-title-main">PhotoRC RNA motifs</span>

PhotoRC RNA motifs refer to conserved RNA structures that are associated with genes acting in the photosynthetic reaction centre of photosynthetic bacteria. Two such RNA classes were identified and called the PhotoRC-I and PhotoRC-II motifs. PhotoRC-I RNAs were detected in the genomes of some cyanobacteria. Although no PhotoRC-II RNA has been detected in cyanobacteria, one is found in the genome of a purified phage that infects cyanobacteria. Both PhotoRC-I and PhotoRC-II RNAs are present in sequences derived from DNA that was extracted from uncultivated marine bacteria.

<span class="mw-page-title-main">D12-methyl RNA motif</span>

The D12-methyl RNA motif is a conserved RNA structure that was discovered by bioinformatics. D12-methyl motifs are found in metagenomic DNA samples, and have not yet been found in a classified organism.

<span class="mw-page-title-main">DUF2693 RNA motif</span>

The DUF2693 RNA motif is a conserved RNA structure that was discovered by bioinformatics. DUF2693 motif RNAs are found in Porphyromonas.

<span class="mw-page-title-main">DUF2800 RNA motif</span>

The DUF2800 RNA motif is a conserved RNA structure that was discovered by bioinformatics. DUF2800 motif RNAs are found in Bacillota. DUF2800 RNAs are also predicted in the phyla Actinomycetota and Synergistota, although these RNAs are likely the result of recent horizontal gene transfer or conceivably sequence contamination.

<span class="mw-page-title-main">DUF2815 RNA motif</span>

The DUF2815 RNA motif is a conserved RNA structure that was discovered by bioinformatics. As of 2018, the DUF2815 motif has not been identified in any classified organism, but is known through metagenomic sequences isolated from environmental sources.

The freshwater-2 RNA motif is a conserved RNA structure that was discovered by bioinformatics. Freshwater-2 motif RNAs are found in metagenomic sequences that are isolated from aquatic and especially freshwater environments. As of 2018, no freshwater-2 RNA has been identified in a classified organism.

malK RNA motifs are conserved RNA structures that were discovered by bioinformatics. They are defined by being consistently located upstream of malK genes, which encode an ATPase that is used by transporters whose ligand is likely a kind of sugar. Most of these genes are annotated either as transporting maltose or glycerol-3-phosphate, however the substrate of the transporters associated with malK motif RNAs has not been experimentally determined. All known types of malK RNA motif are generally located nearby to the Shine-Dalgarno sequence of the downstream gene.

The mcrA RNA motif is a conserved RNA structure that was discovered by bioinformatics.

<span class="mw-page-title-main">Mu-gpT-DE RNA motif</span>

The Mu-like gpT Downstream Element RNA motif is a conserved RNA structure that was discovered by bioinformatics. The Mu-gpT-DE motif is only found in metagenomic sequences arising from unknown organisms.

<span class="mw-page-title-main">Proteo-phage-1 RNA motif</span>

The Proteo-phage-1 RNA motif is a conserved RNA structure that was discovered by bioinformatics. Energetically stable tetraloops often occur in this motif.

The sbcC RNA motif is a conserved RNA structure that was discovered by bioinformatics. The sbcC motif has, as of 2018, only been detected in metagenomic sequences, and the identities of organisms that contain these RNAs is unknown.

ssNA-helicase RNA motif

The ssNA-helicase RNA motif is a conserved RNA structure that was discovered by bioinformatics. Although the ssNA-helicase motif was published as an RNA candidate, there is some reason to suspect that it might function as a single-stranded DNA. In terms of secondary structure, RNA and DNA are difficult to distinguish when only sequence information is available.

References

  1. Weinberg Z, Lünse CE, Corbino KA, Ames TD, Nelson JW, Roth A, Perkins KR, Sherlock ME, Breaker RR (October 2017). "Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions". Nucleic Acids Res. 45 (18): 10811–10823. doi:10.1093/nar/gkx699. PMC   5737381 . PMID   28977401.