Escherichia coli contains a number of small RNAs located in intergenic regions of its genome. The presence of at least 55 of these has been verified experimentally. [1] 275 potential sRNA-encoding loci were identified computationally using the QRNA program. These loci will include false positives, so the number of sRNA genes in E. coli is likely to be less than 275. [2] A computational screen based on promoter sequences recognised by the sigma factor sigma 70 and on Rho-independent terminators predicted 24 putative sRNA genes, 14 of which were verified experimentally by northern blotting. The experimentally verified sRNAs included the well-characterised sRNAs RprA and RyhB. Many of the sRNAs identified in this screen, including RprA, RyhB, SraB and SraL, are only expressed in the stationary phase of bacterial cell growth. [3] A screen for sRNA genes based on homology to Salmonella and Klebsiella identified 59 candidate sRNA genes. From this set of candidate genes, microarray analysis and northern blotting confirmed the existence of 17 previously undescribed sRNAs, many of which bind to the chaperone protein Hfq and regulate the translation of RpoS (Sigma 38). [4] UptR sRNA, transcribed from the uptR gene, is implicated in suppressing extracytoplasmic toxicity by reducing the amount of membrane-bound toxic hybrid protein. [5]
A cell motility-enhancing sRNA, named Esr41, was discovered in an intergenic region of pathogenic enterohemorrhagic E. coli (EHEC) O157:H7 Sakai. The Esr41 sequence is not present in nonpathogenic E. coli K12, but the sRNA can induce cell motility in K12 as well, suggesting that the target genes controlled by Esr41 are present in both strains. [6]
The trans-encoded small RNA RalA has 16 nucleotides complementary to the coding region of the toxin RalR mRNA. RalA functions as an antitoxin by preventing translation of RalR (a non-specific endonuclease that cleaves methylated and unmethylated DNA). Its activity requires the RNA chaperone Hfq. RalR and RalA form a type I toxin-antitoxin (TA) system. The RalR/RalA TA locus is responsible for resistance to the antibiotic fosfomycin in E. coli. [7]
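As an illustration of what a stretch of complementary nucleotides between an antisense RNA and its target means, the following minimal sketch finds the longest contiguous run of Watson-Crick pairs between two antiparallel RNA strands. The function names and the short example sequences are hypothetical and are not the RalA or RalR sequences; G-U wobble pairs and gapped duplexes are deliberately ignored.

```python
# Watson-Crick complement table for RNA (G-U wobble pairs are not considered).
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Return the reverse complement of an RNA string written 5'->3'."""
    return rna.translate(COMPLEMENT)[::-1]


def longest_complementary_stretch(srna, mrna):
    """Length of the longest contiguous antiparallel Watson-Crick duplex.

    Implemented as a longest-common-substring search between the mRNA
    and the reverse complement of the sRNA (dynamic programming).
    """
    target = reverse_complement(srna)
    best = 0
    prev = [0] * (len(mrna) + 1)
    for i in range(1, len(target) + 1):
        cur = [0] * (len(mrna) + 1)
        for j in range(1, len(mrna) + 1):
            if target[i - 1] == mrna[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


# Hypothetical toy sequences, for illustration only.
toy_srna = "CCAAUGGCAUCC"
toy_mrna = "ACAUGCCAUUAGCGA"
print(longest_complementary_stretch(toy_srna, toy_mrna))  # prints 8
```

Real target prediction tools also weigh duplex stability and site accessibility; this sketch only counts perfectly paired positions.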
Deep sequencing of RNA expressed during chemical stress and high cell density fermentation discovered 253 novel intergenic transcripts, adding to the roughly 200 intergenic sRNAs previously described in E. coli. Several of the sRNAs exhibited specific expression patterns during high cell density fermentation and were differentially expressed in the presence of multiple chemicals, suggesting they may play roles during stress conditions. The novel sRNAs showing differential expression in several stress conditions were: ES003, ES036, ES056, ES098, ES173, ES180, ES205, ES220, ES222, ES239. [8]
EsrE sRNA, for "essential small RNA in E. coli", is located in the 3′ moiety of the yigP gene [9] (also known as ubiJ), which is involved in coenzyme Q8 biosynthesis in Escherichia coli and Salmonella enterica serovar Typhimurium.
AgrB antisense RNA (arsR-gor region gene B) is transcribed opposite to dinQ, which is translated into a toxic single-transmembrane peptide, and shares 30 complementary nucleotides with it. AgrB appears to repress accumulation of dinQ by RNA interference and counteracts its toxicity. [10]
fis is an E. coli gene encoding the Fis protein. The regulation of this gene is more complex than that of most other genes in the E. coli genome, as Fis is an important protein which regulates the expression of other genes. fis is thought to be regulated by H-NS, IHF and CRP. Fis also regulates its own expression (autoregulation). Fis is one of the most abundant DNA-binding proteins in Escherichia coli under nutrient-rich growth conditions.
The 245 nucleotide sRNA of Escherichia coli, CsrC, was discovered using a genetic screen for factors that regulate glycogen biosynthesis. CsrC RNA binds multiple copies of CsrA, a protein that post-transcriptionally regulates central carbon flux, biofilm formation and motility in E. coli. CsrC antagonises the regulatory effects of CsrA, presumably by sequestering this protein. The discovery of CsrC is intriguing, in that a similar sRNA, CsrB, performs essentially the same function. Both sRNAs possess similar imperfect repeat sequences, primarily localised in the loops of predicted hairpins, which may serve as CsrA binding elements. Transcription of csrC increases as the culture approaches the stationary phase of growth and is indirectly activated by CsrA via the response regulator UvrY [1]. This RNA was also discovered in E. coli during a large-scale screen [2]. The RNA, called SraK in that screen, was highly abundant in stationary phase, but low levels could be detected in exponentially growing cells as well [2].
The SraC/RyeA RNA is a non-coding RNA that was discovered in E. coli during two large-scale screens for RNAs. The function of this RNA is currently unknown. This RNA overlaps the SdsR/RyeB RNA on the opposite strand, suggesting that the two RNAs may act in a concerted manner.
The OmrA-B RNA gene family is a pair of homologous OmpR-regulated small non-coding RNAs that were discovered in E. coli during two large-scale screens. OmrA-B is highly abundant in stationary phase, but low levels could be detected in exponentially growing cells as well. RygB is adjacent to RygA, a closely related RNA. These RNAs bind to the Hfq protein and regulate gene expression by antisense binding. They negatively regulate the expression of several genes encoding outer membrane proteins, including cirA, CsgD, fecA, fepA and ompT, by binding in the vicinity of the Shine-Dalgarno sequence, suggesting the control of these targets is dependent on Hfq protein and RNase E. Taken together, these data suggest that OmrA-B participates in the regulation of outer membrane composition, responding to environmental conditions.
Sib RNA refers to a group of related non-coding RNAs. They were originally named QUAD RNA after they were discovered as four repeat elements in Escherichia coli intergenic regions. The family was later renamed Sib when it was discovered that the number of repeats is variable in other species and in other E. coli strains.
RyhB RNA is a 90 nucleotide RNA that down-regulates a set of iron-storage and iron-using proteins when iron is limiting; it is itself negatively regulated by the ferric uptake repressor protein, Fur.
In molecular biology the ArcZ RNA is a small non-coding RNA (ncRNA). It is the functional product of a gene which is not translated into protein. ArcZ is an Hfq binding RNA that functions as an antisense regulator of a number of protein coding genes.
GlmZ is a small non-coding RNA (ncRNA). It is the functional product of a gene which is not translated into protein.
The GlmY RNA family consists of a number of bacterial RNA genes of around 167 bases in length. The GlmY RNA gene is present in Escherichia coli, Shigella flexneri, Yersinia pestis and Salmonella species, where it is found between the yfhK and purL genes. It was originally predicted in a bioinformatic screen for novel ncRNAs in E. coli.
In a screen of the Bacillus subtilis genome for genes encoding ncRNAs, Saito et al. focused on 123 intergenic regions (IGRs) over 500 base pairs in length and analyzed expression from these regions. Seven IGRs, termed bsrC, bsrD, bsrE, bsrF, bsrG, bsrH and bsrI, expressed RNAs smaller than 380 nt. All the small RNAs except BsrD RNA were expressed in transformed Escherichia coli cells harboring a plasmid with PCR-amplified IGRs of B. subtilis, indicating that their own promoters independently express small RNAs. Under non-stressed conditions, depletion of the genes for the small RNAs did not affect growth. Although their functions are unknown, gene expression profiles at several time points showed that most of the genes, except for bsrD, were expressed during the vegetative phase but were undetectable during the stationary phase. Mapping the 5' ends of the 6 small RNAs revealed that the genes for BsrE, BsrF, BsrG, BsrH, and BsrI RNAs are preceded by a recognition site for RNA polymerase sigma factor σA.
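The first step of such a screen, selecting intergenic regions above a length cut-off, can be sketched as follows. This is a minimal illustration, not the procedure used by Saito et al.: it assumes genes are supplied as 1-based inclusive (start, end) coordinates on a linear sequence, it ignores strand and genome circularity, and the function name and example coordinates are hypothetical.

```python
def intergenic_regions(genes, min_length=500):
    """Return (start, end) gaps between annotated genes longer than min_length.

    genes: iterable of (start, end) pairs, 1-based inclusive coordinates.
    Overlapping or nested genes are merged before gaps are measured.
    """
    gaps = []
    covered_until = None  # right-most coordinate covered by any gene so far
    for start, end in sorted(genes):
        if covered_until is not None and start > covered_until + 1:
            gap_start, gap_end = covered_until + 1, start - 1
            # Keep only gaps strictly longer than the cut-off ("over 500 bp").
            if gap_end - gap_start + 1 > min_length:
                gaps.append((gap_start, gap_end))
        covered_until = end if covered_until is None else max(covered_until, end)
    return gaps


# Hypothetical gene coordinates, for illustration only.
example_genes = [(1, 900), (1000, 2000), (2700, 3500), (3400, 3600), (4500, 5000)]
print(intergenic_regions(example_genes))  # prints [(2001, 2699), (3601, 4499)]
```

In a real analysis the coordinates would come from a genome annotation file, and each retained region would then be tested for expression experimentally.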
Bacterial small RNAs are small RNAs produced by bacteria; they are 50- to 500-nucleotide non-coding RNA molecules, highly structured and containing several stem-loops. Numerous sRNAs have been identified using both computational analysis and laboratory-based techniques such as Northern blotting, microarrays and RNA-Seq in a number of bacterial species including Escherichia coli, the model pathogen Salmonella, the nitrogen-fixing alphaproteobacterium Sinorhizobium meliloti, marine cyanobacteria, Francisella tularensis, Streptococcus pyogenes, the pathogen Staphylococcus aureus, and the plant pathogen Xanthomonas oryzae pathovar oryzae. Bacterial sRNAs affect how genes are expressed within bacterial cells via interaction with mRNA or protein, and thus can affect a variety of bacterial functions like metabolism, virulence, environmental stress response, and structure.
The TisB-IstR toxin-antitoxin system is the first known toxin-antitoxin system to be induced by the SOS response to DNA damage.
A toxin-antitoxin system consists of a "toxin" and a corresponding "antitoxin", usually encoded by closely linked genes. The toxin is usually a protein while the antitoxin can be a protein or an RNA. Toxin-antitoxin systems are widely distributed in prokaryotes, and organisms often have them in multiple copies. When these systems are contained on plasmids – transferable genetic elements – they ensure that only the daughter cells that inherit the plasmid survive after cell division. If the plasmid is absent in a daughter cell, the unstable antitoxin is degraded and the stable toxic protein kills the new cell; this is known as 'post-segregational killing' (PSK).
RdlD RNA is a family of small non-coding RNAs which repress the protein LdrD in a type I toxin-antitoxin system. It was discovered in Escherichia coli strain K-12 in a long direct repeat (LDR) named LDR-D. This locus encodes two products: a 35 amino acid peptide toxin (ldrD) and a 60 nucleotide RNA antitoxin. The 374 nt toxin mRNA has a half-life of around 30 minutes, while rdlD RNA has a half-life of only 2 minutes. This is in keeping with other type I toxin-antitoxin systems.
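The consequence of these two half-lives can be made concrete with a short calculation. The sketch below assumes simple first-order (exponential) decay and that synthesis of both RNAs stops at time zero; both are simplifying assumptions that are not stated in the source, and the function name is illustrative.

```python
def fraction_remaining(minutes, half_life_minutes):
    """Fraction of an RNA pool left after `minutes`, assuming first-order decay."""
    return 0.5 ** (minutes / half_life_minutes)


TOXIN_HALF_LIFE = 30.0     # ldrD toxin mRNA, in minutes (value from the text)
ANTITOXIN_HALF_LIFE = 2.0  # rdlD antitoxin RNA, in minutes (value from the text)

for t in (2, 10, 30):
    toxin = fraction_remaining(t, TOXIN_HALF_LIFE)
    antitoxin = fraction_remaining(t, ANTITOXIN_HALF_LIFE)
    print(f"{t:>2} min: toxin mRNA {toxin:.3f}, antitoxin RNA {antitoxin:.5f}")
```

Under these assumptions, about 79% of the toxin mRNA but only about 3% of the antitoxin RNA would remain after 10 minutes, which is why a stable toxin transcript paired with an unstable antitoxin can leave the toxin unopposed once synthesis ceases.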
The SymE-SymR toxin-antitoxin system consists of a small symbiotic endonuclease toxin, SymE, and a non-coding RNA antitoxin, SymR (symbiotic RNA), which inhibits SymE translation. SymE-SymR is a type I toxin-antitoxin system, and is under regulation by the antitoxin, SymR. The SymE-SymR complex is believed to play an important role in recycling damaged RNA and DNA. The relationship and corresponding structures of SymE and SymR provide insight into the mechanism of toxicity and overall role in prokaryotic systems.
The FlmA-FlmB toxin-antitoxin system consists of FlmB RNA, a family of non-coding RNAs, and the protein toxin FlmA. The FlmB RNA transcript is 100 nucleotides in length, is homologous to sok RNA from the hok/sok system, and fulfils the same function as part of a post-segregational killing (PSK) mechanism.
Mycobacterium tuberculosis contains at least nine small RNA families in its genome. The small RNA (sRNA) families were identified through RNomics – the direct analysis of RNA molecules isolated from cultures of Mycobacterium tuberculosis. The sRNAs were characterised through RACE mapping and Northern blot experiments. Secondary structures of the sRNAs were predicted using Mfold.
The TxpA/RatA toxin-antitoxin system was first identified in Bacillus subtilis. It consists of a non-coding 222 nt sRNA called RatA and a protein toxin named TxpA.
An RNA thermometer is a temperature-sensitive non-coding RNA molecule which regulates gene expression. RNA thermometers often regulate genes required during either a heat shock or cold shock response, but have been implicated in other regulatory roles such as in pathogenicity and starvation.
The SraL RNA, also known as RyjA, is a small non-coding RNA discovered in E. coli, and later in Salmonella Typhimurium. This ncRNA was found to be expressed only in stationary phase. It may play a role in Salmonella virulence. The major stationary phase regulator RpoS transcriptionally regulates SraL and directly binds to the sraL gene promoter. SraL down-regulates the expression of the ribosome-associated chaperone Trigger Factor (TF), which is involved in the folding of newly synthesised cytosolic proteins.