Collinsella-1 RNA | |
---|---|
Identifiers | |
Symbol | Collinsella-1 |
Rfam | RF01700 |
Other data | |
RNA type | sRNA |
PDB structures | PDBe |
The Collinsella-1 RNA motif denotes a particular conserved RNA structure discovered by bioinformatics. [1] Of the six sequences belonging to this motif that were originally identified, five are from uncultivated bacteria residing in the human gut, while only the sixth is in a cultivated species, Collinsella aerofaciens . The evidence supporting the stem-loops designated as "P1" and "P2" is ambiguous.
The ykkC/yxkD leader is a conserved RNA structure found upstream of the ykkC and yxkD genes in Bacillus subtilis and related genes in other bacteria. The function of this family is unclear for many years although it has been suggested that it may function to switch on efflux pumps and detoxification systems in response to harmful environmental molecules. The Thermoanaerobacter tengcongensis sequence AE013027 overlaps with that of purine riboswitch suggesting that the two riboswitches may work in conjunction to regulate the upstream gene which codes for TTE0584 (Q8RC62), a member of the permease family.
The Acido-Lenti-1 RNA motif describes a predicted non-coding RNA that is found in bacteria within the phyla acidobacteria and lentisphaerae. It is sometimes found nearby to group II introns, but the reason for this apparent association is unknown.
The Bacillaceae-1 RNA motif is a conserved RNA structure identified by bioinformatics within bacteria in the family bacillaceae. The RNA is presumed to operate as a non-coding RNA, and is sometimes adjacent to operons containing ribosomal RNAs. The most characteristic feature is two terminal loops that have the nucleotide consensus RUCCU, where R is either A or G. The motif might be related to the Desulfotalea-1 RNA motif, as the motifs share some similarity in conserved features, and the Desulfotalea-1 RNA motif is also sometimes adjacent to ribosomal RNA operons.
The Bacteroidales-1 RNA motif is a conserved RNA structure identified by bioinformatics. It has been identified only in bacteria within the order (biology) Bacteroidales. Its presumed length is marked by a promoter on one end that conforms to an alternate consensus sequence that is common in the phylum Bacteroidetes, and its 3′ end is indicated by predicted transcription terminators. It is often located downstream of a gene that encodes the L20 ribosomal subunit, although it is unclear whether there is a functional reason underlying this apparent association.
The Chlorobi-1 RNA motif is a conserved RNA secondary structure identified by bioinformatics. It is predicted to be used only by Chlorobi, a phylum of bacteria. The motif consists of two stem-loops that are followed by an apparent rho-independent transcription terminator. The motif is presumed to function as an independently transcribed non-coding RNA.
The Chloroflexi-1 RNA motif is a conserved RNA structure detected by bioinformatics within the species Chloroflexus aggregans. C. aggregans has three predicted Chloroflexi-1 RNAs, which are located nearby to one another. This arrangement might suggest a repetitive element. C. aggregans is classified as belonging to the bacterial phylum Chloroflexi.
The Clostridiales-1 RNA motif is a conserved RNA structure identified by bioinformatics. It is a four-stem structure common in bacteria that inhabit the human gut and is also found in a variety of bacteria classified within the order Clostridiales. Its function is unknown.
The Downstream-peptide motif refers to a conserved RNA structure identified by bioinformatics in the cyanobacterial genera Synechococcus and Prochlorococcus and one phage that infects such bacteria. It was also detected in marine samples of DNA from uncultivated bacteria, which are presumably other species of cyanobacteria.
The glutamine riboswitch is a conserved RNA structure that was predicted by bioinformatics. It is present in a variety of lineages of cyanobacteria, as well as some phages that infect cyanobacteria. It is also found in DNA extracted from uncultivated bacteria living in the ocean that are presumably species of cyanobacteria.
The Flavo-1 RNA motif is a conserved RNA structure that was identified by bioinformatics. The vast majority of Flavo-1 RNAs are found in Flavobacteria, but some were detected in the phylum Bacteroidetes, which contains Flavobacteria, or the phylum Spirochaetes, which is evolutionarily related to Bacteroidetes. It was presumed that Flavo-1 RNAs function as non-coding RNAs.
The Lacto-usp RNA motif is a conserved RNA structure identified in bacteria by bioinformatics. Lacto-usp RNAs are found exclusively in lactic acid bacteria, and exclusively in the possible 5′ untranslated regions of operons that contain a hypothetical gene and a usp gene. The usp gene encodes the universal stress protein. It was proposed that the Lacto-usp might correspond to the 6S RNA of the relevant species, because four of five of these species lack a predicted 6S RNA, and 6S RNAs commonly occur in 5′ UTRs of usp genes. However, given that the Lacto-usp RNA motif is much shorter than the standard 6S RNA structure, the function of Lacto-usp RNAs remains unclear.
The pan RNA motif defines a conserved RNA structure that was identified using bioinformatics. pan motif RNAs are present in three phyla: Chloroflexi, Firmicutes and Proteobacteria, although within the latter phylum they are only known in deltaproteobacteria. A pan RNA is present in the Firmicute Bacillus subtilis, which is one of the most extensively studied bacteria.
The Polynucleobacter-1 RNA motif is a conserved RNA structure that was identified by bioinformatics. The RNA structure is predominantly located in genome sequences derived from DNA extracted from uncultivated marine samples. However it was also predicted in the genome of Polynucleobacter species QLW-P1DMWA-1, a kind of betaproteobacteria. The RNAs are often located near to a conserved gene that might be homologous to a gene found in a phage that infects cyanobacteria. However, it is unknown if the RNA is used by phages.
The potC RNA motif is a conserved RNA structure discovered using bioinformatics. The RNA is detected only in genome sequences derived from DNA that was extracted from uncultivated marine bacteria. Thus, this RNA is present in environmental samples, but not yet found in any cultivated organism. potC RNAs are located in the presumed 5' untranslated regions of genes predicted to encode either membrane transport proteins or peroxiredoxins. Therefore, it was hypothesized that potC RNAs are cis-regulatory elements, but their detailed function is unknown.
The Pseudomon-groES RNA motif is a conserved RNA structure identified in certain bacteria using bioinformatics. It is found in most species within the family Pseudomonadaceae, and is consistently located in the 5' untranslated regions of operons that contain groES genes. RNA transcripts of the groES genes in Pseudomonas aeruginosa where shown experimentally to be initiated at one of two start sites, from promoters called "P1" and "P2". The Pseudomon-groES RNA is in the 5' UTR of transcripts initiated from the P1 site, but is truncated in P2 transcripts. groES genes are involved in the cellular response to heat shock, but it is not thought that the Pseudomonas-groES RNA motif is involved in heat shock regulation. However, it is thought that the motif might regulate groES genes in response to other stimuli.
The Pyrobac-1 RNA motif is a conserved RNA structure discovered by bioinformatics. RNAs conforming to this motif have been found only in Pyrobaculum, a genus of archaea. Instances of the motif are hypothesized to function as non-coding RNAs. The motif has been shown to be part of sRNA202 and sRNA203 canonical and noncanonical pseudouridine guide RNAs in Pyrobaculum.
The Ssbp, Topoisomerase, Antirestriction, XerDC Integrase RNA motif is a conserved RNA-like structure identified using bioinformatics. STAXI RNAs are located near to genes encoding proteins that interact with DNA or are associated with such proteins. This observation raised the possibility that instances of the STAXI motif function as single-stranded DNA molecules, perhaps during DNA replication or DNA repair. On the other hand, STAXI motifs often contain terminal loops conforming to the stable UNCG tetraloop, but the DNA version of this tetraloop (TNCG) is not especially stable. The STAXI motif consists of a simple pseudoknot structure that is repeated two or more times.
The sucA-II RNA motif is a conserved RNA structure identified by bioinformatics. It is consistently found in the presumed 5' untranslated regions of sucA genes, which encode Oxoglutarate dehydrogenase enzymes that participate in the citric acid cycle. Given this arrangement, sucA-II RNAs might regulate the downstream sucA gene. This genetic arrangement is similar to the previously reported sucA RNA motif. However, sucA-II RNAs are found only in bacteria classified within the genus Pseudomonas, whereas the previously reported motif is found only in betaproteobacteria.
The TwoAYGGAY RNA motif is a conserved RNA structure identified by bioinformatics. Its name refers to the conserved AYGGAY nucleotide sequence found in the motif's two terminal loops. The RNAs are found in sequences derived from DNA extracted from uncultivated bacteria present in the human gut, as well as some bacteria in the classes Clostridia and Gammaproteobacteria.
The Whalefall-1 RNA motif refers to a conserved RNA structure that was discovered using bioinformatics. Structurally, the motif consists of two stem-loops, the second of which is often terminated by a CUUG tetraloop, which is an energetically favorable RNA sequence. Whalefall-1 RNAs are found only in DNA extracted from uncultivated bacteria found on whale fall, i.e., a whale carcass. As of 2010, Whalefall-1 RNAs have not been detected in any known, cultivated species of bacteria, and are thus one of several RNAs present in environmental samples.