The lacto-2 RNA motif is an RNA structure that is conserved amongst bacteria within the order Lactobacillales. [1] The motif consists of a stem-loop whose stem is interrupted by many internal loops and bulges. Nucleotide identities in many places are conserved, and one internal loop in particular is highly conserved.
As lacto-2 RNAs are not consistently located in 5′ UTRs, they are presumed to correspond to non-coding RNAs. However, most (80%) of the RNAs are in a position that may correspond to the 5′ UTR, so it is not inconceivable that the RNA has a role as a cis-regulatory element. Many lacto-2 RNAs are present in operons that encode tRNAs and rRNAs, and many are adjacent to genes encoding protein subunits of the ribosome, although they are not necessarily in the same operon as these protein-coding genes. Lacto-2 RNAs also have a weak association with genes involved in nucleotide biosynthesis and transport, including several independent genes within the de-novo purine biosynthesis pathway and some in pyrimidine biosynthesis.
In molecular genetics, the three prime untranslated region (3′-UTR) is the section of messenger RNA (mRNA) that immediately follows the translation termination codon. The 3′-UTR often contains regulatory regions that post-transcriptionally influence gene expression.
In molecular biology, a riboswitch is a regulatory segment of a messenger RNA molecule that binds a small molecule, resulting in a change in production of the proteins encoded by the mRNA. Thus, an mRNA that contains a riboswitch is directly involved in regulating its own activity, in response to the concentrations of its effector molecule. The discovery that modern organisms use RNA to bind small molecules, and discriminate against closely related analogs, expanded the known natural capabilities of RNA beyond its ability to code for proteins, catalyze reactions, or to bind other RNA or protein macromolecules.
In genetics, attenuation is a proposed mechanism of control in some bacterial operons which results in premature termination of transcription and is based on the fact that, in bacteria, transcription and translation proceed simultaneously. Attenuation involves a provisional stop signal (attenuator), located in the DNA segment that corresponds to the leader sequence of mRNA. During attenuation, the ribosome becomes stalled (delayed) in the attenuator region in the mRNA leader. Depending on the metabolic conditions, the attenuator either stops transcription at that point or allows read-through to the structural gene part of the mRNA and synthesis of the appropriate protein.
Usually found in gram-positive bacteria, the T box leader sequence is an RNA element that controls gene expression through the regulation of translation by binding directly to a specific tRNA and sensing its aminoacylation state. This interaction controls expression of downstream aminoacyl-tRNA synthetase genes, amino acid biosynthesis, and uptake-related genes in a negative feedback loop. The uncharged tRNA acts as the effector for transcription antitermination of genes in the T-box leader family. The anticodon of a specific tRNA base pairs to a specifier sequence within the T-box motif, and the NCCA acceptor tail of the tRNA base pairs to a conserved bulge in the T-box antiterminator hairpin.
The mini-ykkC RNA motif was discovered as a putative RNA structure that is conserved in bacteria. The motif consists of two conserved stem-loops whose terminal loops contain the RNA sequence ACGR, where R represents either A or G. Mini-ykkC RNAs are widespread in Proteobacteria, but some are predicted in other phyla of bacteria. It was expected that the RNAs are cis-regulatory elements, because they are typically located upstream of protein-coding genes.
The ATPC RNA motif is a conserved RNA structure found in certain cyanobacteria. It is apparently ubiquitous in Prochlorococcus marinus, and is present in many species in the genus Synechococcus. The RNA is always found within an operon encoding subunits of ATP synthase, and it is always located downstream of the gene encoding the A subunit of ATP synthase, and upstream of the C subunit gene. This location is consistent with a cis-regulatory element, but also with a non-coding RNA that is transcribed with the ATP synthase genes.
The gamma-150 RNA motif is a conserved RNA structure that is found in bacteria within the order Pseudomonadales. Because gamma-150 RNAs are not consistently in 5' UTRs, the gamma-150 motif is presumed to correspond to a non-coding RNA.
The MAEB RNA motif is a conserved stem-loop RNA structure present in many species in the genus Burkholderia. MAEB stem-loops typically occur in blocks of repeats, usually with 2–6 consecutive instances of MAEB stem-loops separated by a short and conserved linker sequence. As many as 12 consecutive MAEB stem-loops have been observed in a single block.
The Chlorobi-RRM RNA motif is a conserved RNA structure identified by bioinformatics. It is found within bacteria in the phylum Chlorobi, and is exclusively detected in the presumed 5' untranslated regions of genes that encode putative RNA-binding proteins. Since many RNA-binding proteins regulate their own expression in a feedback mechanism by binding or acting up their 5' UTR, it was proposed that the Chlorobi-RRM is a component in an analogous feedback mechanism. Structurally, the motif consists of two stem-loops, the second of which might function as a rho-independent transcription terminator.
The Cyano-2 RNA motif is a conserved RNA structure identified by bioinformatics. Cyano-2 RNAs are found in Cyanobacterial species classified within the genus Synechococcus. Many terminal loops in the two conserved stem-loops contain the nucleotide sequence GCGA, and these sequences might in some cases form stable GNRA tetraloops. Since the two stem-loops are somewhat distant from one another it is possible that they represent two independent non-coding RNAs that are often or always co-transcribed. The region one thousand base pairs upstream of predicted Cyano-2 RNAs is usually devoid of annotated features such as RNA or protein-coding genes. This absence of annotated genes within one thousand base pairs is relatively unusual within bacteria.
The Downstream-peptide motif refers to a conserved RNA structure identified by bioinformatics in the cyanobacterial genera Synechococcus and Prochlorococcus and one phage that infects such bacteria. It was also detected in marine samples of DNA from uncultivated bacteria, which are presumably other species of cyanobacteria.
The Lacto-usp RNA motif is a conserved RNA structure identified in bacteria by bioinformatics. Lacto-usp RNAs are found exclusively in lactic acid bacteria, and exclusively in the possible 5′ untranslated regions of operons that contain a hypothetical gene and a usp gene. The usp gene encodes the universal stress protein. It was proposed that the Lacto-usp might correspond to the 6S RNA of the relevant species, because four of five of these species lack a predicted 6S RNA, and 6S RNAs commonly occur in 5′ UTRs of usp genes. However, given that the Lacto-usp RNA motif is much shorter than the standard 6S RNA structure, the function of Lacto-usp RNAs remains unclear.
The pan RNA motif defines a conserved RNA structure that was identified using bioinformatics. pan motif RNAs are present in three phyla: Chloroflexi, Firmicutes and Proteobacteria, although within the latter phylum they are only known in deltaproteobacteria. A pan RNA is present in the Firmicute Bacillus subtilis, which is one of the most extensively studied bacteria.
The pfl RNA motif refers to a conserved RNA structure present in some bacteria and originally discovered using bioinformatics. pfl RNAs are consistently present in genomic locations that likely correspond to the 5' untranslated regions of protein-coding genes. This arrangement in bacteria is commonly associated with cis-regulatory elements. Moreover, they are in presumed 5' UTRs of multiple non-homologous genes, suggesting that they function only in these locations. Additional evidence of cis-regulatory function came from the observation that predicted rho-independent transcription terminators overlap pfl RNAs. This overlap suggests that the alternate secondary structures of pfl RNA and the transcription terminator stem-loops compete with each other, and this is a common mechanism for cis gene control in bacteria.
The psaA RNA motif describes a class of RNAs with a common secondary structure. psaA RNAs are exclusively found in locations that presumably correspond to the 5' untranslated regions of operons formed of psaA and psaB genes. For this reason, it was hypothesized that psaA RNAs function as cis-regulatory elements of these genes. The psaAB genes encode proteins that form subunits in the photosystem I structure used for photosynthesis. psaA RNAs have been detected only in cyanobacteria, which is consistent with their association with photosynthesis.
The yjdF RNA motif is a conserved RNA structure identified using bioinformatics. Most yjdF RNAs are located in bacteria classified within the phylum Firmicutes. A yjdF RNA is found in the presumed 5' untranslated region of the yjdF gene in Bacillus subtilis, and almost all yjdF RNAs are found in the 5' UTRs of homologs of this gene. The function of the yjdF gene is unknown, but the protein that it is predicted to encode is classified by the Pfam Database as DUF2992.
The leuA-Halobacteria RNA motif is a conserved RNA structure that was discovered by bioinformatics. leuA-Halobacteria motifs are found in Halobacteriaceae, a lineage of archaea.
The uup RNA motif is a conserved RNA structure that was discovered by bioinformatics. uup motif RNAs are found in Firmicutes and Gammaproteobacteria.
The Zeta-pan RNA motif is a conserved RNA structure that was discovered by bioinformatics. Zeta-pan motif RNAs are found in Zetaproteobacteria.
LOC101928193 is a protein which in humans is encoded by the LOC101928193 gene. There are no known aliases for this gene or protein. Similar copies of this gene, called orthologs, are known to exist in several different species across mammals, amphibians, fish, mollusks, cnidarians, fungi, and bacteria. The human LOC101928193 gene is located on the long (q) arm of chromosome 9 with a cytogenic location at 9q34.2. The molecular location of the gene is from base pair 133,189,767 to base pair 133,192,979 on chromosome 9 for an mRNA length of 3213 nucleotides. The gene and protein are not yet well understood by the scientific community, but there is data on its genetic makeup and expression. The LOC101928193 protein is targeted for the cytoplasm and has the highest level of expression in the thyroid, ovary, skin, and testes in humans.