SILVA ribosomal RNA database

Last updated

SILVA is a ribosomal RNA database [1] [2] [3] established in collaboration between the Microbial Genomics Group at the Max Planck Institute for Marine Microbiology in Bremen, Germany, the Department of Microbiology at the Technical University Munich, and Ribocon. [4] [5]

Contents

Release 117 of the database (January 2014) held more than 4,000,000 small subunit (SSU - 16S/18S) and 400,000 large subunit (LSU - 23S/28S) sequences. [6] [ failed verification ] Sequences are provided as files for the ARB software environment. [4]

See also

Related Research Articles

<span class="mw-page-title-main">Ribosomal RNA</span> RNA component of the ribosome, essential for protein synthesis in all living organisms

Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosomal DNA (rDNA) and then bound to ribosomal proteins to form small and large ribosome subunits. rRNA is the physical and mechanical factor of the ribosome that forces transfer RNA (tRNA) and messenger RNA (mRNA) to process and translate the latter into proteins. Ribosomal RNA is the predominant form of RNA found in most cells; it makes up about 80% of cellular RNA despite never being translated into proteins itself. Ribosomes are composed of approximately 60% rRNA and 40% ribosomal proteins by mass.

A structural gene is a gene that codes for any RNA or protein product other than a regulatory factor. A term derived from the lac operon, structural genes are typically viewed as those containing sequences of DNA corresponding to the amino acids of a protein that will be produced, as long as said protein does not function to regulate gene expression. Structural gene products include enzymes and structural proteins. Also encoded by structural genes are non-coding RNAs, such as rRNAs and tRNAs.

The ARB Project is a free software package for the phylogenetic analysis of rRNA and other biological sequences such as amino acids. It simplifies the import and assembly of genetic sequences from any organisms with the use of an automated aligner. This editing process can be depicted into two subcategories: "Primary Structure Editor" and "Secondary Structure Editor." Comprehensively, this allows the necessary make up for the "Phylogenetic Treeing." The software also allows the visualization of these biological sequences which gives the user a more in depth experience and interaction. This is particularly necessary when comparing phylogeny data from various organisms.

Rfam is a database containing information about non-coding RNA (ncRNA) families and other structured RNA elements. It is an annotated, open access database originally developed at the Wellcome Trust Sanger Institute in collaboration with Janelia Farm, and currently hosted at the European Bioinformatics Institute. Rfam is designed to be similar to the Pfam database for annotating protein families.

<span class="mw-page-title-main">Small nucleolar RNA SNORD102</span>

In molecular biology, snoRNA U102 is a non-coding RNA (ncRNA) molecule which functions in the modification of other small nuclear RNAs (snRNAs). This type of modifying RNA is usually located in the nucleolus of the eukaryotic cell which is a major site of snRNA biogenesis. It is known as a small nucleolar RNA (snoRNA) and also often referred to as a guide RNA.

<span class="mw-page-title-main">Small nucleolar RNA SNORD15</span>

In molecular biology, SNORD15 is a non-coding RNA (ncRNA) molecule which functions in the modification of other small nuclear RNAs (snRNAs). This type of modifying RNA is usually located in the nucleolus of the eukaryotic cell which is a major site of snRNA biogenesis. It is known as a small nucleolar RNA (snoRNA) and also often referred to as a guide RNA.

<span class="mw-page-title-main">16S ribosomal RNA</span> RNA component

16S ribosomal RNA is the RNA component of the 30S subunit of a prokaryotic ribosome. It binds to the Shine-Dalgarno sequence and provides most of the SSU structure.

<span class="mw-page-title-main">28S ribosomal RNA</span> RNA component of the large subunit of the eukaryotic ribosome

28S ribosomal RNA is the structural ribosomal RNA (rRNA) for the large subunit (LSU) of eukaryotic cytoplasmic ribosomes, and thus one of the basic components of all eukaryotic cells. It has a size of 25S in plants and 28S in mammals, hence the alias of 25S–28S rRNA.

<span class="mw-page-title-main">Human Microbiome Project</span> Former research initiative

The Human Microbiome Project (HMP) was a United States National Institutes of Health (NIH) research initiative to improve understanding of the microbiota involved in human health and disease. Launched in 2007, the first phase (HMP1) focused on identifying and characterizing human microbiota. The second phase, known as the Integrative Human Microbiome Project (iHMP) launched in 2014 with the aim of generating resources to characterize the microbiome and elucidating the roles of microbes in health and disease states. The program received $170 million in funding by the NIH Common Fund from 2007 to 2016.

miRBase

In bioinformatics, miRBase is a biological database that acts as an archive of microRNA sequences and annotations. As of September 2010 it contained information about 15,172 microRNAs. This number has risen to 38,589 by March 2018. The miRBase registry provides a centralised system for assigning new names to microRNA genes.

<span class="mw-page-title-main">Sequence Read Archive</span>

The Sequence Read Archive is a bioinformatics database that provides a public repository for DNA sequencing data, especially the "short reads" generated by high-throughput sequencing, which are typically less than 1,000 base pairs in length. The archive is part of the International Nucleotide Sequence Database Collaboration (INSDC), and run as a collaboration between the NCBI, the European Bioinformatics Institute (EBI), and the DNA Data Bank of Japan (DDBJ).

The Negativicutes are a class of bacteria in the phylum Bacillota, whose members have a peculiar cell wall with a lipopolysaccharide outer membrane which stains gram-negative, unlike most other members of the Bacillota. Although several neighbouring Clostridia species also stain gram-negative, the proteins responsible for the unusual diderm structure of the Negativicutes may have actually been laterally acquired from Pseudomonadota. Additional research is required to confirm the origin of the diderm cell envelope in the Negativicutes.

<span class="mw-page-title-main">'The All-Species Living Tree' Project</span>

'The All-Species Living Tree' Project is a collaboration between various academic groups/institutes, such as ARB, SILVA rRNA database project, and LPSN, with the aim of assembling a database of 16S rRNA sequences of all validly published species of Bacteria and Archaea. At one stage, 23S sequences were also collected, but this has since stopped.

Machine learning in bioinformatics is the application of machine learning algorithms to bioinformatics, including genomics, proteomics, microarrays, systems biology, evolution, and text mining.

Non-coding RNAs have been discovered using both experimental and bioinformatic approaches. Bioinformatic approaches can be divided into three main categories. The first involves homology search, although these techniques are by definition unable to find new classes of ncRNAs. The second category includes algorithms designed to discover specific types of ncRNAs that have similar properties. Finally, some discovery methods are based on very general properties of RNA, and are thus able to discover entirely new kinds of ncRNAs.

Devosiaceae is a family of Alphaproteobacteria.

Pleomorphomonadaceae is a family of Alphaproteobacteria.

References

  1. Christian Quast, Elmar Pruesse, Pelin Yilmaz, Jan Gerken, Timmy Schweer, Pablo Yarza, Jörg Peplies, Frank Oliver Glöckner (2013). The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Research41 (D1): D590-D596. doi:10.1093/nar/gks1219 [ third-party source needed ]
  2. Pelin Yilmaz; Laura Wegener Parfrey; Pablo Yarza; Jan Gerken; Elmar Pruesse; Christian Quast; Timmy Schweer; Jörg Peplies; Wolfgang Ludwig; Frank Oliver Glöckner (2014). "The SILVA and "All-species Living Tree Project (LTP)" taxonomic frameworks". Nucleic Acids Research. 42 (D1): D643–D648. doi: 10.1093/nar/gkt1209 . PMC   3965112 . PMID   24293649.
  3. Elmar Pruesse, Christian Quast, Katrin Knittel, Bernhard M. Fuchs, Wolfgang Ludwig, Jörg Peplies, Frank Oliver Glöckner (2007). SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Research35 (21): 7188-7196. doi:10.1093/nar/gkm864 [ third-party source needed ]
  4. 1 2 Silva. Ribocon GmbH. Accessed November 2016.
  5. Environmental Bioinformatics. Microbial Genomics and Bioinformatics Group, Max Planck Institute for Marine Microbiology. Accessed November 2016.[ third-party source needed ]
  6. Marc P. Hoeppner, Lars E. Barquist, Paul P. Gardner (2014). An Introduction to RNA Databases. Methods in Molecular Biology1097: 107–123. doi:10.1007/978-1-62703-709-9_6.