MiRBase

Last updated
miRBase
MiRBase logo.png
Content
DescriptionmicroRNA database
Contact
Research center University of Manchester
Authors Ana Kozomara
Primary citationKozomara & al. (2011) [1]
Release date2010
Access
Website www.mirbase.org

In bioinformatics, miRBase is a biological database that acts as an archive of microRNA sequences and annotations. [1] [2] [3] [4] As of September 2010 it contained information about 15,172 microRNAs. [1] This number has risen to 38,589 by March 2018. [5] The miRBase registry provides a centralised system for assigning new names to microRNA genes. [6]

miRBase grew from the microRNA registry resource set up by Sam Griffiths-Jones in 2003. [7]

According to Ana Kozomara and Sam Griffiths-Jones miRBase has five aims: [1]

  1. To provide a consistent naming system for microRNAs
  2. To provide a central place collecting all known microRNA sequences
  3. To provide human and computer readable information for each microRNA
  4. To provide primary evidence for each microRNA
  5. To aggregate and link to microRNA target information

MiRBase contains miRNAs belonging of various species belonging to Alveolata, Chromalveolata, Metazoa, Mycetozoa, Viridiplantae and Viruses. For the Viridiplantae, in release 21 (2014) data is available for 73 species. This includes 4800 unique mature miRNAs and 8480 precursor sequences. [8]

The current version of MiRBase is release 22 (March 2018).

Related Research Articles

UniProt Database of protein sequences and functional information

UniProt is a freely accessible database of protein sequence and functional information, many entries being derived from genome sequencing projects. It contains a large amount of information about the biological function of proteins derived from the research literature. It is maintained by the UniProt consortium, which consists of several European bioinformatics organisations and a foundation from Washington, DC, United States.

The Protein Information Resource (PIR), located at Georgetown University Medical Center, is an integrated public bioinformatics resource to support genomic and proteomic research, and scientific studies. It contains protein sequences databases

Pfam

Pfam is a database of protein families that includes their annotations and multiple sequence alignments generated using hidden Markov models. The most recent version, Pfam 34.0, was released in March 2021 and contains 19,179 families.

Amos Bairoch

Amos Bairoch is a Swiss bioinformatician and Professor of Bioinformatics at the Department of Human Protein Sciences of the University of Geneva where he leads the CALIPHO group at the Swiss Institute of Bioinformatics (SIB) combining bioinformatics, curation, and experimental efforts to functionally characterize human proteins.

The Korean Bioinformation Center (KOBIC) is the Korean national research centre in bioinformatics, based in Daejeon, South Korea. The centre is comparable to the National Center for Biotechnology Information (NCBI) in the United States and the European Bioinformatics Institute (EBI) in Europe and plays a key role in various areas such as genomics, proteomics, systems biology, and personalized medicine.

Rfam is a database containing information about non-coding RNA (ncRNA) families and other structured RNA elements. It is an annotated, open access database originally developed at the Wellcome Trust Sanger Institute in collaboration with Janelia Farm, and currently hosted at the European Bioinformatics Institute. Rfam is designed to be similar to the Pfam database for annotating protein families.

MicrobesOnline

MicrobesOnline is a publicly and freely accessible website that hosts multiple comparative genomic tools for comparing microbial species at the genomic, transcriptomic and functional levels. MicrobesOnline was developed by the Virtual Institute for Microbial Stress and Survival, which is based at the Lawrence Berkeley National Laboratory in Berkeley, California. The site was launched in 2005, with regular updates until 2011.

Anders Krogh is a bioinformatician at the University of Copenhagen, where he leads the university's bioinformatics center. He is known for his pioneering work on the use of hidden Markov models in bioinformatics, and is co-author of a widely used textbook in bioinformatics. In addition, he also co-authored one of the early textbooks on neural networks. His current research interests include promoter analysis, non-coding RNA, gene prediction and protein structure prediction.

SUPERFAMILY is a database and search platform of structural and functional annotation for all proteins and genomes. It classifies amino acid sequences into known structural domains, especially into SCOP superfamilies. Domains are functional, structural, and evolutionary units that form proteins. Domains of common Ancestry are grouped into superfamilies. The domains and domain superfamilies are defined and described in SCOP. Superfamilies are groups of proteins which have structural evidence to support a common evolutionary ancestor but may not have detectable sequence homology.

This microRNA database and microRNA targets databases is a compilation of databases and web portals and servers used for microRNAs and their targets. MicroRNAs (miRNAs) represent an important class of small non-coding RNAs (ncRNAs) that regulate gene expression by targeting messenger RNAs.

IsomiR

isomiRs are miRNA sequences that have variations with respect to the reference sequence. The term was coined by Morin et al in 2008. It has been found that isomiR expression profiles can also exhibit race, population, and gender dependencies.

Competing endogenous RNAs hypothesis: ceRNAs regulate other RNA transcripts by competing for shared microRNAs. They are playing important roles in developmental, physiological and pathological processes, such as cancer. Multiple classes of ncRNAs and protein-coding mRNAs function as key ceRNAs (sponges) and to regulate the expression of mRNAs in plants and mammalian cells.

Single nucleotide polymorphism annotation is the process of predicting the effect or function of an individual SNP using SNP annotation tools. In SNP annotation the biological information is extracted, collected and displayed in a clear form amenable to query. SNP functional annotation is typically performed based on the available information on nucleic acid and protein sequences.

Alex Bateman

Alexander George Bateman is a computational biologist and Head of Protein Sequence Resources at the European Bioinformatics Institute (EBI), part of the European Molecular Biology Laboratory (EMBL) in Cambridge, UK. He has led the development of the Pfam biological database and introduced the Rfam database of RNA families. He has also been involved in the use of Wikipedia for community-based annotation of biological databases.

Non-coding RNAs have been discovered using both experimental and bioinformatic approaches. Bioinformatic approaches can be divided into three main categories. The first involves homology search, although these techniques are by definition unable to find new classes of ncRNAs. The second category includes algorithms designed to discover specific types of ncRNAs that have similar properties. Finally, some discovery methods are based on very general properties of RNA, and are thus able to discover entirely new kinds of ncRNAs.

References

  1. 1 2 3 4 Kozomara, A.; Griffiths-Jones, S. (2010). "MiRBase: integrating microRNA annotation and deep-sequencing data". Nucleic Acids Research. 39 (Database issue): D152–7. doi:10.1093/nar/gkq1027. PMC   3013655 . PMID   21037258.
  2. Griffiths-Jones, Sam (2010). "MiRBase: microRNA Sequences and Annotation". Current Protocols in Bioinformatics. Vol. Chapter 12. pp. Unit 12.9.1–10. doi:10.1002/0471250953.bi1209s29. ISBN   978-0471250951. PMID   20205188. S2CID   19303675.
  3. Griffiths-Jones, S.; Saini, H. K.; Van Dongen, S.; Enright, A. J. (2007). "MiRBase: tools for microRNA genomics". Nucleic Acids Research. 36 (Database issue): D154–8. doi:10.1093/nar/gkm952. PMC   2238936 . PMID   17991681.
  4. Griffiths-Jones, Sam (2006). "MiRBase: the MicroRNA Sequence Database". MicroRNA Protocols. Methods in Molecular Biology. Vol. 342. pp. 129–38. doi:10.1385/1-59745-123-1:129. ISBN   1-59745-123-1. PMID   16957372.
  5. "Mirbase 22 Release Notes".
  6. Griffiths-Jones, S.; Grocock, RJ; Van Dongen, S; Bateman, A; Enright, AJ (2006). "MiRBase: microRNA sequences, targets and gene nomenclature". Nucleic Acids Research. 34 (Database issue): D140–4. doi:10.1093/nar/gkj112. PMC   1347474 . PMID   16381832.
  7. Griffiths-Jones S (January 2004). "The microRNA Registry". Nucleic Acids Res. 32 (Database issue): D109–11. doi:10.1093/nar/gkh023. PMC   308757 . PMID   14681370.
  8. Nithin, Chandran; Thomas, Amal; Basak, Jolly; Bahadur, Ranjit Prasad (2017-11-15). "Genome-wide identification of miRNAs and lncRNAs in Cajanus cajan". BMC Genomics. 18 (1): 878. doi:10.1186/s12864-017-4232-2. ISSN   1471-2164. PMC   5688659 . PMID   29141604.