Barcode of Life Data System

The Barcode of Life Data System (commonly known as BOLD or BOLDSystems) is a web platform specifically devoted to DNA barcoding. [1] [2] It is a cloud-based data storage and analysis platform developed at the Centre for Biodiversity Genomics in Canada. It consists of four main modules: a data portal, an educational portal, a registry of BINs (Barcode Index Numbers, which represent putative species), and a data collection and analysis workbench that provides an online platform for analyzing DNA sequences. [2] Since its launch in 2005, BOLD has been extended to provide a range of functionality, including data organization, validation, visualization, and publication. The most recent version of the system, version 4, launched in 2017; it improves support for data collection and analysis and adds new functionality for data dissemination, citation, and annotation. [3] As of November 16, 2020, BOLD contained barcode sequences for 318,105 formally described species of animals, plants, fungi, and protists, drawn from about 8.9 million specimens. [4]

BOLD is freely available to any researcher with an interest in DNA barcoding. By providing specialized services, it aids in the publication of records that meet the standards needed to gain BARCODE designation in the international nucleotide sequence databases. Because of its web-based delivery and flexible data security model, it is also well positioned to support projects that involve broad research alliances. [3]

Most of the data released through BOLD originated from BARCODE 500K, a project carried out by the International Barcode of Life (iBOL) Consortium from 2010 to 2015. The project aimed to acquire DNA barcode records for 5 million specimens representing 500,000 species. Specimen collection, sequence assignment, and information sorting were contributed by a large number of scientists, collaborators, and facilities around the world. As the data accumulate, DNA barcode identification becomes more accurate, which advances the goal of barcoding all life.

Related Research Articles

Bioinformatics Computational analysis of large, complex sets of biological data

Bioinformatics is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combines biology, chemistry, physics, computer science, information engineering, mathematics and statistics to analyze and interpret the biological data. Bioinformatics has been used for in silico analyses of biological queries using mathematical and statistical techniques.

Biological database

Biological databases are libraries of biological sciences, collected from scientific experiments, published literature, high-throughput experiment technology, and computational analysis. They contain information from research areas including genomics, proteomics, metabolomics, microarray gene expression, and phylogenetics. Information contained in biological databases includes gene function, structure, localization, clinical effects of mutations as well as similarities of biological sequences and structures.

Comparative genomics

Comparative genomics is a field of biological research in which the genomic features of different organisms are compared. The genomic features may include the DNA sequence, genes, gene order, regulatory sequences, and other genomic structural landmarks. In this branch of genomics, whole or large parts of genomes resulting from genome projects are compared to study basic biological similarities and differences as well as evolutionary relationships between organisms. The major principle of comparative genomics is that common features of two organisms will often be encoded within the DNA that is evolutionarily conserved between them. Therefore, comparative genomic approaches start with making some form of alignment of genome sequences and looking for orthologous sequences in the aligned genomes and checking to what extent those sequences are conserved. Based on these, genome and molecular evolution are inferred and this may in turn be put in the context of, for example, phenotypic evolution or population genetics.

Metagenomics Study of genes found in the environment

Metagenomics is the study of genetic material recovered directly from environmental samples. The broad field may also be referred to as environmental genomics, ecogenomics or community genomics.

The Consortium for the Barcode of Life (CBOL) was an international initiative dedicated to supporting the development of DNA barcoding as a global standard for species identification. CBOL's Secretariat Office was hosted by the National Museum of Natural History, Smithsonian Institution, in Washington, DC. Barcoding was proposed in 2003 by Paul Hebert of the University of Guelph in Ontario as a way of distinguishing and identifying species with a short standardized gene sequence. Hebert proposed the 658 bases of the Folmer region of the mitochondrial gene cytochrome c oxidase I as the standard barcode region. Hebert is the Director of the Biodiversity Institute of Ontario, the Canadian Centre for DNA Barcoding, and the International Barcode of Life Project (iBOL), all headquartered at the University of Guelph. The Barcode of Life Data Systems (BOLD) is also located at the University of Guelph.

History of genetics

The history of genetics dates from the classical era with contributions by Pythagoras, Hippocrates, Aristotle, Epicurus, and others. Modern genetics began with the work of the Augustinian friar Gregor Johann Mendel. His work on pea plants, published in 1866, established the theory of Mendelian inheritance.

Sisyracera is a genus of snout moths in the subfamily Spilomelinae of the family Crambidae. It was described in 1890 by Heinrich Benno Möschler with Leucinodes preciosalis as type species, now considered a synonym of Sisyracera subulalis. The genus has been placed in the tribe Udeini.

DNA barcoding Method of species identification using a short section of DNA

DNA barcoding is a method of species identification using a short section of DNA from a specific gene or genes. The premise of DNA barcoding is that, by comparison with a reference library of such DNA sections, an individual sequence can be used to uniquely identify an organism to species, in the same way that a supermarket scanner uses the familiar black stripes of the UPC barcode to identify an item in its stock against its reference database. These "barcodes" are sometimes used in an effort to identify unknown species, parts of an organism, or simply to catalog as many taxa as possible, or to compare with traditional taxonomy in an effort to determine species boundaries.
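The comparison against a reference library can be illustrated with a minimal sketch. It assumes the sequences are already aligned to equal length, uses a simple proportion-of-differences distance, and applies an arbitrary cutoff. The function names, the toy sequences, and the `max_distance` value are all illustrative assumptions, not part of any barcoding standard or of BOLD itself.

```python
def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must not be empty")
    return sum(1 for x, y in zip(a, b) if x != y) / len(a)


def identify(query: str, reference: dict, max_distance: float = 0.1):
    """Return (species, distance) for the closest reference barcode.

    If even the closest reference is farther than max_distance (an
    assumed, illustrative cutoff), the species is reported as None.
    """
    best_species, best_dist = None, float("inf")
    for species, seq in reference.items():
        d = p_distance(query, seq)
        if d < best_dist:
            best_species, best_dist = species, d
    if best_dist > max_distance:
        return None, best_dist
    return best_species, best_dist


# Made-up 20-base "barcodes"; real barcode regions are much longer.
REFERENCE = {
    "Species A": "ACGTACGTACGTACGTACGT",
    "Species B": "ACGTTCGTACCTACGAACGT",
}

match = identify("ACGTACGTACGTACGTACGA", REFERENCE)
no_match = identify("TTTTTTTTTTTTTTTTTTTT", REFERENCE)
print(match[0], no_match[0])
```

In this toy run the first query differs from "Species A" at a single site and is assigned to it, while the second query is far from both references and is left unidentified.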

In metagenomics, binning is the process of grouping reads or contigs and assigning them to individual genomes. Binning methods can be based on either compositional features or alignment (similarity), or both.
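As a rough illustration of the compositional approach, the sketch below summarizes each contig by its k-mer frequency profile and greedily groups contigs whose profiles are close. The greedy strategy, the choice of `k=2`, the Euclidean distance, and the `threshold` value are assumptions made for the example; practical binning tools use longer k-mers, coverage information, and more robust clustering.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_profile(seq: str, k: int = 2) -> list:
    """Relative frequency of every A/C/G/T k-mer in seq, in a fixed order."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[km] for km in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[km] / total for km in kmers]


def euclidean(u, v) -> float:
    """Euclidean distance between two equal-length profiles."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def bin_contigs(contigs: dict, k: int = 2, threshold: float = 0.3) -> list:
    """Greedy composition-based binning (illustrative only).

    Each contig joins the first bin whose founding contig has a profile
    within `threshold`; otherwise it founds a new bin.
    """
    bins = []  # list of (founder_profile, [contig names])
    for name, seq in contigs.items():
        profile = kmer_profile(seq, k)
        for founder, members in bins:
            if euclidean(profile, founder) <= threshold:
                members.append(name)
                break
        else:
            bins.append((profile, [name]))
    return [members for _, members in bins]


# Made-up contigs: two AT-rich and two GC-rich.
CONTIGS = {
    "c1": "ATATATATATATATAT",
    "c2": "TATATATATATATATA",
    "c3": "GCGCGCGCGCGCGCGC",
    "c4": "CGCGCGCGCGCGCGCG",
}
result = bin_contigs(CONTIGS)
print(result)
```

With these toy contigs the AT-rich and GC-rich sequences fall into two separate bins, which mirrors the idea that contigs from the same genome tend to share sequence composition.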

Nosferatu (fish) Genus of fishes

Nosferatu is a genus of cichlid fishes endemic to the Rio Panuco Basin and the tributaries of the adjacent Tamiahua Lagoon and San Andrés Lagoon in the states of Veracruz, Hidalgo, San Luis Potosí, Tamaulipas and Querétaro, Mexico. The genus is characterized by a prolongation in the size of the symphysial pair of teeth relative to that of the other teeth in the outer row of the upper jaw; breeding pigmentation that consists of darkening of the ventral area extending over the nostrils, opercular series, and pectoral fins; a depressed dorsal fin that rarely extends beyond the anterior third of the caudal fin; and an elongated, elastic, smooth caecum adhered to a saccular stomach.

Pollen DNA barcoding Process of identifying pollen donor plant species

Pollen DNA barcoding is the process of identifying pollen donor plant species through the amplification and sequencing of specific, conserved regions of plant DNA. Being able to accurately identify pollen has a wide range of applications though it has been difficult in the past due to the limitations of microscopic identification of pollen.

Aquatic macroinvertebrate DNA barcoding

DNA barcoding is an alternative method to traditional morphological taxonomic classification, and has frequently been used to identify species of aquatic macroinvertebrates. Many of these macroinvertebrates are crucial indicator organisms in the bioassessment of freshwater and marine ecosystems.

Microbial DNA barcoding is the use of DNA metabarcoding to characterize a mixture of microorganisms. DNA metabarcoding is a method of DNA barcoding that uses universal genetic markers to identify DNA of a mixture of organisms.

Fish DNA barcoding

DNA barcoding methods for fish are used to identify groups of fish based on DNA sequences within selected regions of a genome. These methods can be used to study fish, as genetic material, in the form of environmental DNA (eDNA) or cells, is freely diffused in the water. This allows researchers to identify which species are present in a body of water by collecting a water sample, extracting DNA from the sample and isolating DNA sequences that are specific for the species of interest. Barcoding methods can also be used for biomonitoring and food safety validation, animal diet assessment, assessment of food webs and species distribution, and for detection of invasive species.

DNA barcoding in diet assessment

DNA barcoding in diet assessment is the use of DNA barcoding to analyse the diet of organisms and further detect and describe their trophic interactions. This approach is based on the identification of consumed species by characterization of DNA present in dietary samples, e.g. individual food remains, regurgitates, gut and fecal samples, or the homogenized body of the host organism that is the target of the diet study.

Winifred Hallwachs U.S. entomologist and tropical ecologist

Winifred Hallwachs is an American tropical ecologist who helped to establish and expand northwestern Costa Rica's Área de Conservación Guanacaste (ACG). The work of Hallwachs and her husband Daniel Janzen at ACG is considered an exemplar of inclusive conservation.

Fungal DNA barcoding Identification of fungal species thanks to specific DNA sequences

Fungal DNA barcoding is the process of identifying species of the biological kingdom Fungi through the amplification and sequencing of specific DNA sequences and their comparison with sequences deposited in a DNA barcode database such as the ISHAM reference database or the Barcode of Life Data System (BOLD). To this end, DNA barcoding relies on universal genes that are ideally present in all fungi with the same degree of sequence variation. The interspecific variation, i.e., the variation between species, in the chosen DNA barcode gene should exceed the intraspecific (within-species) variation.
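The requirement that variation between species exceed variation within species can be checked numerically. The following is a minimal sketch, assuming pre-aligned sequences of equal length and a simple proportion-of-differences distance; the function names and the toy records are hypothetical and do not come from any real database.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(1 for x, y in zip(a, b) if x != y) / len(a)


def barcode_gap(records):
    """Return (max intraspecific distance, min interspecific distance).

    `records` is a list of (species_name, aligned_sequence) pairs. A marker
    separates the species cleanly when the first value is smaller than the second.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        d = p_distance(s1, s2)
        (intra if sp1 == sp2 else inter).append(d)
    if not intra or not inter:
        raise ValueError("need both within-species and between-species pairs")
    return max(intra), min(inter)


# Made-up 20-base sequences for two hypothetical species.
RECORDS = [
    ("Fungus A", "ACGTACGTACGTACGTACGT"),
    ("Fungus A", "ACGTACGTACGTACGTACGA"),
    ("Fungus B", "ACGTTCGTACCTACGAACGT"),
    ("Fungus B", "ACGTTCGTACCTACGAACGA"),
]
max_intra, min_inter = barcode_gap(RECORDS)
print(max_intra < min_inter)
```

In this toy data set the largest within-species distance is smaller than the smallest between-species distance, so the made-up marker would separate the two species.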

Cheverella is a monotypic genus of snout moths in the subfamily Spilomelinae of the family Crambidae. It contains only one species, Cheverella galapagensis, which is endemic to the Galápagos Islands of Ecuador. Both the genus and the species were first described by Bernard Landry in 2011. The genus is placed in the tribe Udeini.


Metabarcoding is the barcoding of DNA/RNA in a manner that allows for the simultaneous identification of many taxa within the same sample. The main difference between barcoding and metabarcoding is that metabarcoding does not focus on one specific organism, but instead aims to determine species composition within a sample.


  1. Ratnasingham, Sujeevan; Hebert, Paul D. N. (2013). "A DNA-Based Registry for All Animal Species: The Barcode Index Number (BIN) System". PLOS ONE. 8 (7): e66213. Bibcode:2013PLoSO...866213R. doi:10.1371/journal.pone.0066213. ISSN 1932-6203. PMC 3704603. PMID 23861743.
  2. Ratnasingham, Sujeevan; Hebert, Paul D. N. (2007-01-24). "BARCODING: bold: The Barcode of Life Data System (". Molecular Ecology Notes. 7 (3): 355–364. doi:10.1111/j.1471-8286.2007.01678.x. ISSN 1471-8278. PMC 1890991. PMID 18784790.
  3. "BOLD Print Handbook for BOLD v4". Retrieved 2020-11-16.
  4. "Kingdoms of Life Being Barcoded | BOLDSYSTEMS". Retrieved 2020-11-16.