Binary Alignment Map

BAM file format
Filename extension: .bam
Type of format: Bioinformatics
Extended from: Tab-separated values
Website: samtools.github.io/hts-specs/

Binary Alignment Map (BAM) is a file format that holds the comprehensive raw data of genome sequencing; [2] it is the lossless, compressed binary representation of Sequence Alignment Map (SAM) files. [3] [4]

Schema

BAM is the compressed binary representation of SAM (Sequence Alignment Map), a compact and indexable representation of nucleotide sequence alignments. [5] Indexing makes it possible to retrieve the alignments that overlap a specific location quickly, without scanning the whole file. Before it can be indexed, a BAM file must be sorted by reference ID and then by leftmost coordinate. [1] BAM files are compressed in the BGZF format.
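The sort order required for indexing can be illustrated with a small sketch. It is not how a real BAM index works internally; it only shows, under simplified assumptions, why sorting by reference ID and then leftmost coordinate lets a region query stop early. The `Alignment` record and the helper names `coordinate_sort` and `overlapping` are hypothetical and exist only for this example.

```python
from typing import NamedTuple


class Alignment(NamedTuple):
    """Hypothetical, simplified alignment record (not the BAM layout)."""
    ref_id: int  # index of the reference sequence
    pos: int     # 0-based leftmost coordinate
    end: int     # 0-based exclusive end coordinate
    name: str


def coordinate_sort(alignments):
    """Sort by reference ID, then by leftmost coordinate."""
    return sorted(alignments, key=lambda a: (a.ref_id, a.pos))


def overlapping(sorted_alignments, ref_id, start, end):
    """Return names of alignments overlapping [start, end) on ref_id.

    Because the input is coordinate-sorted, the scan can stop as soon
    as an alignment starts at or beyond the end of the query region.
    """
    hits = []
    for a in sorted_alignments:
        if a.ref_id < ref_id:
            continue  # still on an earlier reference
        if a.ref_id > ref_id or a.pos >= end:
            break     # everything after this point is out of range
        if a.end > start:
            hits.append(a.name)
    return hits


reads = [
    Alignment(1, 50, 150, "r3"),
    Alignment(0, 300, 400, "r2"),
    Alignment(0, 100, 200, "r1"),
]
sorted_reads = coordinate_sort(reads)
hits = overlapping(sorted_reads, ref_id=0, start=150, end=320)
```

A real index additionally avoids the linear scan over earlier records by jumping directly to the relevant compressed blocks; this sketch leaves that part out.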

Figure: The BAM format; image from: https://samtools.github.io/hts-specs/SAMv1.pdf

The structure of a BAM file includes a header section and an alignment section. [6]
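The header section can be sketched in code. The example below assumes the layout described in the SAM/BAM specification: a 4-byte magic string `BAM\1`, the length and text of the SAM header, and then a list of reference names and lengths, all little-endian. The functions `build_header` and `parse_header` are hypothetical helpers for illustration, and plain gzip stands in for BGZF here (BGZF is a blocked, gzip-compatible variant), so this is a sketch rather than a full reader.

```python
import gzip
import struct

MAGIC = b"BAM\x01"


def build_header(text, references):
    """Serialise a minimal, uncompressed BAM header (illustrative only)."""
    text_bytes = text.encode("ascii")
    out = MAGIC + struct.pack("<I", len(text_bytes)) + text_bytes
    out += struct.pack("<I", len(references))
    for name, length in references:
        name_bytes = name.encode("ascii") + b"\x00"  # NUL-terminated name
        out += struct.pack("<I", len(name_bytes)) + name_bytes
        out += struct.pack("<I", length)
    return out


def parse_header(data):
    """Parse the header; return (text, references, bytes consumed)."""
    if data[:4] != MAGIC:
        raise ValueError("not a BAM stream")
    offset = 4
    (l_text,) = struct.unpack_from("<I", data, offset)
    offset += 4
    text = data[offset:offset + l_text].decode("ascii")
    offset += l_text
    (n_ref,) = struct.unpack_from("<I", data, offset)
    offset += 4
    references = []
    for _ in range(n_ref):
        (l_name,) = struct.unpack_from("<I", data, offset)
        offset += 4
        # l_name counts the trailing NUL, which is dropped here.
        name = data[offset:offset + l_name - 1].decode("ascii")
        offset += l_name
        (l_ref,) = struct.unpack_from("<I", data, offset)
        offset += 4
        references.append((name, l_ref))
    return text, references, offset


raw = build_header("@HD\tVN:1.6\tSO:coordinate\n", [("chr1", 248956422)])
compressed = gzip.compress(raw)  # stand-in for BGZF compression
text, refs, consumed = parse_header(gzip.decompress(compressed))
```

In a real file the alignment records would begin at the offset returned in `consumed`.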

The BAM format uses a 0-based coordinate system, whereas SAM uses a 1-based coordinate system. Integer optional fields in BAM can represent values in the range [−2^31, 2^32). [1]
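Both points can be made concrete with a short sketch. It assumes that the range [−2^31, 2^32) arises as the union of the signed and unsigned 32-bit integer types, and it uses the type codes `i` and `I` for those two types. The function names are hypothetical, and a real writer may prefer smaller integer types where they fit.

```python
import struct

INT32_MIN = -2**31
INT32_MAX = 2**31 - 1
UINT32_MAX = 2**32 - 1


def sam_to_bam_pos(sam_pos):
    """Convert a 1-based SAM position to a 0-based BAM position."""
    return sam_pos - 1


def bam_to_sam_pos(bam_pos):
    """Convert a 0-based BAM position to a 1-based SAM position."""
    return bam_pos + 1


def pack_bam_integer(value):
    """Pack an integer as little-endian int32 or uint32.

    Returns (type_code, bytes). Values outside [-2**31, 2**32) cannot
    be represented and raise OverflowError.
    """
    if INT32_MIN <= value <= INT32_MAX:
        return "i", struct.pack("<i", value)
    if INT32_MAX < value <= UINT32_MAX:
        return "I", struct.pack("<I", value)
    raise OverflowError(f"{value} is outside [-2**31, 2**32)")


first_base = sam_to_bam_pos(1)            # first base of a reference
code_low, _ = pack_bam_integer(-2**31)    # smallest representable value
code_high, _ = pack_bam_integer(2**32 - 1)  # largest representable value
```

The upper bound is exclusive, so `pack_bam_integer(2**32)` raises `OverflowError`.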

References

  1. "Sequence Alignment/Map Format Specification" (PDF). The SAM/BAM Format Specification Working Group. 3 Jun 2021.
  2. "Carl Zimmer's Game of Genomes, Season 1: Episode 3, BAM Reveals All". STAT. 11 July 2016. Retrieved 2016-08-21.
  3. Li, Heng (2009-06-08). "The Sequence Alignment/Map format and SAMtools" (PDF). Bioinformatics. 25 (16): 2078–9. doi:10.1093/bioinformatics/btp352. PMC 2723002. PMID 19505943.
  4. "Binary Alignment Map". National Cancer Institute Wiki. Retrieved 2016-08-21.
  5. "Genome Browser BAM Track Format". genome.ucsc.edu. Retrieved 2022-05-05.
  6. "BAM File Format". support.illumina.com. Retrieved 2022-05-05.