Difference between revisions of "BAM"

From Genome Analysis Wiki
Jump to navigationJump to search
m (moved BAM Format to BAM)
(elaborate on details)
Line 1: Line 1:
The '''BAM Format''' is a binary format for storing aligned reads.  
+
The '''BAM Format''' is a binary format for storing sequence data.  
  
 
The current definition of the format is at [[http://samtools.sourceforge.net/SAM1.pdf BAM/SAM Specification]].
 
The current definition of the format is at [[http://samtools.sourceforge.net/SAM1.pdf BAM/SAM Specification]].
  
A sister [[SAM Format]] can be used to store aligned reads in a text file.
+
The corresponding [[SAM Format]] can be used to store sequence data, both aligned as well as unaligned, in a human readable format.
 +
 
 +
BAM and SAM formats are designed to contain exactly the same information.  The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on.  The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.
 +
 
 +
Our [[C++ Library: libbam|BamFile]] library reads both SAM and BAM format files.

Revision as of 13:28, 30 July 2010

The BAM Format is a binary format for storing sequence data.

The current definition of the format is at [BAM/SAM Specification].

The corresponding SAM Format can be used to store sequence data, both aligned as well as unaligned, in a human readable format.

BAM and SAM formats are designed to contain exactly the same information. The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on. The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.

Our BamFile library reads both SAM and BAM format files.