Difference between revisions of "BAM"

From Genome Analysis Wiki
Jump to navigationJump to search
(Created page with 'The '''BAM Format''' is a binary format for storing aligned reads. The current definition of the format is at http://samtools.sourceforge.net/SAM1.pdf BAM/SAM Specification…')
 
 
(4 intermediate revisions by 3 users not shown)
Line 1: Line 1:
The '''BAM Format''' is a binary format for storing aligned reads.  
+
The '''BAM Format''' is a binary format for storing sequence data.  
  
 
The current definition of the format is at [[http://samtools.sourceforge.net/SAM1.pdf BAM/SAM Specification]].
 
The current definition of the format is at [[http://samtools.sourceforge.net/SAM1.pdf BAM/SAM Specification]].
 +
 +
The corresponding [[SAM Format]] can be used to store sequence data, both aligned as well as unaligned, in a human readable format.
 +
 +
BAM and SAM formats are designed to contain the same information.  The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on.  The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.
 +
 +
Our [[C++ Library: libStatGen|libStatGen]] library reads both SAM and BAM format files.

Latest revision as of 23:16, 26 February 2013

The BAM Format is a binary format for storing sequence data.

The current definition of the format is at [BAM/SAM Specification].

The corresponding SAM Format can be used to store sequence data, both aligned as well as unaligned, in a human readable format.

BAM and SAM formats are designed to contain the same information. The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on. The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.

Our libStatGen library reads both SAM and BAM format files.