Difference between revisions of "BAM"

From Genome Analysis Wiki
Jump to navigationJump to search
(elaborate on details)
 
(One intermediate revision by the same user not shown)
Line 5: Line 5:
 
The corresponding [[SAM Format]] can be used to store sequence data, both aligned as well as unaligned, in a human readable format.
 
The corresponding [[SAM Format]] can be used to store sequence data, both aligned as well as unaligned, in a human readable format.
  
BAM and SAM formats are designed to contain exactly the same information.  The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on.  The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.
+
BAM and SAM formats are designed to contain the same information.  The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on.  The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.
  
Our [[C++ Library: libbam|BamFile]] library reads both SAM and BAM format files.
+
Our [[C++ Library: libStatGen|libStatGen]] library reads both SAM and BAM format files.

Latest revision as of 23:16, 26 February 2013

The BAM Format is a binary format for storing sequence data.

The current definition of the format is at [BAM/SAM Specification].

The corresponding SAM Format can be used to store sequence data, both aligned as well as unaligned, in a human readable format.

BAM and SAM formats are designed to contain the same information. The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on. The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.

Our libStatGen library reads both SAM and BAM format files.