Changes

From Genome Analysis Wiki
Jump to navigationJump to search
477 bytes added ,  13:28, 30 July 2010
elaborate on details
Line 1: Line 1: −
The '''BAM Format''' is a binary format for storing aligned reads.  
+
The '''BAM Format''' is a binary format for storing sequence data.  
    
The current definition of the format is at [[http://samtools.sourceforge.net/SAM1.pdf BAM/SAM Specification]].
 
The current definition of the format is at [[http://samtools.sourceforge.net/SAM1.pdf BAM/SAM Specification]].
   −
A sister [[SAM Format]] can be used to store aligned reads in a text file.
+
The corresponding [[SAM Format]] can be used to store sequence data, both aligned as well as unaligned, in a human readable format.
 +
 
 +
BAM and SAM formats are designed to contain exactly the same information.  The SAM format is more human readable, and easier to process by conventional text based processing programs, such as awk, sed, python, cut and so on.  The BAM format provides binary versions of most of the same data, and is designed to compress reasonably well.
 +
 
 +
Our [[C++ Library: libbam|BamFile]] library reads both SAM and BAM format files.
75

edits

Navigation menu