Changes

From Genome Analysis Wiki
Jump to navigationJump to search
65 bytes added ,  10:19, 3 April 2012
Line 27: Line 27:  
=== SAM/BAM ===
 
=== SAM/BAM ===
   −
==== General Tools ====
   
*[[QPLOT]] - Calculate & plot summary statistics
 
*[[QPLOT]] - Calculate & plot summary statistics
*[[BamUtil: validate|Validate]] – Check file format & print statistics
   
*[[VerifyBamID]] – Check sample identities for contamination/sample swap
 
*[[VerifyBamID]] – Check sample identities for contamination/sample swap
 
**Genotype concordance based detection
 
**Genotype concordance based detection
 
**Estimate based on population allele frequencies without genotype data
 
**Estimate based on population allele frequencies without genotype data
*[[BamUtil: diff|Diff]] - Print the diffs between 2 bams
  −
*[[BamUtil: stats|Stats]] - Generate some statistics for a SAM/BAM file
        −
==== Rewrite SAM/BAM file ====
  −
*[[BamUtil: convert|Convert]] – Convert between SAM & BAM
  −
*[[BamUtil: splitBam|SplitBam]] – Split into 1 file per Read Group
  −
*[[BamUtil: splitChromosome|SplitChromosome]] – Split into 1 file per Chromosome
  −
*[[BamUtil: writeRegion|WriteRegion]] – Write only reads in the specified region and/or have the specified read name
   
*[[Pileup]] – Pileup every base or just bases in specified region and write VCF
 
*[[Pileup]] – Pileup every base or just bases in specified region and write VCF
*[[BamUtil: convert#BAM File Recovery | BAM Recovery]] - Recover corrupted BAM files
  −
*[[BamUtil: asp | ASP]] - perform an asynchronous pileup producing an ASP file.  <span style="color:#D2691E">ASP is a new format that is currently in production, so this tool is not yet available for public release.</span>
     −
==== Update the File ====
   
*[[SuperDeDuper]] - Determine duplicate alignments, either marking or removing the lower quality duplicates. In addition, it may modify paired-end reads where the ends overlap by soft clipping the end with the lower quality bases in the region of overlap.
 
*[[SuperDeDuper]] - Determine duplicate alignments, either marking or removing the lower quality duplicates. In addition, it may modify paired-end reads where the ends overlap by soft clipping the end with the lower quality bases in the region of overlap.
 
*[[RGMergeBam]] – Merge sorted BAM files adding Read Groups
 
*[[RGMergeBam]] – Merge sorted BAM files adding Read Groups
 
*[[PolishBam]] – Add/Update header lines & add RG tag to each record
 
*[[PolishBam]] – Add/Update header lines & add RG tag to each record
 
*[[TrimBam]] – Trim end of reads, changing read ends to ‘N’ & quality to ‘!’
 
*[[TrimBam]] – Trim end of reads, changing read ends to ‘N’ & quality to ‘!’
*[[BamUtil: filter|Filter]] – Soft clip ends with too high mismatch % and mark unmapped if quality of mismatches is too high
  −
*[[BamUtil: revert|Revert]] - Revert SAM/BAM replacing the specified fields with their previous values (if known) and removes specified tags
  −
*[[BamUtil: squeeze|Squeeze]] - Reduce files size by dropping OQ fields, duplicates, specified tags, using '=' when a base matches the reference, binning quality scores, and replacing readNames with unique integers
  −
*[[BamUtil: clipOverlap|ClipOverlap]] - Clip overlapping read pairs so they do not overlap
     −
 
+
The following tools are part of the [[BamUtil|BamUtil program]].
==== Helper Tools to Print Readable Information ====
+
==== BAM Util Tools ====
*[[BamUtil: dumpHeader|DumpHeader]] - Print the File Header to the screen.
+
'''QC/Stats'''
*[[BamUtil: dumpRefInfo|DumpRefInfo]] - Print the reference information from the SAM/BAM header.
+
*[[BamUtil: validate|validate]] – Check file format & print statistics
*[[BamUtil: dumpIndex|DumpIndex]] - Print the BAM Index to the screen in a readable format
+
*[[BamUtil: diff|diff]] - Print the diffs between 2 bams
*[[BamUtil: readReference|ReadReference]] - Print the reference string for the specified region to the screen.
+
*[[BamUtil: stats|stats]] - Generate some statistics for a SAM/BAM file
*[[BamUtil: dumpAsp|DumpAsp]] - perform an asynchronous pileup producing an ASP file.  <span style="color:#D2691E">ASP is a new format that is currently in production, so this tool is not yet available for public release.</span>
+
'''Rewrite SAM/BAM file'''
 +
*[[BamUtil: convert|convert]] – Convert between SAM & BAM
 +
*[[BamUtil: splitBam|splitBam]] – Split into 1 file per Read Group
 +
*[[BamUtil: splitChromosome|splitChromosome]] – Split into 1 file per Chromosome
 +
*[[BamUtil: writeRegion|writeRegion]] – Write only reads in the specified region and/or have the specified read name
 +
*[[BamUtil: convert#BAM File Recovery | BAM Recovery]] - Recover corrupted BAM files
 +
*[[BamUtil: asp | asp]] - perform an asynchronous pileup producing an ASP file.  <span style="color:#D2691E">ASP is a new format that is currently in production, so this tool is not yet available for public release.</span>
 +
'''File Updates'''
 +
*[[BamUtil: filter|filter]] – Soft clip ends with too high mismatch % and mark unmapped if quality of mismatches is too high
 +
*[[BamUtil: revert|revert]] - Revert SAM/BAM replacing the specified fields with their previous values (if known) and removes specified tags
 +
*[[BamUtil: squeeze|squeeze]] - Reduce files size by dropping OQ fields, duplicates, specified tags, using '=' when a base matches the reference, binning quality scores, and replacing readNames with unique integers
 +
*[[BamUtil: clipOverlap|clipOverlap]] - Clip overlapping read pairs so they do not overlap
 +
'''Helper Tools to Print Readable Information'''
 +
*[[BamUtil: dumpHeader|dumpHeader]] - Print the File Header to the screen.
 +
*[[BamUtil: dumpRefInfo|dumpRefInfo]] - Print the reference information from the SAM/BAM header.
 +
*[[BamUtil: dumpIndex|dumpIndex]] - Print the BAM Index to the screen in a readable format
 +
*[[BamUtil: readReference|readReference]] - Print the reference string for the specified region to the screen.
 +
*[[BamUtil: dumpAsp|dumpAsp]] - perform an asynchronous pileup producing an ASP file.  <span style="color:#D2691E">ASP is a new format that is currently in production, so this tool is not yet available for public release.</span>
     

Navigation menu