Difference between revisions of "Mapping Quality Scores"

From Genome Analysis Wiki
Jump to: navigation, search
Line 8: Line 8:
  
 
MapQuality = SumBaseQual(best) / (Sigma_i (SumBaseQual(i))
 
MapQuality = SumBaseQual(best) / (Sigma_i (SumBaseQual(i))
 +
 +
 +
For paired end reads, we calculate SUM_BASE_Q as the sum of base quality scores at mismatched bases for both reads.
  
 
== Reference ==
 
== Reference ==
  
 
Li H, Ruan J, Durbin R. (2008) Mapping short DNA sequencing reads and calling variants using mapping quality scores. ''Genome Research'' '''18''':1851-8.
 
Li H, Ruan J, Durbin R. (2008) Mapping short DNA sequencing reads and calling variants using mapping quality scores. ''Genome Research'' '''18''':1851-8.

Revision as of 13:35, 18 December 2009

Mapping Quality Scores quantify the probability that a read is misplaced. They were introduced by Heng Li and Richard Durbin in their paper describing MAQ and are usually reported on a Phred scale.

Calculating a Mapping Quality Score

For a particular short sequence read, consider its best alignment in the genome. For this alignment, calculate the sum of base quality scores at mismatched bases and define a quantity SUM_BASE_Q(best). Also, consider all other possible alignments for the read. For the alignment i, define SUM_BASE_Q(i) as the sum of base quality scores at mismatched bases for that alignment.

Then, the mapping quality is defined as:

MapQuality = SumBaseQual(best) / (Sigma_i (SumBaseQual(i))


For paired end reads, we calculate SUM_BASE_Q as the sum of base quality scores at mismatched bases for both reads.

Reference

Li H, Ruan J, Durbin R. (2008) Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Research 18:1851-8.