Difference between revisions of "AF"

From Genome Analysis Wiki
Jump to navigationJump to search
Line 9: Line 9:
 
</math>
 
</math>
  
where $G_{i,j}$ denotes the genotype composed of alleles $i$ and $j$$k$ indexes the individuals from $1$ to $N$.
+
where <math>G_{i,j}</math> denotes the genotype composed of alleles <math>i</math> and <math>j</math><math>k</math> indexes the individuals from <math>1</math> to <math>N</math>.
 
The initial genotype probability is given by:  
 
The initial genotype probability is given by:  
  

Revision as of 14:01, 11 April 2013

Estimation of Genotype Frequencies without assuming HWE

We propose an EM algorithm to estimate the genotype frequencies without assuming HWE. The posterior probability of the genotype given the reads for individual k () for the th iteration is given by:

where denotes the genotype composed of alleles and . indexes the individuals from to . The initial genotype probability is given by:

The E step equates the expectation of the genotype $G_{i,j}$ for individual k as:

The M step estimates the genotype frequency using the individual expected genotype counts:

This is repeated till the appropriate convergence criteria is achieved.

Estimation of Genotype Frequencies assuming HWE

In order to estimate allele frequencies under HWE assumption, the E step estimates the individual expected posterior allele count for each individual.

In the M step, the posterior genotype frequencies are derived from the computed genotype allele frequencies obtained in the E step assuming HWE.

This is repeated till the appropriate convergence criteria is achieved.

Derivation by

Adrian with much help from Hyun.

Maintained by

This page is maintained by Adrian.