|
|
(8 intermediate revisions by the same user not shown) |
Line 1: |
Line 1: |
− | Many data sets consist of individuals from different populations, in the cases of structured populations,
| + | #REDIRECT [[Genotype_Likelihood_based_Inbreeding_Coefficient]] |
− | this usually result in an increased number of homozygotes.
| |
− | | |
− | The inbreeding coefficient FIC is a measure of deviation away from the Hardy Weinberg Equilibrium.
| |
− | A value of 0 implies no deviation, a negative value implies an excess of heterozygotes and a positive value implies an excess of homozygotes.
| |
− | | |
− | The following equation gives the estimate of F where the observed genotypes are available.
| |
− | | |
− | <math>
| |
− | \begin{align}
| |
− | F_{IC} & = 1 - \frac{O(Het)}{E(Het)} \\
| |
− | & = 1 - \frac{\text{No. observed HETs}}{E(Het|\textbf{p)}} \\
| |
− | & = 1 - \frac{\text{No. observed HETs}}{\sum_{i=1}^{n}{\sum_j{P(Het_j|\textbf{p})}}} \\
| |
− | | |
− | \end{align}
| |
− | </math>
| |
− |
| |
− | The following equation gives the estimate of F where genotype likelihoods are available.
| |
− | | |
− | <math>
| |
− | \begin{align}
| |
− | F_{IC} & = 1 - \frac{O(Het)}{E(Het)} \\
| |
− | & = 1 - \frac{E(Het|R_i, \textbf{p})}{E(Het|\textbf{p})} \\
| |
− | & = 1 - \frac{\sum_{i=1}^{n}{\sum_j{P(Het_j|R_i , \textbf{p})}}} {\sum_{i=1}^{n}{\sum_{j}{P(Het_j|\textbf{p})}}} \\
| |
− | & = 1 - \frac{\sum_{i=1}^{n}{\sum_j{\frac{P(R_i|Het_j,\textbf{p})P(Het_j|\textbf{p})}{\sum_{(k,l)}{P(R_i|G_{(k,l)},\textbf{p})P(G_{(k,l)}|\textbf{p})}}}}}
| |
− | {\sum_{i=1}^{n}{\sum_j{P(Het_j|\textbf{p})}}} \\
| |
− | \end{align}
| |
− | </math>
| |
− | | |
− | where:
| |
− | | |
− | <math>
| |
− | \begin{align}
| |
− | | |
− | P(G_{(k,l)}|\textbf{g}) & = & g_{(k,l)}
| |
− | | |
− | \end{align}
| |
− | </math>
| |
− | | |
− | <math>
| |
− | P(G_{(k,l)}|\textbf{p}) =
| |
− | \begin{cases}
| |
− | p_k^2, & \text{if }k=l \\
| |
− | 2p_kp_l, & \text{if }k \ne l
| |
− | \end{cases}
| |
− | </math>
| |