From Genome Analysis Wiki
Jump to navigationJump to search
No change in size
, 15:29, 19 February 2014
Line 12: |
Line 12: |
| | | |
| Indel representation is not unique, you should normalize them and remove duplicates. | | Indel representation is not unique, you should normalize them and remove duplicates. |
| + | |
| + | Variant normalization is implemented in [[vt#Normalization|vt]] and this page explains the algorithm |
| + | and also provides a simple proof of correctness - [[Variant_Normalization|Variant Normalization]] |
| | | |
| The following table shows the number of variants that had to be normalized and the corresponding | | The following table shows the number of variants that had to be normalized and the corresponding |
Line 20: |
Line 23: |
| Out of 9996 passed variants, it was found that after normalization, only 8904 distinct Indels remain | | Out of 9996 passed variants, it was found that after normalization, only 8904 distinct Indels remain |
| - about a loss of 11% of variant thought distinct. | | - about a loss of 11% of variant thought distinct. |
− |
| |
− | Variant normalization is implemented in [[vt#Normalization|vt]] and this page explains the algorithm
| |
− | and also provides a simple proof of correctness - [[Variant_Normalization|Variant Normalization]]
| |
| | | |
| {| class="wikitable" | | {| class="wikitable" |