Changes

From Genome Analysis Wiki
574 bytes removed, 22:04, 7 May 2015
Line 682: Line 682:  
</div>
 
</div>
   −
=== Compute Features ===
+
=== Estimate ===
   −
Compute features in a VCF file.  Example of statistics are Allele counts, [[Genotype_Likelihood_based_Inbreeding_Coefficient|Genotype Likelihood based Inbreeding Coefficient]].
+
  Compute variant-based estimates. Examples of statistics are allele counts, [[Genotype_Likelihood_based_Inbreeding_Coefficient|Genotype Likelihood based Inbreeding Coefficient]].
[[Genotype_Likelihood_based_Allele_Frequency|Hardy-Weinberg Genotype Likelihood based Allele Frequencies]]
+
  [[Genotype_Likelihood_based_Allele_Frequency|Hardy-Weinberg Genotype Likelihood based Allele Frequencies]]
    
<div class=" mw-collapsible mw-collapsed">
 
<div class=" mw-collapsible mw-collapsed">
 
   #compute features for the variants found in vt.vcf
   #requires GT, PL and DP
+
   #requires GT and PL
   vt compute_features vt.vcf
+
   vt estimate -e AF,MLEAF vt.vcf
   −
  #annotates variants with the following fields
+
  AF        Genotype (GT) based allele frequencies
  ##INFO=<ID=AC,Number=A,Type=Integer,Description="Alternate Allele Counts">
+
              If genotypes are unavailable, best guess
  ##INFO=<ID=AN,Number=1,Type=Integer,Description="Total Number Allele Counts">
+
              genotypes are inferred based on genotype
  ##INFO=<ID=NS,Number=1,Type=Integer,Description="Number of Samples With Data">
+
              likelihoods (GL or PL)
  ##INFO=<ID=AF,Number=A,Type=Float,Description="Alternate Allele Frequency">
+
              AC       : Alternate Allele counts
  ##INFO=<ID=GC,Number=G,Type=Integer,Description="Genotype Counts">
+
              AN       : Total allele counts
  ##INFO=<ID=GN,Number=1,Type=Integer,Description="Total Number of Genotypes Counts">
+
              NS       : No. of samples.
  ##INFO=<ID=GF,Number=G,Type=Float,Description="Genotype Frequency">
+
              AF       : Alternate allele frequencies.
  ##INFO=<ID=HWEAF,Number=A,Type=Float,Description="Genotype likelihood based MLE Allele Frequency assuming HWE">
+
  MLEAF      GL based allele frequency estimates
  ##INFO=<ID=HWEGF,Number=G,Type=Float,Description="Genotype likelihood based MLE Genotype Frequency assuming HWE">
+
              MLEAF    : Alternate allele frequency derived from MLEGF
  ##INFO=<ID=MLEAF,Number=A,Type=Float,Description="Genotype likelihood based MLE Allele Frequency">
+
              MLEGF    : Genotype frequencies.
  ##INFO=<ID=MLEGF,Number=G,Type=Float,Description="Genotype likelihood based MLE Genotype Frequency">
+
  HWEAF     GL based allele frequency estimates assuming HWE
   ##INFO=<ID=HWE_LLR,Number=1,Type=Float,Description="Genotype likelihood based Hardy Weinberg ln(Likelihood Ratio)">
+
              HWEAF    : Alternate allele frequencies
  ##INFO=<ID=HWE_LPVAL,Number=1,Type=Float,Description="Genotype likelihood based Hardy Weinberg Likelihood Ratio Test Statistic ln(p-value)">
+
              HWEGF     : Genotype frequencies derived from HWEAF.
  ##INFO=<ID=HWE_DF,Number=1,Type=Integer,Description="Degrees of freedom for Genotype likelihood based Hardy Weinberg Likelihood Ratio Test Statistic">
+
  HWE       GL based Hardy-Weinberg statistics.
  ##INFO=<ID=FIC,Number=1,Type=Float,Description="Genotype likelihood based Inbreeding Coefficient">
+
              HWE_LLR   : log likelihood ratio
  ##INFO=<ID=AB,Number=1,Type=Float,Description="Genotype likelihood based Allele Balance">
+
              HWE_LPVAL : log p-value
 +
              HWE_DF   : degrees of freedom
 +
  AB        GL based Allele Balance.
 +
  FIC       GL based Inbreeding Coefficient
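
The relationships between the fields listed above can be illustrated with a short Python sketch. This is not vt's implementation: it assumes a biallelic site, diploid samples, and the usual VCF genotype ordering (0/0, 0/1, 1/1) for Number=G fields. The function names are hypothetical, and counting a partially missing genotype as a sample with data is an assumption, not documented vt behaviour.

```python
from typing import Dict, List, Optional, Sequence, Tuple

# One diploid genotype call; None marks a missing allele (e.g. "./.").
Genotype = Tuple[Optional[int], Optional[int]]


def gt_based_estimates(genotypes: Sequence[Genotype]) -> Dict[str, Optional[float]]:
    """AC, AN, NS and AF for one biallelic site, computed from GT calls."""
    ac = 0  # alternate allele count
    an = 0  # total number of called alleles
    ns = 0  # number of samples with at least one called allele (assumption)
    for gt in genotypes:
        called = [allele for allele in gt if allele is not None]
        if not called:
            continue
        ns += 1
        an += len(called)
        ac += sum(1 for allele in called if allele == 1)
    af = ac / an if an else None  # undefined when nothing is called
    return {"AC": ac, "AN": an, "NS": ns, "AF": af}


def allele_freq_from_genotype_freqs(gf: Sequence[float]) -> float:
    """Alternate allele frequency from genotype frequencies (as MLEAF from MLEGF)."""
    hom_ref, het, hom_alt = gf
    return het / 2 + hom_alt


def hwe_genotype_freqs(af: float) -> List[float]:
    """Genotype frequencies implied by an allele frequency under HWE (as HWEGF from HWEAF)."""
    return [(1 - af) ** 2, 2 * af * (1 - af), af ** 2]


if __name__ == "__main__":
    calls = [(0, 0), (0, 1), (1, 1), (None, None)]
    print(gt_based_estimates(calls))
    print(allele_freq_from_genotype_freqs([0.25, 0.5, 0.25]))
    print(hwe_genotype_freqs(0.5))
```

The sketch does not show how the likelihood-based values (MLEGF, HWEAF) are estimated from GL or PL in the first place; it only shows how the derived fields follow from them.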
    
<div class="mw-collapsible-content">
 
<div class="mw-collapsible-content">
   usage : vt compute_features for variants [options] <in.vcf>
+
   usage : vt estimate [options] <in.vcf>
 
   
 
   
 
   options : -s  print site information only without genotypes [false]
 
             -o  output VCF/VCF.GZ/BCF file [-]
 +
            -e  comma separated estimates to be computed []
 
             -f  filter expression []
 
             -I  File containing list of intervals