Difference between revisions of "VcfCooker"

From Genome Analysis Wiki
Jump to navigationJump to search
Line 51: Line 51:
  
 
  vcfCooker --in-vcf /home/csidore/1000g_CEUTSI_WG/analysis_chr20/vcf/TSI+CEU+GBR.Q10.chr20.vcf --out 1KG.20100517.EUR.chr20.vcf.gz --bgzf --upgrade \
 
  vcfCooker --in-vcf /home/csidore/1000g_CEUTSI_WG/analysis_chr20/vcf/TSI+CEU+GBR.Q10.chr20.vcf --out 1KG.20100517.EUR.chr20.vcf.gz --bgzf --upgrade \
--filter --maxAB 65 --indelVCF /share/swg/hmkang/data/1000G/pilot_indels_2010_07/1kg.pilot_release.merged.indels.sites.hg19.chr20.vcf --winIndel 10 --minDP 93 --maxDP 1860 --minNS 19 --minQUAL 10 --write-vcf --winFFRQ 10 --maxFFRQ 30
+
  --filter --maxAB 65 --indelVCF /share/swg/hmkang/data/1000G/pilot_indels_2010_07/1kg.pilot_release.merged.indels.sites.hg19.chr20.vcf --winIndel 10 \
 +
  --minDP 93 --maxDP 1860 --minNS 19 --minQUAL 10 --write-vcf --winFFRQ 10 --maxFFRQ 30
  
 
== Acknowledgements ==
 
== Acknowledgements ==
  
 
VerifyBamID is a result from collaborative effort by Hyun Min Kang, Matthew Flickenger, Matthew Snyder, Paul Anderson, Tom Blackwell, Mary Kate Trost, and Goncalo Abecasis. Please email to Hyun Min Kang [[mailto:hmkang@umich.edu| hmkang@umich.edu ]] for any questions.
 
VerifyBamID is a result from collaborative effort by Hyun Min Kang, Matthew Flickenger, Matthew Snyder, Paul Anderson, Tom Blackwell, Mary Kate Trost, and Goncalo Abecasis. Please email to Hyun Min Kang [[mailto:hmkang@umich.edu| hmkang@umich.edu ]] for any questions.

Revision as of 23:54, 11 November 2010


vcfCooker is a software that converts VCF/BED file formats in various forms. vcfCooker is currently under development, and will be publicly released soon. The current documentation contains the minimal information of currently working functions.

Current Binary Location

Current binary version of vcfCooker is available at /share/swg/hmkang/bin/vcfCooker . Once it is thoroughly tested, it will be installed at /usr/cluster/bin/, and this page will reflect the changes.

Basic Usage

The following parameters are available.  Ones with "[]" are in effect:

Available Options
                      Recipes : --write-bed, --write-vcf, --upgrade,
                                --summarize, --filter
            VCF Input options : --in-vcf []
            BED Input options : --in-bfile [], --in-bed [], --in-bim [],
                                --in-fam [],
                                --ref [/data/local/ref/karma.ref/human.g1k.v37.fa]
               Output Options : --out [./vcfCooker]
   Output compression Options : --plain [ON], --bgzf, --gzip
               Filter Options : --winIndel, --indelVCF [], --minQUAL, --minMQ,
                                --maxDP [2147483647], --minDP, --maxAB [100],
                                --winFFRQ, --maxFFRQ, --minNS,
                                --maxSTP [2147483647], --maxTTT [2147483647],
                                --minTTT [-2147483648]


Upgrading glfMultiples outputs (v 3.3)

If you have a output from glfMultiples, you can upgrade the output files using the following command

vcfCooker --in-vcf [input-vcf-file-from-glfMultiples] --upgrade --out [output-vcf-file]

Upgraded VCFs will have the following improvements.

  • The additional tab between FORMAT field and genotype values will be removed, if exists
  • The REF and ALT alleles will be presented as capital letters.
  • The FORMAT field value, GT:GD:GQ will be changed to GT:DP:GQ:PL
  • depth will be changed to DP in the INFO field
  • mapQ will be changed to MQ in the INFO field
  • MAF will be changed to AF in the INFO field, with proper changes if needed.
  • NS will be added as a new INFO field
  • AC will be added as a new INFO field
  • AN will be added as a new INFO field
  • AB will be added as a new INFO field (suggested by Tom Blackwell at Genotype_Likelihood_Based_Allele_Balance)

Filtering a VCF file

A example command line of upgrading / filtering a glfMultiples output is as follows.

vcfCooker --in-vcf /home/csidore/1000g_CEUTSI_WG/analysis_chr20/vcf/TSI+CEU+GBR.Q10.chr20.vcf --out 1KG.20100517.EUR.chr20.vcf.gz --bgzf --upgrade \
 --filter --maxAB 65 --indelVCF /share/swg/hmkang/data/1000G/pilot_indels_2010_07/1kg.pilot_release.merged.indels.sites.hg19.chr20.vcf --winIndel 10 \
 --minDP 93 --maxDP 1860 --minNS 19 --minQUAL 10 --write-vcf --winFFRQ 10 --maxFFRQ 30

Acknowledgements

VerifyBamID is a result from collaborative effort by Hyun Min Kang, Matthew Flickenger, Matthew Snyder, Paul Anderson, Tom Blackwell, Mary Kate Trost, and Goncalo Abecasis. Please email to Hyun Min Kang [hmkang@umich.edu ] for any questions.