Difference between revisions of "Polymutt2"

From Genome Analysis Wiki
Jump to navigationJump to search
Line 11: Line 11:
 
* NOTE: current version can only process one chromosome at a time
 
* NOTE: current version can only process one chromosome at a time
  
* vcf2map: generate a sparse map file
+
* vcf2map: generate a sparse map file (see Download section for files genetic_map_GRCh37_chr1.txt  and 1000G.SNV.clean.MAF0.05.tbl.gz)
 
   vcf2map --vcf input.vcf --ped input.ped --map genetic_map_GRCh37_chr1.txt --include_list 1000G.SNV.clean.MAF0.05.tbl.gz --out_map chr1.map
 
   vcf2map --vcf input.vcf --ped input.ped --map genetic_map_GRCh37_chr1.txt --include_list 1000G.SNV.clean.MAF0.05.tbl.gz --out_map chr1.map
 
   vcf2map --vcf input.vcf --ped input.ped --map genetic_map_GRCh37_chr1.txt --include_list 1000G.SNV.clean.MAF0.05.tbl.gz --max_r2 0.2 --min_avg_dp 2 --out_map chr1.r0.2.map
 
   vcf2map --vcf input.vcf --ped input.ped --map genetic_map_GRCh37_chr1.txt --include_list 1000G.SNV.clean.MAF0.05.tbl.gz --max_r2 0.2 --min_avg_dp 2 --out_map chr1.r0.2.map

Revision as of 23:13, 1 July 2015

Updates

The latest version of 0.1 is available for Download.

Compilation

  • After downloading the source code, unzip and untar it, and cd polymutt2, and then type Make
  • Two executables will be generated in bin/ directory: polymutt2 and vcf2map
  • vcf2map is to prune LD and generate a map file with high quality SNPs
  • polymutt2 is to generate genotype calls taking a VCF file and a map file as input

Usage

  • NOTE: current version can only process one chromosome at a time
  • vcf2map: generate a sparse map file (see Download section for files genetic_map_GRCh37_chr1.txt and 1000G.SNV.clean.MAF0.05.tbl.gz)
 vcf2map --vcf input.vcf --ped input.ped --map genetic_map_GRCh37_chr1.txt --include_list 1000G.SNV.clean.MAF0.05.tbl.gz --out_map chr1.map
 vcf2map --vcf input.vcf --ped input.ped --map genetic_map_GRCh37_chr1.txt --include_list 1000G.SNV.clean.MAF0.05.tbl.gz --max_r2 0.2 --min_avg_dp 2 --out_map chr1.r0.2.map
  • polymutt2: taking a VCF and the map file generated by vcf2map
polymutt2 -p input.ped -m chr1.map --in_vcf input.vcf --out_vcf out.vcf
polymutt2 -p input.ped -m chr1.map --in_vcf input.vcf --out_vcf out.vcf --best_path
polymutt2 -p input.ped -m chr1.map --in_vcf input.vcf --out_vcf out.vcf --fam_id fam1
polymutt2 -p input.ped -m chr1.map --in_vcf input.vcf --out_vcf out.vcf --out_range 1:1000000-2000000

Download

  • The latest version of source code v0.1 can be downloaded here.
  • The genetic map files (genetic_map_GRCh37_chr1.txt) used above can be downloaded here.
  • The clean and common variants in the 1000 Genome Project (1000G.SNV.clean.MAF0.05.tbl.gz) used above can be downloaded here.

Contact

For questions please contact the authors (Bingshan Li: bingshan@umich.edu)

Citation

Li B, Wei Q, Zhan X, Zhong X, Chen W, Li C, et al. (2015) Leveraging Identity-by-Descent for Accurate Genotype Inference in Family Sequencing Data. PLoS Genet 11(6): e1005271. doi:10.1371/journal.pgen.1005271