From Genome Analysis Wiki
Jump to navigationJump to search
955 bytes removed
, 20:33, 15 June 2014
Line 108: |
Line 108: |
| =Analyses= | | =Analyses= |
| | | |
− | ==File Preparation==
| |
| | | |
− | The VCF file you work with should preferably be BCF2.1 compatible. Here we provide an example in /net/fantasia/home/atks/indel_analysis_tutorial. \\
| + | vt peek all.genotypes.bcf |
− | | |
− | To convert to BCF format which will work fast with vt:
| |
− | | |
− | vt view mills.vcf -o mills.bcf
| |
− | | |
− | You will encounter an error as the header does not contain contigs. To fix this, you should construct a complete header for mills.vcf. This is done for you in mills.with.alt.hdr.*
| |
− | | |
− | vt view mills.with.alt.hdr.vcf -o mills.genotypes.bcf
| |
− | | |
− | To index:
| |
− | | |
− | vt index mills.genotypes.bcf
| |
− | | |
− | To extract just the site list which is convenient for working with if you are not analysing the genotypes of the individuals
| |
− | | |
− | vt view -s mills.genotypes.bcf -o mills.sites.bcf
| |
− | | |
− | To index:
| |
− | | |
− | vt index mills.sites.bcf
| |
− | | |
− | You may also work with vcf.gz, just name the output as *.vcf.gz. But it will be slower with vt.
| |
− | | |
− | | |
− | === A quick summary ===
| |
− | | |
− | atks@1000g:~/dev/vt/comparisons/seq_workshop$ vt peek run/final/all.genotypes.bcf
| |
− | peek v0.5
| |
| | | |
| options: input VCF file run/final/all.genotypes.bcf | | options: input VCF file run/final/all.genotypes.bcf |