Difference between revisions of "Vmatch"

From Genome Analysis Wiki
Jump to navigationJump to search
(Created page with 'Category:Software '''vmatch''' is a variant matching program for MNPs, INDELs and precise SVs in VCF files. == Basic Usage Example == vmatch <vcf-file-1> <vcf-file-2>…')
 
Line 30: Line 30:
 
     possible variant pair.  The percentage of matches is given at 3 levels for each
 
     possible variant pair.  The percentage of matches is given at 3 levels for each
 
     variant total of both VCF files.
 
     variant total of both VCF files.
 +
 
     The 3 match levels (in order of decreasing strictness) are given as:
 
     The 3 match levels (in order of decreasing strictness) are given as:
 
       Level 1) SRSA    - Same Position, same REF and ALT
 
       Level 1) SRSA    - Same Position, same REF and ALT
Line 37: Line 38:
 
       Level 3) DRDA    - Same Position, different REF and different ALT
 
       Level 3) DRDA    - Same Position, different REF and different ALT
 
       Level 3) DRDNA  - Same Position, different REF and different number of ALT
 
       Level 3) DRDNA  - Same Position, different REF and different number of ALT
 +
 
 
       Level 1 represents matches in position and alleles
 
       Level 1 represents matches in position and alleles
 
       Level 2 represents matches in position and reference alleles but different alternate alleles
 
       Level 2 represents matches in position and reference alleles but different alternate alleles

Revision as of 11:05, 16 January 2012

vmatch is a variant matching program for MNPs, INDELs and precise SVs in VCF files.

Basic Usage Example

 vmatch <vcf-file-1> <vcf-file-2> -g <genome-file> -w <int> -d

Here is an example of how vmatch works:

  vmatch 1000g.vcf got2d.vcf -g hg18.fa  -w 10 -d

Command Line Options

   vcf-file       VCF file
   genome-file    Memory Mapped Sequence file
   w              window size
   d              debug

Description

   Outputs 2 files
     match.txt : gives the matched pairs
                 1)id1
                 2)id2
                 3)match type
                 4)extended no of bases
                 5)normalized
     match.log : Details of the extension and normalization process for all compared pairs
   vmatch matches the variants in 2 VCF files by choosing the best match for every
   possible variant pair.  The percentage of matches is given at 3 levels for each
   variant total of both VCF files.
   The 3 match levels (in order of decreasing strictness) are given as:
      Level 1) SRSA    - Same Position, same REF and ALT
      Level 1) SRSAN   - Same Position, same REF and ALT after normalization
      Level 2) SRDA    - Same Position, same REF and different ALT
      Level 2) SRDNA   - Same Position, same REF and different number of ALT
      Level 3) DRDA    - Same Position, different REF and different ALT
      Level 3) DRDNA   - Same Position, different REF and different number of ALT
 
      Level 1 represents matches in position and alleles
      Level 2 represents matches in position and reference alleles but different alternate alleles
      Level 3 represents matches only in position

Download

For the current lfSingle, please go to our GLF Tools Website.

TODO

Support for X chromosome variant calling.

Support for a two pass depth filter that uses the data to automatically work out appropriate filtering thresholds.