Changes

From Genome Analysis Wiki
Jump to navigationJump to search
337 bytes added ,  17:56, 11 July 2017
Line 57: Line 57:  
== Convert to MaCH Files ==
 
== Convert to MaCH Files ==
   −
The following command line should convert a input VCF dosage file to a MaCH/minimac dosage file (the format that available in the earlier version of [Minimac minimac]). The generated dosage files can be tested for association using [http://genome.sph.umich.edu/wiki/Mach2dat:_Association_with_MACH_output mach2dat].  
+
The following command line should convert a input VCF dosage file to a MaCH/minimac dosage file (the format that was available in the earlier versions of [[Minimac | minimac]]). The generated dosage files can be tested for association using [http://genome.sph.umich.edu/wiki/Mach2dat:_Association_with_MACH_output mach2dat].  
    
  ./DosageConvertor        --vcfDose      TestDataImputedVCF.dose.vcf.gz
 
  ./DosageConvertor        --vcfDose      TestDataImputedVCF.dose.vcf.gz
                           --info        TestDataImputedVCF.info (NOT mandatory)
+
                           --info        TestDataImputedVCF.info         (NOT mandatory)
 
                           --prefix      OutPrefix
 
                           --prefix      OutPrefix
                           --type        plink
+
                           --type        mach
                           --format      1
+
                           --format      1                               (or 2)
 
  −
This command line would create three files :  <code> OutPrefix.plink.dosage.gz, OutPrefix.fam, OutPrefix.map </code>. The <code>--format</code> parameter can take values 1, 2 and 3. Each of these values correspond to the three different formats available for PLINK dosage files (details on PLINK dosage files are given [http://www.cog-genomics.org/plink/1.9/assoc#dosage here]). Note that the generated <code>OutPrefix.map</code> does NOT contain any phenotype information (which needs to be manually edited before PLINK can run association testing).
      +
When <code>--type mach</code> is used, the <code>--format</code> parameter can only take values 1 and 2. If the value is 1, the code generates <code> OutPrefix.mach.dose.gz, OutPrefix.info</code> where <code>OutPrefix.mach.dose.gz</code> contains the expected alternate allele count (one value per sample per marker). If the value is 2, it generates <code> OutPrefix.mach.gprob.gz, OutPrefix.info</code> where <code>OutPrefix.mach.gprob.gz</code> contains the genotype likelihoods for reference homozygote and heterozygote  (two values per sample per marker). Note that in the input <code>--info</code> is NOT mandatory. However, if this info file is NOT provided, the output <code>OutPrefix.info</code> file will have some empty columns. Thus, if available, the generated info file should be provided along with the VCF file as input.
    
== Converting Chromosome X Files ==
 
== Converting Chromosome X Files ==
487

edits

Navigation menu