Changes

From Genome Analysis Wiki
Jump to navigationJump to search
35 bytes added ,  09:57, 21 October 2010
no edit summary
Line 1: Line 1:  
haploxt is a C/C++ software developed by [https://www.sph.umich.edu/csg/yli/ Yun Li] and [https://www.sph.umich.edu/csg/abecasis/ Goncalo Abecasis]. It calculates LD (D' and r<sup>2</sup>) from phased haplotypes.  
 
haploxt is a C/C++ software developed by [https://www.sph.umich.edu/csg/yli/ Yun Li] and [https://www.sph.umich.edu/csg/abecasis/ Goncalo Abecasis]. It calculates LD (D' and r<sup>2</sup>) from phased haplotypes.  
   −
= Input Files =
+
= Input Files =
== Required ==
  −
=== Haplotype File ===
  −
The input haplotype file is in the form of one haplotype per line (no delimiter between alleles). The file could contain fields other than the actual haplotypes but must proceed the actual haplotype field.
     −
Sample haplotype file 1:
+
== Required  ==
Indiv1 HAPLO1 142344111132344413444421333122313342113321324233112323134222
  −
Indiv1 HAPLO2 144222444221443323334431431242131234311333144223411321334422
  −
Indiv2 HAPLO1 144224441221443323334331431243131232311321123323432321334412
  −
Indiv2 HAPLO2 342222444221344312343433333122313344113321324223112323334412
     −
Sample haplotype file 2:
+
=== Haplotype File  ===
142344111132344413444421333122313342113321324233112323134222
+
 
144222444221443323334431431242131234311333144223411321334422
+
The input haplotype file is in the form of one haplotype per line (no delimiter between alleles). The file could contain fields other than the actual haplotypes but must proceed the actual haplotype field.<br><br>
144224441221443323334331431243131232311321123323432321334412
+
 
342222444221344312343433333122313344113321324223112323334412
+
Sample haplotype file 1:<br>
 +
Indiv1 HAPLO1 142344111132344413444421333122313342113321324233112323134222<br>
 +
Indiv1 HAPLO2 144222444221443323334431431242131234311333144223411321334422<br>
 +
Indiv2 HAPLO1 144224441221443323334331431243131232311321123323432321334412<br>
 +
Indiv2 HAPLO2 342222444221344312343433333122313344113321324223112323334412<br>
 +
 
 +
Sample haplotype file 2:<br>
 +
142344111132344413444421333122313342113321324233112323134222<br>
 +
144222444221443323334431431242131234311333144223411321334422<br>
 +
144224441221443323334331431243131232311321123323432321334412<br>
 +
342222444221344312343433333122313344113321324223112323334412<br>
    
= Options  =
 
= Options  =
Line 37: Line 40:     
     (1) SNP&nbsp;: SNP name
 
     (1) SNP&nbsp;: SNP name
  (2) gErr&nbsp;: genotypic discordance rate
+
  (2) gErr&nbsp;: genotypic discordance rate
  (3) aErr&nbsp;: allelic discordance rate
+
  (3) aErr&nbsp;: allelic discordance rate
  (4) matchedG&nbsp;: number of genotypes matched
+
  (4) matchedG&nbsp;: number of genotypes matched
  (5) matchedA: number of alleles matched
+
  (5) matchedA: number of alleles matched
  (6) maskedG: total number of genotypes evaluated/masked (&lt;=n of course) (I should change the naming to comparedG or evaluatedG)
+
  (6) maskedG: total number of genotypes evaluated/masked (&lt;=n of course) (I should change the naming to comparedG or evaluatedG)
    
<br>  
 
<br>  
Line 50: Line 53:     
     (7) hetAerr&nbsp;: allelic discordance rate among heterozygotes
 
     (7) hetAerr&nbsp;: allelic discordance rate among heterozygotes
  (8) AL1: allele 1 (an arbitrary allele)
+
  (8) AL1: allele 1 (an arbitrary allele)
  (9) AL2: allele 2
+
  (9) AL2: allele 2
  (10) freq1: frequency of AL1
+
  (10) freq1: frequency of AL1
  (11) MAF
+
  (11) MAF
  (12) #true 1/1: # individuals with experimental genotype AL1/AL1
+
  (12) #true 1/1: # individuals with experimental genotype AL1/AL1
  (13) mm1/2: # of true AL1/AL1 being imputed as AL1/AL2
+
  (13) mm1/2: # of true AL1/AL1 being imputed as AL1/AL2
  (14) mm2/2: # of true AL1/AL1 being imputed as AL2/AL2
+
  (14) mm2/2: # of true AL1/AL1 being imputed as AL2/AL2
  (15) #true 1/2
+
  (15) #true 1/2
  (16) mm1/1
+
  (16) mm1/1
  (17) mm2/2
+
  (17) mm2/2
  (18) #true 2/2
+
  (18) #true 2/2
  (19) mm1/1
+
  (19) mm1/1
  (20) mm1/2
+
  (20) mm1/2
    
<br>  
 
<br>  
Line 73: Line 76:     
     (A) almajor: major allele
 
     (A) almajor: major allele
  (B) alminor: minor allele
+
  (B) alminor: minor allele
  (C) freq1: major allele frequency
+
  (C) freq1: major allele frequency
  (D) accuracy11: allelic concordance rate for homozygotes major allele
+
  (D) accuracy11: allelic concordance rate for homozygotes major allele
  (E) accuracy12: allelic concordance rate for heterozygotes
+
  (E) accuracy12: allelic concordance rate for heterozygotes
  (F) accuracy22: allelic concordance rate for homozygotes minor allele
+
  (F) accuracy22: allelic concordance rate for homozygotes minor allele
    
<br>  
 
<br>  
Line 86: Line 89:     
     (1) famid
 
     (1) famid
  (2) subjID
+
  (2) subjID
  (3) gErr
+
  (3) gErr
  (4) aErr
+
  (4) aErr
  (5) matchedG
+
  (5) matchedG
  (6) matchedA
+
  (6) matchedA
  (7) maskedG
+
  (7) maskedG
    
<br> This --byPerson option is useful if there is potential sample swap or inter-individual difference, e.g., sequencing depth, number of markers genotyped.  
 
<br> This --byPerson option is useful if there is potential sample swap or inter-individual difference, e.g., sequencing depth, number of markers genotyped.  
212

edits

Navigation menu