Changes

From Genome Analysis Wiki
Jump to navigationJump to search
36 bytes removed ,  15:49, 20 May 2010
no edit summary
Line 106: Line 106:  
A: http://www.sph.umich.edu/csg/yli/mach/download/HapMap-r21.html <br><br>  
 
A: http://www.sph.umich.edu/csg/yli/mach/download/HapMap-r21.html <br><br>  
   −
'''Q: Where can I find HapMap III reference files? '''  
+
=== '''Where can I find HapMap III / 1000 Genomes reference files? ''' ===
 +
A: http://www.sph.umich.edu/csg/yli/mach/download/ <br>
   −
'''Q: Where can I find 1000 Genomes reference files?'''<br> A: http://www.sph.umich.edu/csg/yli/mach/download/ <br>
+
=== '''Does --mle overwrite fed-in genotypes?''' ===
 
+
A: Yes. But rarely. --mle outputs the most likely genotype guesses by integrating over the probabilities of all possible configurations based on the reference haplotypes. The overwriting happens when the most likely guess differs from the experimental counterpart.<br><br>  
'''Q: Does --mle overwrite fed-in genotypes?'''<br> A: Yes. But rarely. --mle outputs the most likely genotype guesses by integrating over the probabilities of all possible configurations based on the reference haplotypes. The overwriting happens when the most likely guess differs from the experimental counterpart.<br><br>  
  −
 
  −
'''Q: How do I get imputation quality estimates?'''
      +
=== '''How do I get imputation quality estimates?''' ===
 
A: A simple approach is to use --mask option. For example, --mask 0.02 masks 2% of the genotypes at random, impute them and compare with the masked original to estimate genotypic and allelic error rates. Messages like the following will be generated to stdout:  
 
A: A simple approach is to use --mask option. For example, --mask 0.02 masks 2% of the genotypes at random, impute them and compare with the masked original to estimate genotypic and allelic error rates. Messages like the following will be generated to stdout:  
   Line 126: Line 125:  
<br>  
 
<br>  
   −
'''Q: How do I get reference files for an region of interest? '''<br> A: (1) For HapMapII format, download http://www.sph.umich.edu/csg/ylwtx/HapMapForMach.tgz <br>  
+
===  '''How do I get reference files for an region of interest? ''' ===
 +
A: (1) For HapMapII format, download http://www.sph.umich.edu/csg/ylwtx/HapMapForMach.tgz <br>  
    
&nbsp;&nbsp;&nbsp; (2) For MACH format, you can do the following:  
 
&nbsp;&nbsp;&nbsp; (2) For MACH format, you can do the following:  
Line 141: Line 141:  
   awk '{print $3}' orig.hap | cut -c${first}-${last} &gt; region.hap
 
   awk '{print $3}' orig.hap | cut -c${first}-${last} &gt; region.hap
   −
<br>'''Q: Do I have to sort the pedigree file by physical positions? '''<br> A: If you use external reference, you do not have to as long as the external reference is in correct order. **HOWEVER**, we strongly recommend sorting the pedigree files. <br><br>  
+
=== '''Do I have to sort the pedigree file by physical positions? ''' ===
 +
A: If you use external reference, you do not have to as long as the external reference is in correct order. **HOWEVER**, we strongly recommend sorting the pedigree files. <br><br>  
   −
<br>'''Q: What if I specified --states R where R exceeds the maximum possible (2*number diploid individuals - 2 + number_haplotypes)? '''<br> A: mach automatically resets it to maximum possible value.  
+
=== '''What if I specified --states R where R exceeds the maximum possible (2*number diploid individuals - 2 + number_haplotypes)? ''' ===
 +
A: mach automatically resets it to maximum possible value.  
    
<br>  
 
<br>  
   −
'''Q: Can I used unphased reference?'''<br>
+
=== '''Can I used unphased reference?''' ===
    
A: Yes. You simply need a combined pedigree (.ped) and marker information file (.dat). <br>  
 
A: Yes. You simply need a combined pedigree (.ped) and marker information file (.dat). <br>  
212

edits

Navigation menu