From Genome Analysis Wiki
Jump to navigationJump to search
3 bytes removed
, 11:22, 2 February 2017
Line 62: |
Line 62: |
| Estimated per allele error rate is 0.0293 | | Estimated per allele error rate is 0.0293 |
| | | |
− | A better approach is to mask a small proportion of SNPs (vs. genotypes in the above simple approach). One can generate a mask.dat from the original .dat file by simply changing the flag of a subset of markers from M to S2 without duplicating the .ped file. Post-imputation, one can use [http://genome.sph.umich.edu/wiki/CalcMatch CalcMatch ]and [http://csg.sph.umich.edu//ylwtx/doseR2.tgz doseR2.pl ]to estimate genotypic/allelic error rate and correlation respectively. Both programs can be downloaded from [http://csg.sph.umich.edu//ylwtx/software.html http://www.sph.umich.edu/csg/ylwtx/software.html]. | + | A better approach is to mask a small proportion of SNPs (vs. genotypes in the above simple approach). One can generate a mask.dat from the original .dat file by simply changing the flag of a subset of markers from M to S2 without duplicating the .ped file. Post-imputation, one can use [http://genome.sph.umich.edu/wiki/CalcMatch CalcMatch ]and [http://csg.sph.umich.edu//ylwtx/doseR2.tgz doseR2.pl ]to estimate genotypic/allelic error rate and correlation respectively. Both programs can be downloaded from [http://csg.sph.umich.edu//ylwtx/software.html http://csg.sph.umich.edu//ylwtx/software.html]. |
| | | |
| '''Warning''': Imputation involving masked datasets should be performed separately for imputation quality estimation. For production, one should use all available information. | | '''Warning''': Imputation involving masked datasets should be performed separately for imputation quality estimation. For production, one should use all available information. |