From Genome Analysis Wiki
Jump to navigationJump to search
93 bytes added
, 15:11, 30 April 2010
Line 98: |
Line 98: |
| For MACH format, you can do the following: | | For MACH format, you can do the following: |
| | | |
− | First, find the first and last SNP in the region you are interested in. Say "rsFIRST" and "rsLAST", defined according to position. | + | First, find the first and last SNP in the region you are interested in. Say "rsFIRST" and "rsLAST", defined according to position. |
| | | |
− | Then: | + | Then: |
| | | |
| @ first = `grep -n rsFIRST orig.snps | cut -f1 -d ':'` | | @ first = `grep -n rsFIRST orig.snps | cut -f1 -d ':'` |
| @ last = `grep -n rsLAST orig.snps | cut -f1 -d ':'` | | @ last = `grep -n rsLAST orig.snps | cut -f1 -d ':'` |
| | | |
− | Finally (assuming the third field contains the actual haplotypes, where alleles are separated by nothing): | + | Finally (assuming the third field contains the actual haplotypes, where alleles are separated by nothing): |
| | | |
| awk '{print $3}' orig.hap | cut -c${first}-${last} > region.hap | | awk '{print $3}' orig.hap | cut -c${first}-${last} > region.hap |