From Genome Analysis Wiki
Jump to navigationJump to search
121 bytes added
, 14:04, 6 November 2012
Line 34: |
Line 34: |
| Reference files are required for doing Variant Calling. | | Reference files are required for doing Variant Calling. |
| | | |
− | See [[#Reference Files| Configuration Files: Reference Files]] for information on how to specify the reference files in the configuration.
| + | * Reference Sequence in fasta format. |
| + | ** Configuration File Setting: <code>REF = path/file.fa</code> |
| + | * Indel VCF File Prefix |
| + | ** Configuration File Setting: <code>INDEL_PREFIX = path/indels.sites.hg19</code> |
| + | ** <code>path/</code> contains <code>indels.sites.hg19.chr20.vcf</code> for each chromosome being processed |
| + | * DBSNP File Prefix |
| + | ** Configuration File Setting: <code>DBSNP_PREFIX = path/dbsnp_135_b37.rod</code> |
| + | ** <code>path/</code> contains <code>dbsnp_135_b37.rod.chr20.map</code> for each chromosome being processed |
| + | * HapMap3 polymorphic site prefix |
| + | ** Configuration File Setting: <code>HM3_PREFIX = path/hapmap3.qc.poly</code> |
| + | ** <code>path/</code> contains <code>hapmap3.qc.poly.chr20.bim</code> & <code>hapmap3.qc.poly.chr20.frq</code> for each chromosome being processed |
| + | |
| + | A set of reference files can be downloaded from: [[ftp://share.sph.umich.edu/1000genomes/umake-resources/ | FTP Download of Full Resource Files]] |
| + | |
| + | Configuration File Example Reference Settings: |
| + | REF = path/file.fa |
| + | INDEL_PREFIX = path/indels.sites.hg19 |
| + | DBSNP_PREFIX = path/dbsnp_135_b37.rod |
| + | HM3_PREFIX = path/hapmap3.qc.poly |
| + | |
| | | |
| | | |
Line 79: |
Line 98: |
| | | |
| | | |
− | ==== Reference Files ==== | + | ==== Configure Reference Files ==== |
− | * Reference Sequence in fasta format.
| + | See [[#Reference Files| Reference Files]] for information on how to specify the reference files. |
− | ** REF = path/file.fa
| |
− | * Indel VCF File Prefix
| |
− | ** INDEL_PREFIX = path/indels.sites.hg19
| |
− | ** path/ contains indels.sites.hg19.chr20.vcf for each chromosome being processed
| |
− | * DBSNP File Prefix
| |
− | ** DBSNP_PREFIX = path/dbsnp_135_b37.rod
| |
− | ** path/ contains dbsnp_135_b37.rod.chr20.map for each chromosome being processed
| |
− | * HapMap3 polymorphic site prefix
| |
− | ** HM3_PREFIX = path/hapmap3.qc.poly
| |
− | ** path/ contains hapmap3.qc.poly.chr20.bim & hapmap3.qc.poly.chr20.frq for each chromosome being processed
| |
− | | |
− | Can be downloaded from: [[ftp://share.sph.umich.edu/1000genomes/umake-resources/ | FTP Download of Full Resource Files]]
| |
− | | |
− | INDEL_PREFIX = $(UMAKE_ROOT)/ref/indels/1kg.pilot_release.merged.indels.sites.hg19 # 1000 Genomes Pilot 1 indel VCF prefix
| |
− | DBSNP_PREFIX = $(UMAKE_ROOT)/ref/dbSNP/dbsnp_135_b37.rod # dbSNP file prefix
| |
− | HM3_PREFIX = $(UMAKE_ROOT)/ref/HapMap3/hapmap3_r3_b37_fwd.consensus.qc.poly # HapMap3 polymorphic site prefix
| |
| | | |
| ==== Chromosome X Calling ==== | | ==== Chromosome X Calling ==== |