From Genome Analysis Wiki
1,398 bytes removed, 12:49, 23 October 2014
Line 104: |
Line 104: |
| | | |
| = Using Your own Reference Files = | | = Using Your own Reference Files = |
− |
| |
− | == Human Reference ==
| |
− |
| |
− | === Generating BWA Reference Files ===
| |
− | Run "bwa index" on the human reference FASTA file to generate index files with the required extensions:
| |
− | * .amb
| |
− | * .ann
| |
− | * .bwt
| |
− | * .pac
| |
− | * .sa
| |
− |
| |
− | See http://bio-bwa.sourceforge.net/bwa.shtml for more information about using "bwa index".
| |
− |
| |
− | === Generating Reference Index Files ===
| |
− | Run "samtools faidx" on the human reference FASTA file to generate the index file with the required extension:
| |
− | * .fai
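
The two indexing steps above can be sketched as a small shell helper that reports which index files are still missing. This is only an illustration: the function name missing_index_files and the file name ref.fa are hypothetical, and it assumes the index files sit next to the reference as ref.fa.amb, ref.fa.fai, and so on, which is the default naming used by "bwa index" and "samtools faidx".

```shell
# Hypothetical helper: print every expected index file that does not exist yet.
# Assumes indexes are named <reference>.<extension> next to the reference.
missing_index_files() {
    ref=$1
    for ext in amb ann bwt pac sa fai; do
        [ -e "$ref.$ext" ] || printf '%s\n' "$ref.$ext"
    done
}

# Typical use (commented out so the sketch runs without bwa/samtools installed):
#   bwa index ref.fa        # writes ref.fa.amb, .ann, .bwt, .pac, .sa
#   samtools faidx ref.fa   # writes ref.fa.fai
#   missing_index_files ref.fa
```

If the helper prints nothing, all six files are present.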
| |
− |
| |
− | === Generating GC Content File ===
| |
− | The GC Content file is used by QPLOT. It is assumed to be at the same location as the reference file.
| |
− |
| |
− | If the reference file is at path/ref.fa, the GC Content file is expected to be: path/ref.winsize100.gc
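
As a rough sketch of the naming rule above, the helper below derives the expected GC Content file path from a reference path. The function name gc_content_path is hypothetical. It assumes that the final extension of the reference (such as .fa) is dropped and that window sizes other than 100 follow the same winsizeN pattern; the text above only confirms the winsize100 case.

```shell
# Hypothetical helper: derive the expected GC Content file path.
# $1 = reference path, $2 = window size (defaults to 100).
# Assumption: only the last extension of the reference is stripped.
gc_content_path() {
    ref=$1
    winsize=${2:-100}
    printf '%s.winsize%s.gc\n' "${ref%.*}" "$winsize"
}

gc_content_path path/ref.fa
```

For path/ref.fa this prints path/ref.winsize100.gc, matching the example above.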
| |
− |
| |
− |
| |
− | To generate the GC content file, run qplot:
| |
− | GOTCLOUD_DIR/bin/qplot --reference reference.fa --winsize windowSize
| |
− | * Replace reference.fa with the name of your human reference fasta file.
| |
− | * Replace windowSize with your desired window size, or leave out --winsize to use the default (100).
| |
− |
| |
− | NOTE: qplot will finish with an error that says:
| |
− | <pre>
| |
− | FATAL ERROR -
| |
− | No SAM/BAM files provided, stopped!
| |
− | </pre>
| |
− | This error occurs because qplot was run only to generate a GC Content file, without a BAM file to process.
| |
− |
| |
− | The run was still successful as long as you see the following message (where reference is the name of your reference file):
| |
− | <pre>
| |
− | GC content file [ reference.winsize100.gc ] created.
| |
− | </pre>
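
Because qplot ends with a fatal error even when the GC Content file is written, a script cannot rely on its exit status alone. One possible workaround, sketched below, is to look for the success message in the captured output. The function name gc_file_created is hypothetical, and treating GOTCLOUD_DIR as an environment variable and merging stderr into stdout are assumptions, since it is not stated which stream qplot writes the message to.

```shell
# Hypothetical helper: read qplot output on stdin and succeed only if the
# "GC content file [ ... ] created." message appears in it.
gc_file_created() {
    grep -q 'GC content file \[ .* ] created\.'
}

# Typical use (commented out so the sketch runs without qplot installed):
#   "$GOTCLOUD_DIR/bin/qplot" --reference reference.fa 2>&1 | gc_file_created \
#       && echo "GC Content file generated"
```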
| |
− |
| |
− |
| |
− | See [[QPLOT#Input_files|QPLOT: InputFiles]] for more information.
| |