Changes

From Genome Analysis Wiki
Jump to navigationJump to search
Line 104: Line 104:     
= Using Your own Reference Files =
 
= Using Your own Reference Files =
  −
== Human Reference ==
  −
  −
=== Generating BWA Reference Files ===
  −
Use "bwa index" to generate the human reference files with the required extensions:
  −
* .amb
  −
* .ann
  −
* .bwt
  −
* .pac
  −
* .sa
  −
  −
See http://bio-bwa.sourceforge.net/bwa.shtml for more information about using "bwa index".
  −
  −
=== Generating Reference Index Files ===
  −
Use "samtools faidx" to generate the human reference files with the required extensions:
  −
* .fai
  −
  −
=== Generating GC Content File ===
  −
The GC Content file is used by QPLOT.  It is assumed to be at the same location as the reference file.
  −
  −
If the reference file is at path/ref.fa, the GC Content file is expected to be:path/ref.winsize100.gc
  −
  −
  −
To generate the GC content file, run qplot:
  −
GOTCLOUD_DIR/bin/qplot --reference reference.fa --winsize windowSize
  −
* Replace reference.fa with the name of your human reference fasta file.
  −
* Replace windowSize with your desired window size, or leave out --winsize to use the default (100).
  −
  −
NOTE: You will get an error at the end of qplot that says:
  −
<pre>
  −
FATAL ERROR -
  −
No SAM/BAM files provided, stopped!
  −
</pre>
  −
This error is due to using qplot to just generate a GC Content file and not also process a BAM file.
  −
  −
But it was successful as long as you see (where reference is the name of your reference file):
  −
<pre>
  −
GC content file [ reference.winsize100.gc ] created.
  −
</pre>
  −
  −
  −
See [[QPLOT#Input_files|QPLOT: InputFiles]] for more information.
 

Navigation menu