Changes

From Genome Analysis Wiki
Line 248:
To discover large deletions from the 62 BAMs we are using for this workshop, you can run the following command:
  time perl ${SS}/svtoolkit/bin/genomestrip.pl -run-discovery --metadata ${SS}/svtoolkit/metadata --out ${OUT}/sv --conf ${SS}/gotcloud.conf --region 22:36000000-37000000 --numjobs 2 --gs-dir ${SS}/svtoolkit --index ${OUT}/bam.index --base-prefix ${SS}
* <code>${SS}/svtoolkit/bin/genomestrip.pl -run-discovery</code> runs the GenomeSTRiP Discovery Pipeline.
* <code>--metadata ${SS}/svtoolkit/metadata</code> points to the pre-made metadata file, as explained in the previous section, [[#Running GotCloud/GenomeSTRiP Metadata Pipeline|Running GotCloud/GenomeSTRiP Metadata Pipeline]].
* <code>--out ${OUT}/sv</code> tells the pipeline where to put its output (in an <code>sv</code> sub-directory of <code>${OUT}</code>).
* <code>--conf ${SS}/gotcloud.conf</code> points to the configuration file to use.
** The configuration for this test was downloaded with the seqshop input files.
* <code>--region 22:36000000-37000000</code> restricts the run to this region.
** The sample files cover only a small region of chromosome 22, so to save time, we tell the pipeline to ignore the other regions.
* <code>--numjobs</code> tells GotCloud how many jobs to run in parallel.
** The right value depends on your system.
* <code>--gs-dir ${SS}/svtoolkit</code>
* <code>--base-prefix</code> tells GotCloud the prefix to prepend to relative paths.
** The configuration file cannot read environment variables, so we need to tell GotCloud the path to the input files, <code>${SS}</code>.
** Alternatively, gotcloud.conf could be updated to specify the full paths.
* <code>--out</code> tells GotCloud where to write the output.
** This could be specified in gotcloud.conf, but it is given on the command line so that you can change the output location through <code>${OUT}</code>.
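Because the configuration file cannot expand environment variables, it can help to confirm that <code>${SS}</code> and <code>${OUT}</code> resolve to real files before starting a long run. The following is a minimal sketch and not part of GotCloud or GenomeSTRiP: the helper name <code>check_inputs</code> is hypothetical, and the paths it checks are simply those that appear in the discovery command.

```shell
#!/bin/sh
# Hypothetical pre-flight helper (not part of GotCloud or GenomeSTRiP).
# Reports every path that does not exist and returns non-zero if any is missing.
check_inputs() {
    missing=0
    for f in "$@"; do
        if [ ! -e "$f" ]; then
            echo "missing: $f" >&2
            missing=1
        fi
    done
    return "$missing"
}

# Only run the check when both workshop variables are set.
# The paths below are copied from the discovery command in this section.
if [ -n "${SS:-}" ] && [ -n "${OUT:-}" ]; then
    check_inputs "${SS}/svtoolkit/bin/genomestrip.pl" \
                 "${SS}/svtoolkit/metadata" \
                 "${SS}/gotcloud.conf" \
                 "${OUT}/bam.index" \
        || echo "Fix the paths listed above before running genomestrip.pl" >&2
fi
```

If the check reports a missing path, the likely cause is that <code>${SS}</code> or <code>${OUT}</code> points somewhere other than where the workshop files were unpacked.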
    
  time perl ${GC}/bin/genomestrip.pl -run-discovery --metadata ${SS}/svtoolkit/metadata --out ${OUT}/sv --conf ${SS}/gotcloud.conf --region 22:36000000-37000000 --numjobs 2 --gs-dir ${SS}/svtoolkit
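Two values in these commands are worth a quick check before a run: the region string and the job count. The sketch below splits a <code>CHROM:START-END</code> region with plain shell parameter expansion and picks a job count no larger than the number of online CPUs. Capping at the CPU count is an assumption made here as a conservative default, not a GotCloud rule, and the variable names are illustrative.

```shell
#!/bin/sh
# Split a region string of the form CHROM:START-END.
region="22:36000000-37000000"
chrom=${region%%:*}      # text before the colon
range=${region#*:}       # text after the colon
start=${range%-*}        # text before the dash
end=${range#*-}          # text after the dash
span=$((end - start))    # distance between the two coordinates
echo "chromosome ${chrom}: ${start} to ${end} (${span} bp apart)"

# Pick a job count: the requested value, capped at the number of online CPUs.
requested=2
cpus=$(getconf _NPROCESSORS_ONLN 2>/dev/null || echo 1)
case "$cpus" in
    ''|*[!0-9]*) cpus=1 ;;   # fall back to 1 if detection gave no usable number
esac
if [ "$requested" -gt "$cpus" ]; then
    numjobs=$cpus
else
    numjobs=$requested
fi
echo "would pass --numjobs ${numjobs}"
```

For the region used in this workshop the two coordinates are 1,000,000 bp apart, which is why restricting the run to it saves so much time compared with a whole chromosome.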
