Changes

From Genome Analysis Wiki
Jump to navigationJump to search
Line 1: Line 1:  
== Introduction ==
 
== Introduction ==
See the [[Media:SeqShop - GotCloud Align.pdf|introductory slides]] for an intro to this tutorial.
+
Main Workshop wiki page: [[SeqShop: December 2014]]
 +
 
 +
See the [[Media:Dec2014 SeqShop - GotCloud Align.pdf|introductory slides]] for an intro to this tutorial.
    
== Goals of This Session ==
 
== Goals of This Session ==
Line 11: Line 13:  
== Setup in person at the SeqShop Workshop ==
 
== Setup in person at the SeqShop Workshop ==
 
''This section is specifically for the SeqShop Workshop computers.''
 
''This section is specifically for the SeqShop Workshop computers.''
<div class="mw-collapsible" style="width:600px">
+
<div class="mw-collapsible mw-collapsed" style="width:600px">
 
''If you are not running during the SeqShop Workshop, please skip this section.''
 
''If you are not running during the SeqShop Workshop, please skip this section.''
 
<div class="mw-collapsible-content">
 
<div class="mw-collapsible-content">
Line 24: Line 26:  
* Tutorial input files
 
* Tutorial input files
 
* Setup an output directory
 
* Setup an output directory
  source /home/mktrost/seqshop/setup.txt
+
  source /net/seqshop-server/home/mktrost/seqshop/setup.txt
 
* You won't see any output after running <code>source</code>
 
* You won't see any output after running <code>source</code>
 
** It silently sets up your environment
 
** It silently sets up your environment
 +
 +
Look at setup.txt
 +
cat /net/seqshop-server/home/mktrost/seqshop/setup.txt
 
<div class="mw-collapsible mw-collapsed" style="width:200px">
 
<div class="mw-collapsible mw-collapsed" style="width:200px">
View setup.txt
+
* setup.txt screenshot
 
<div class="mw-collapsible-content">
 
<div class="mw-collapsible-content">
 
[[File:setup.png|500px]]
 
[[File:setup.png|500px]]
Line 38: Line 43:  
== Setup when running on your own outside of the SeqShop Workshop ==
 
== Setup when running on your own outside of the SeqShop Workshop ==
 
''This section is specifically for running on your own outside of the SeqShop Workshop.''
 
''This section is specifically for running on your own outside of the SeqShop Workshop.''
<div class="mw-collapsible mw-collapsed" style="width:600px">
+
<div class="mw-collapsible" style="width:600px">
 
''If you are running during the SeqShop Workshop, please skip this section.''
 
''If you are running during the SeqShop Workshop, please skip this section.''
 
<div class="mw-collapsible-content">
 
<div class="mw-collapsible-content">
 +
 +
=== Download the example data ===
 +
Download and untar file containing the example data used in the practicals:
 +
mkdir -p ~/seqshop
 +
cd ~/seqshop
 +
wget http://csg.sph.umich.edu/mktrost/seqshopExampleDec2014.tar.gz
 +
tar xvf seqshopExampleDec2014.tar.gz
 +
 +
You will see the names of all the files included in the example data scrolling on the screen as they are unpacked from the tar file.
 +
 
=== Download & Build GotCloud ===
 
=== Download & Build GotCloud ===
 
If you do not already have GotCloud:
 
If you do not already have GotCloud:
* cd to where you want GotCloud installed (you can change this to any directory you want)
  −
mkdir -p ~/seqshop
  −
cd ~/seqshop/
   
* download, decompress, and build the version of gotcloud that was tested with this tutorial:
 
* download, decompress, and build the version of gotcloud that was tested with this tutorial:
  wget https://github.com/statgen/gotcloud/archive/gotcloud.workshop.tar.gz
+
  wget https://github.com/statgen/gotcloud/archive/gotcloud.1.15.tar.gz
  tar xvf gotcloud.workshop.tar.gz
+
  tar xvf gotcloud.1.15.tar.gz
  mv gotcloud-gotcloud.workshop gotcloud
+
  mv gotcloud-gotcloud.1.15 gotcloud
 
  cd gotcloud/src
 
  cd gotcloud/src
 
  make
 
  make
Line 55: Line 67:     
Remember the path to gotcloud/ that is what you will need to set your GC variable to.
 
Remember the path to gotcloud/ that is what you will need to set your GC variable to.
  −
=== Download the example data ===
  −
Download and untar file containing the example data used in the practicals:
  −
wget http://www.sph.umich.edu/csg/mktrost/seqshopExample.tar.gz
  −
tar xvf seqshopExample.tar.gz
  −
  −
You will see the names of all the files included in the example data scrolling on the screen as they are unpacked from the tar file.
      
{{SeqShopRemoteEnv}}
 
{{SeqShopRemoteEnv}}
 +
</div>
 +
</div>
    
== Examining [[GotCloud]] Align Input Files ==
 
== Examining [[GotCloud]] Align Input Files ==
Line 281: Line 288:     
Now that we have all of our input files, we need just a simple command to run them
 
Now that we have all of our input files, we need just a simple command to run them
  ${GC}/gotcloud align --conf ${SS}/gotcloud.conf --numjobs 2 --base_prefix ${SS} --outdir ${OUT}
+
* When running at home if you don't have 4 CPUs, reduce the <code>--numjobs</code> setting (it will take longer to run).
 +
  ${GC}/gotcloud align --conf ${SS}/gotcloud.conf --numjobs 4 --base_prefix ${SS} --outdir ${OUT}
    
* <code>${GC}/gotcloud</code> runs GotCloud
 
* <code>${GC}/gotcloud</code> runs GotCloud
Line 287: Line 295:  
* <code>--conf</code> tells GotCloud the name of the configuration file to use.
 
* <code>--conf</code> tells GotCloud the name of the configuration file to use.
 
** The configuration for this test was downloaded with the seqshop input files.
 
** The configuration for this test was downloaded with the seqshop input files.
* <code>--numjobs</code> means to run 2 samples at a time.
+
* <code>--numjobs</code> means to run 4 samples at a time.
 
** How many you can run concurrently depends on your system.
 
** How many you can run concurrently depends on your system.
 
* <code>--base_prefix</code> tells GotCloud the prefix to append to relative paths.
 
* <code>--base_prefix</code> tells GotCloud the prefix to append to relative paths.
 
** The Configuration file cannot read environment variables, so we need to tell GotCloud the path to the input files, ${SS}
 
** The Configuration file cannot read environment variables, so we need to tell GotCloud the path to the input files, ${SS}
 
** Alternatively, gotcloud.conf could be updated to specify the full paths
 
** Alternatively, gotcloud.conf could be updated to specify the full paths
* <code>--out_dir</code> tells GotCloud where to write the output.
+
* <code>--outdir</code> tells GotCloud where to write the output.
 
** This could be specified in gotcloud.conf, but to allow you to use the ${OUT} to change the output location, it is specified on the command-line
 
** This could be specified in gotcloud.conf, but to allow you to use the ${OUT} to change the output location, it is specified on the command-line
   −
[[File:gcalignStartNew.png|850px]]
+
[[File:gcalignStartNew.png|650px]]
   −
This should take 1-3 minutes to run.
+
This should take about 1 minute to run.
   −
It should end with a line like: <code>Processing finished in 47 secs with no errors reported</code>
+
It should end with a line like: <code>Processing finished in 54 secs with no errors reported</code>
 
* The <code>WARNING</code> messages are just to let you know that the default Read Group field settings are being used.
 
* The <code>WARNING</code> messages are just to let you know that the default Read Group field settings are being used.
   Line 482: Line 490:  
''If you are not running during the SeqShop Workshop, please skip this section.''
 
''If you are not running during the SeqShop Workshop, please skip this section.''
 
<div class="mw-collapsible-content">
 
<div class="mw-collapsible-content">
 +
To logout of seqshop1/2/3/4, type:
 +
exit
 
To logout of seqshop-server, type:
 
To logout of seqshop-server, type:
 
  exit
 
  exit
Line 489: Line 499:  
</div>
 
</div>
 
</div>
 
</div>
 +
 +
== Return to Workshop Wiki Page ==
 +
Return to main workshop wiki page: [[SeqShop: December 2014]]

Navigation menu