Changes

From Genome Analysis Wiki
Jump to navigationJump to search
500 bytes added ,  16:24, 12 November 2012
no edit summary
Line 41: Line 41:     
* GotCloud software and reference files
 
* GotCloud software and reference files
 +
 
* Your sequence data
 
* Your sequence data
 
* Output of the aligner
 
* Output of the aligner
 
* Output of umake
 
* Output of umake
   −
The first of these is a small volume based on a snapshot containing the GotCloud files you will need.
+
The '''first of these''' is a small volume based on a snapshot containing the GotCloud files you will need.
 
We provide an AWS snapshot of a small volume which contains the aligner and umake software and reference files.
 
We provide an AWS snapshot of a small volume which contains the aligner and umake software and reference files.
 
Create an EBS volume based on our snapshot and then mount that volume on your instance.
 
Create an EBS volume based on our snapshot and then mount that volume on your instance.
 +
In the EC2 Management Console under ELASTIC BLOCK STORE, select Volumes -> Create Volume.
 +
In the prompt supply the size and Snapshot (based on the table below).
 +
You may take the defaults for the Volume Type and IOPS.
   −
    
+
<code>
 +
  Availability Zone   Snapshot        Size
 +
      us-west-2a      snap-1a13913c    40GB
 +
</code>
    +
and create the volume.  This will create a device which you need to mount in your instance.
 +
This will create a device like /dev/sdf, which unfortunately actually translates to
 +
the device /dev/xvdf in your Linux instance. Once the volume is ready, mount it
 +
by logging into your instance with ssh and issuing the command:
    +
<code>
 +
  sudo mkdir -p /gotcloud
 +
  sudo mount /dev/xvdf  /gotcloud    # or whatever device yours is
 +
  df -h
 +
</code>
 +
 +
This will make the GotCloud software available under the path /gotcloud/bin etc.
 +
Each time your instance is started, you'll need to mount this volume.
 +
You may want to create a small shell script to mount the device.
 +
 +
The next storage volumes will varying based on what you data you have.
 +
The sequence data might already be in some volume, provided someone else.
 +
You'll have to mount volume too.
 +
 +
You should expect the three data volumes will all need to be the same size. That is, if your sequence data is 300GB, then you'll need an additional 300GB for the aligner output and then another 300GB of storage for the umake output. We suggest you consider making each set of data be separate volumes.
   −
You should expect these will all require the same amount of storage. That is, if your sequence data is 300GB, then you'll need an additional 300GB for the aligner output and then another 300GB of storage for the umake output.
  −
We suggest you consider making each of these storage volume be separate volumes.
   
You may also find that your sequence data is too large to be easily handled in one go,
 
You may also find that your sequence data is too large to be easily handled in one go,
 
so you might choose to only use the aligner/umake on part of your sequence data, capture the files
 
so you might choose to only use the aligner/umake on part of your sequence data, capture the files
 
of interest from umake, and then go back and rerun the software with the next bit of sequence data.
 
of interest from umake, and then go back and rerun the software with the next bit of sequence data.
  −
  −
  −
'''Prepare Your Instance'''
  −
  −
If you launched some other instance than the one prepared for our software, you will need to install
  −
the Pipeline software. This is quite simple - see [[Pipeline Debian Package|debian package]] or
  −
[[Pipeline RedHatPackage|red hat package]].
  −
This should only take 15 minutes.
  −
  −
The last step is to organize your storage so you have enough space for the input sequence data
  −
and the output of the aligner and umake steps.
  −
This is described in more detail in [[Amazon Storage|Amazon Storage]].
  −
If you are not using AWS, the process will be similar to that described above,
  −
but the details will vary based on your environment.
 
283

edits

Navigation menu