Amazon Snapshot

From Genome Analysis Wiki
Revision as of 15:54, 12 November 2012 by Terry Gliedt (talk | contribs)
Jump to navigationJump to search

Back to the beginning [1]

GotCloud is made available in various forms. In Amazon Web Services the software is made available in a EBS (Elastic Block Store) Snapshot. This is simple a copy of a data volume we have created that has our software and, additionally, some reference files you will find useful. You simply need to create your own EBS volume from our snapshot, mount your new volume and you are ready.

If this does not work or is unacceptable for some reason, you may install the software and reference files wherever you'd like (read about this in from a Debian package.

Your first task is get an AWS account and keys so that you can use the AWS EC2 Console Dashboard (see https://console.aws.amazon.com/ec2/). From here you can launch instances prepared by others or create your own. We cannot assist in this step - Amazon has plenty of documentation. Once you are at the AWS EC2 Console Dashboard, you're almost ready for GotCloud.


Launch Your First Instance

You'll need to know some details when launching an instance:

  • Launch an Instance to launch. Any instance running 64 bit software and

either an Ubuntu of any version or Redhat/CentOS 6.3 distribution will work.

  • Instance size (memory and number of processors). The pipeline software will require at least 4GB of memory (type m1.medium) and can use as many processors as is available.
  • Storage for the instance refers to the size for root (/) partition. This can be quite small, as little as 8GB should work. Of course if you intend to bring lots of other files/programs to the instance, you may want to increase this to something a bit larger (e.g. 30GB).


Prepare Your Instance

If you launched some other instance than the one prepared for our software, you will need to install the Pipeline software. This is quite simple - see debian package or red hat package. This should only take 15 minutes.

The last step is to organize your storage so you have enough space for the input sequence data and the output of the aligner and umake steps. This is described in more detail in Amazon Storage. If you are not using AWS, the process will be similar to that described above, but the details will vary based on your environment.