Changes

From Genome Analysis Wiki
Jump to navigationJump to search
256 bytes added ,  08:35, 29 October 2012
no edit summary
Line 15: Line 15:  
* '''What Instance''' to launch. You have several choices
 
* '''What Instance''' to launch. You have several choices
 
** ami-be59d78e which is an instance we have prepared based on ''Ubuntu Server 12.04.1 LTS''. It has all of our software installed.
 
** ami-be59d78e which is an instance we have prepared based on ''Ubuntu Server 12.04.1 LTS''. It has all of our software installed.
** Some other instance. The instance must run 64 bit software and is either Ubuntu of any version or Redhat/CentOS 6.3. You will also need to install the Pipeline software which will require about 15 minutes.
+
** Some other instance. The instance must run 64 bit software and is either Ubuntu of any version or Redhat/CentOS 6.3. You will also need to install the Pipeline software.
    
* '''Instance size'''  (memory and number of processors). The pipeline software will require at least 8GB of memory (type m1.large) and can use as many processors as is available.
 
* '''Instance size'''  (memory and number of processors). The pipeline software will require at least 8GB of memory (type m1.large) and can use as many processors as is available.
Line 25: Line 25:     
If you launched some other instance than the one prepared for our software, you will need to install
 
If you launched some other instance than the one prepared for our software, you will need to install
the Pipeline software. This is quite simple - see  
+
the Pipeline software. This is quite simple - see [[Pipeline Debian Package|debian package]] or
 +
[[Pipeline RedHatPackage|red hat package]].
 +
This should only take 15 minutes.
    +
Setting up your storage is perhaps the most difficult step as it is controlled completely by the size of your data.
 +
As a generate rule you will need three times the space required for your sequence data.
 +
For instance in the 1000 Genomes data, the data for one individual takes about 45G.
 +
If you have 1000 Genome data for nine individuals, you'll need about 1500GB of space (9x450x3 plus a little extra space).
 +
 +
Making your data available for the Pipeline can be accomplished in many way.
 +
Here is a simple straightforward organization you might want to use.
   −
is perhaps the most difficult detail as it is controlled completely by the size of your data. As a generate rule you will need three times the space required for your sequence data. For instance in the 1000 Genomes data, the data for one individual takes about 45G. If you have 1000 Genome data for nine individuals, you'll need about 1500GB of space (9x450x3 plus a little extra space).
       
283

edits

Navigation menu