Changes

From Genome Analysis Wiki
Jump to navigationJump to search
no edit summary
Line 32: Line 32:  
(You can use your favorite editor instead of nedit. I typically use emacs, but nedit is more like Windows.)
 
(You can use your favorite editor instead of nedit. I typically use emacs, but nedit is more like Windows.)
   −
== Examining Raw Sequence Reads ==
+
== Examining GotCloud Align Input Files ==
 +
=== Examining Raw Sequence Reads : FASTQs ===
 
FASTQ : standard file format provided to you by those who did the sequencing.
 
FASTQ : standard file format provided to you by those who did the sequencing.
 
: For more information on the FASTQ format, see: http://en.wikipedia.org/wiki/FASTQ_format
 
: For more information on the FASTQ format, see: http://en.wikipedia.org/wiki/FASTQ_format
Line 95: Line 96:  
</ul>
 
</ul>
   −
== GotCloud Alignment Pipeline==
  −
  −
[[File:AlignDiagram.png|500px]]
  −
  −
=== Why GotCloud?===
  −
* Easy to learn & run
  −
** All-in-one sequence analysis pipeline
  −
** You don’t have to know the details of individual component
  −
* Robust parallelization
  −
** Automatic partition of multi-sample jobs
  −
** Reliable and fault-tolerant parallelization via GNU make
  −
*** Restart from where it stopped upon unexpected crash
  −
* Cloud & Cluster-friendly
  −
** Supports multiple clusters such as MOSIX, Slurm, & SGE
  −
** Amazon instances allow running large-scale jobs without having your own cluster
  −
  −
Sequence Processing Recommendations
  −
* Be consistent within a project
  −
** Process all samples with same pipeline
  −
*** Batch effect may arise if different pipelines are used due to pipeline differences
  −
** Use the same configuration within a project
  −
  −
== Examining GotCloud Align Input Files ==
  −
=== Sequence Data Files : FASTQs ===
  −
We already looked at those in: [[#Examining Raw Sequence Reads|Examining Raw Sequence Reads]]
      
=== Reference Files ===
 
=== Reference Files ===
Line 162: Line 138:  
</ul>
 
</ul>
 
</ul>
 
</ul>
 +
    
=== GotCloud FASTQ Index File ===
 
=== GotCloud FASTQ Index File ===
Line 208: Line 185:     
The command-line setting takes precedence over the configuration file setting.
 
The command-line setting takes precedence over the configuration file setting.
 +
    
=== GotCloud Configuration File ===
 
=== GotCloud Configuration File ===
Line 236: Line 214:  
</ul>
 
</ul>
 
</ul>
 
</ul>
 +
    
== Run GotCloud Align ==
 
== Run GotCloud Align ==
 +
 +
[[File:AlignDiagram.png|500px]]
 +
 
Now that we have all of our input files, we need just a simple command to run them
 
Now that we have all of our input files, we need just a simple command to run them
 
  ${GC}/gotcloud align --conf ${IN}/gotcloud.conf --numcs 2
 
  ${GC}/gotcloud align --conf ${IN}/gotcloud.conf --numcs 2

Navigation menu