Changes

From Genome Analysis Wiki
Jump to navigationJump to search
no edit summary
Line 3: Line 3:       −
== Overview of Alignment Pipeline Steps ==  
+
== List of Alignment Sub-Pipelines ==  
The Alignment/Mapping Pipeline takes [http://en.wikipedia.org/wiki/FASTQ_format FASTQ files] and generates recalibrated [[BAM|BAM (Binary Sequence Alignment/Map format) files]] from them.  
+
 
 +
===recab===
 +
 
 +
This sub-pipeline takes in a list of bam files for each sample, merges the BAMs for samples that have multiple BAMs, dedups and recalibrates, and then indexes the recalibrated BAM.
 +
 
 +
===recabQC===
 +
 
 +
This sub-pipeline does everything that *recab* does (takes in a list of bam files for each sample, merges the BAMs for samples that have multiple BAMs, dedups and recalibrates, and then indexes the recalibrated BAM). It then goes the next step to perform quality control (running qplot and verifyBamID).
 +
 
 +
===bamQC===
 +
 
 +
This sub-pipeline takes in a single, recalibrated BAM file and its index file (.bai) and performs quality control (running qplot and verifyBamID). It differs from *bamQC_createIndex* in that it requires that the user already have .bai files for the recalibrated BAM files.
 +
 +
===bamQC_createIndex===
 +
 
 +
This sub-pipeline takes in a single, recalibrated BAM file, creates an index file for it, and performs quality control (running qplot and verifyBamID). It differs from *bamQC* in that it does not require that the user already have a .bai file for the recalibrated BAM file.
 +
 
 +
== recab ==
 +
 
 +
 
 +
== recabQC ==
 +
== bamQC ==
 +
== bamQC_createIndex ==
   −
[[File:MappingSteps.png]]
     −
== Running the GotCloud Alignment Pipeline ==
     −
The alignment pipeline is run using the <code>align</code> option of the <code>gotcloud</code> script.  This option calls <code>align.pl</code> found in the <code>bin/</code> directory under the <code>gotcloud</code> installation.
     −
Use the <code>--conf</code> parameter followed by the configuration file to specify the configuration to use for this run of the alignment pipeline.
     −
You must specify the input list of FASTQs mapped to sample id to tell the alignment pipeline what files to process.  You can do this by setting either:
  −
* <code>FASTQ_LIST</code> in the configuration file
  −
* <code>--list</code> on the command-line
     −
You must specify an output directory to tell the alignment pipeline where to write its output by either setting:
  −
* <code>OUT_DIR</code> in the configuration file
  −
* <code>--outdir</code> on the command-line
     −
'''Example of a Basic Alignment Command'''
  −
gotcloud align --conf myAlignTest.conf --outdir ~/gotcloudOutput/align/
        −
===Running the Automated Test===
      
The automated test runs the alignment pipeline on a small set of test data and checks that the results against expected results validating that GotCloud is installed correctly.  
 
The automated test runs the alignment pipeline on a small set of test data and checks that the results against expected results validating that GotCloud is installed correctly.  
87

edits

Navigation menu