Line 32: |
Line 32: |
| (You can use your favorite editor instead of nedit. I typically use emacs, but nedit is more like Windows.) | | (You can use your favorite editor instead of nedit. I typically use emacs, but nedit is more like Windows.) |
| | | |
− | == Examining Raw Sequence Reads == | + | == Examining GotCloud Align Input Files == |
| + | === Examining Raw Sequence Reads : FASTQs === |
| FASTQ : standard file format provided to you by those who did the sequencing. | | FASTQ : standard file format provided to you by those who did the sequencing. |
| : For more information on the FASTQ format, see: http://en.wikipedia.org/wiki/FASTQ_format | | : For more information on the FASTQ format, see: http://en.wikipedia.org/wiki/FASTQ_format |
Line 95: |
Line 96: |
| </ul> | | </ul> |
| | | |
− | == GotCloud Alignment Pipeline==
| |
− |
| |
− | [[File:AlignDiagram.png|500px]]
| |
− |
| |
− | === Why GotCloud?===
| |
− | * Easy to learn & run
| |
− | ** All-in-one sequence analysis pipeline
| |
− | ** You don’t have to know the details of individual component
| |
− | * Robust parallelization
| |
− | ** Automatic partition of multi-sample jobs
| |
− | ** Reliable and fault-tolerant parallelization via GNU make
| |
− | *** Restart from where it stopped upon unexpected crash
| |
− | * Cloud & Cluster-friendly
| |
− | ** Supports multiple clusters such as MOSIX, Slurm, & SGE
| |
− | ** Amazon instances allow running large-scale jobs without having your own cluster
| |
− |
| |
− | Sequence Processing Recommendations
| |
− | * Be consistent within a project
| |
− | ** Process all samples with same pipeline
| |
− | *** Batch effect may arise if different pipelines are used due to pipeline differences
| |
− | ** Use the same configuration within a project
| |
− |
| |
− | == Examining GotCloud Align Input Files ==
| |
− | === Sequence Data Files : FASTQs ===
| |
− | We already looked at those in: [[#Examining Raw Sequence Reads|Examining Raw Sequence Reads]]
| |
| | | |
| === Reference Files === | | === Reference Files === |
Line 162: |
Line 138: |
| </ul> | | </ul> |
| </ul> | | </ul> |
| + | |
| | | |
| === GotCloud FASTQ Index File === | | === GotCloud FASTQ Index File === |
Line 208: |
Line 185: |
| | | |
| The command-line setting takes precedence over the configuration file setting. | | The command-line setting takes precedence over the configuration file setting. |
| + | |
| | | |
| === GotCloud Configuration File === | | === GotCloud Configuration File === |
Line 236: |
Line 214: |
| </ul> | | </ul> |
| </ul> | | </ul> |
| + | |
| | | |
| == Run GotCloud Align == | | == Run GotCloud Align == |
| + | |
| + | [[File:AlignDiagram.png|500px]] |
| + | |
| Now that we have all of our input files, we need just a simple command to run them | | Now that we have all of our input files, we need just a simple command to run them |
| ${GC}/gotcloud align --conf ${IN}/gotcloud.conf --numcs 2 | | ${GC}/gotcloud align --conf ${IN}/gotcloud.conf --numcs 2 |