Line 2: |
Line 2: |
| Main Workshop wiki page: [[SeqShop: May 2015]] | | Main Workshop wiki page: [[SeqShop: May 2015]] |
| | | |
− | See the [[Media:Dec2014 SeqShop - GotCloud indel.pdf|introductory slides]] for an intro to this tutorial. | + | See the [[Media:May2015 SeqShop - GotCloud indel.pdf|introductory slides]] for an intro to this tutorial. |
| | | |
| == Goals of This Session == | | == Goals of This Session == |
Line 10: |
Line 10: |
| ** How to evaluate the quality of INDEL calls | | ** How to evaluate the quality of INDEL calls |
| | | |
− | == Setup in person at the SeqShop Workshop == | + | == Setup == |
| ''This section is specifically for the SeqShop Workshop computers.'' | | ''This section is specifically for the SeqShop Workshop computers.'' |
− | <div class="mw-collapsible mw-collapsed" style="width:600px"> | + | <div class="mw-collapsible" style="width:600px"> |
| ''If you are not running during the SeqShop Workshop, please skip this section.'' | | ''If you are not running during the SeqShop Workshop, please skip this section.'' |
| <div class="mw-collapsible-content"> | | <div class="mw-collapsible-content"> |
| | | |
− | {{SeqShopLogin}} | + | {{SeqShopLoginA}} |
− | | |
− | === Setup your run environment===
| |
− | This is the same setup you did for the previous tutorial, but you need to redo it each time you log in.
| |
− | | |
− | This will setup some environment variables to point you to
| |
− | * [[GotCloud]] program
| |
− | * Tutorial input files
| |
− | * Setup an output directory
| |
− | ** It will leave your output directory from the previous tutorial in tact.
| |
− | source /net/seqshop-server/home/mktrost/seqshop/setup.txt
| |
− | * You won't see any output after running <code>source</code>
| |
− | ** It silently sets up your environment
| |
− | ** If you want to view the detail of the setup, type
| |
− | less /net/seqshop-server/home/mktrost/seqshop/setup.txt
| |
− | and press 'q' to finish.
| |
| | | |
− | <div class="mw-collapsible mw-collapsed" style="width:200px">
| |
− | View setup.txt
| |
− | <div class="mw-collapsible-content">
| |
− | [[File:setup.png|500px]]
| |
| </div> | | </div> |
| </div> | | </div> |
− | </div>
| |
− | </div>
| |
− |
| |
− | == Setup when running on your own outside of the SeqShop Workshop ==
| |
− | ''This section is specifically for running on your own outside of the SeqShop Workshop.''
| |
− | <div class="mw-collapsible" style="width:600px">
| |
− | ''If you are running during the SeqShop Workshop, please skip this section.''
| |
− | <div class="mw-collapsible-content">
| |
| | | |
| + | === Prerequisite Tutorials === |
| This tutorial builds on the alignment tutorial, if you have not already, please first run that tutorial: [[SeqShop:_Sequence_Mapping_and_Assembly_Practical, May 2015|Alignment Tutorial]] | | This tutorial builds on the alignment tutorial, if you have not already, please first run that tutorial: [[SeqShop:_Sequence_Mapping_and_Assembly_Practical, May 2015|Alignment Tutorial]] |
| | | |
| It also uses the bam.list file created in the SnpCall Tutorial. If you have not yet run that tutorial, please follow the directions at: [[SeqShop:_Variant_Calling_and_Filtering_for_SNPs_Practical, May 2015#GotCloud_BAM_List_File|GotCloud BAM List File]] | | It also uses the bam.list file created in the SnpCall Tutorial. If you have not yet run that tutorial, please follow the directions at: [[SeqShop:_Variant_Calling_and_Filtering_for_SNPs_Practical, May 2015#GotCloud_BAM_List_File|GotCloud BAM List File]] |
− |
| |
| | | |
| {{SeqShopRemoteEnv}} | | {{SeqShopRemoteEnv}} |
− | </div>
| |
− | </div>
| |
| | | |
| == Examining GotCloud Indel Input files == | | == Examining GotCloud Indel Input files == |
Line 64: |
Line 35: |
| | | |
| == Running GotCloud Indel == | | == Running GotCloud Indel == |
− | ${GC}/gotcloud indel --conf ${SS}/gotcloud.conf --numjobs 6 --region 22:36000000-37000000 --base_prefix ${SS} --outdir ${OUT} | + | ${GC}/gotcloud indel --conf ${SS}/gotcloud.conf --numjobs 3 --region 22:36000000-37000000 --base_prefix ${SS} --outdir ${OUT} |
| * <code>${GC}/gotcloud</code> runs GotCloud | | * <code>${GC}/gotcloud</code> runs GotCloud |
| * <code>indel</code> tells GotCloud you want to run the indel calling pipeline. | | * <code>indel</code> tells GotCloud you want to run the indel calling pipeline. |
Line 328: |
Line 299: |
| | | |
| It is usually useful to examine the call sets against known data sets for the passed variants. | | It is usually useful to examine the call sets against known data sets for the passed variants. |
− | <div class="mw-collapsible mw-collapsed" style="width:500px">
| + | |
− | ''Command to use at SeqShop Workshop:''
| + | Currently indel.reference.txt needs absolute paths to the reference files, so update indel.reference.txt to point to the reference files. |
− | <div class="mw-collapsible-content">
| |
− | ${GC}/bin/vt profile_indels -g ${SS}/ref22/indel.reference.txt -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "PASS"
| |
− | </div>
| |
− | </div>
| |
− | <div class="mw-collapsible" style="width:500px">
| |
− | ''Commands outside of SeqShop Workshop:''
| |
− | <div class="mw-collapsible-content">
| |
− | Outside the workshop, you need to update indel.reference.txt to point to the reference files.
| |
| cp ${SS}/ref22/indel.reference.txt ${OUT}/indel.reference.txt | | cp ${SS}/ref22/indel.reference.txt ${OUT}/indel.reference.txt |
| | | |
Line 346: |
Line 309: |
| | | |
| ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "PASS" | | ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "PASS" |
− | </div>
| |
− | </div>
| |
| | | |
| | | |
Line 387: |
Line 348: |
| We perform the same analysis for the failed variants again, the relatively low overlap with known data sets imply a reasonable tradeoff in sensitivity and specificity. | | We perform the same analysis for the failed variants again, the relatively low overlap with known data sets imply a reasonable tradeoff in sensitivity and specificity. |
| | | |
− | <div class="mw-collapsible mw-collapsed" style="width:500px">
| |
− | ''Command to use at SeqShop Workshop:''
| |
− | <div class="mw-collapsible-content">
| |
− | ${GC}/bin/vt profile_indels -g ${SS}/ref22/indel.reference.txt -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "~PASS"
| |
− | </div>
| |
− | </div>
| |
− | <div class="mw-collapsible" style="width:500px">
| |
− | ''Command outside of SeqShop Workshop:''
| |
− | <div class="mw-collapsible-content">
| |
| ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "~PASS" | | ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "~PASS" |
− | </div>
| |
− | </div>
| |
| | | |
| | | |
Line 562: |
Line 512: |
| UMICH's algorithm for normalization has been adopted by Petr Danecek in bcftools and is also used in GKNO. | | UMICH's algorithm for normalization has been adopted by Petr Danecek in bcftools and is also used in GKNO. |
| | | |
| + | |
| + | == FEEDBACK! == |
| + | Please provide feedback on today: |
| + | https://docs.google.com/forms/d/1xLrrEJ3XeZjxR_e5Fo-OTKHQvUFQLgX8MYwLZtxroig/viewform |
| | | |
| == Return to Workshop Wiki Page == | | == Return to Workshop Wiki Page == |
− | Return to main workshop wiki page: [[SeqShop: December 2014]] | + | Return to main workshop wiki page: [[SeqShop: May 2015]] |