Changes

From Genome Analysis Wiki
Jump to navigationJump to search
Line 2: Line 2:  
Main Workshop wiki page: [[SeqShop: May 2015]]
 
Main Workshop wiki page: [[SeqShop: May 2015]]
   −
See the [[Media:Dec2014 SeqShop - GotCloud indel.pdf|introductory slides]] for an intro to this tutorial.
+
See the [[Media:May2015 SeqShop - GotCloud indel.pdf|introductory slides]] for an intro to this tutorial.
    
== Goals of This Session ==
 
== Goals of This Session ==
Line 10: Line 10:  
** How to evaluate the quality of INDEL calls
 
** How to evaluate the quality of INDEL calls
   −
== Setup in person at the SeqShop Workshop ==
+
== Setup ==
 
''This section is specifically for the SeqShop Workshop computers.''
 
''This section is specifically for the SeqShop Workshop computers.''
<div class="mw-collapsible mw-collapsed" style="width:600px">
+
<div class="mw-collapsible" style="width:600px">
 
''If you are not running during the SeqShop Workshop, please skip this section.''
 
''If you are not running during the SeqShop Workshop, please skip this section.''
 
<div class="mw-collapsible-content">
 
<div class="mw-collapsible-content">
   −
{{SeqShopLogin}}
+
{{SeqShopLoginA}}
 
  −
=== Setup your run environment===
  −
This is the same setup you did for the previous tutorial, but you need to redo it each time you log in.
  −
 
  −
This will setup some environment variables to point you to
  −
* [[GotCloud]] program
  −
* Tutorial input files
  −
* Setup an output directory
  −
** It will leave your output directory from the previous tutorial in tact.
  −
source /net/seqshop-server/home/mktrost/seqshop/setup.txt
  −
* You won't see any output after running <code>source</code>
  −
** It silently sets up your environment
  −
** If you want to view the detail of the setup, type
  −
less /net/seqshop-server/home/mktrost/seqshop/setup.txt
  −
and press 'q' to finish.
     −
<div class="mw-collapsible mw-collapsed" style="width:200px">
  −
View setup.txt
  −
<div class="mw-collapsible-content">
  −
[[File:setup.png|500px]]
   
</div>
 
</div>
 
</div>
 
</div>
</div>
  −
</div>
  −
  −
== Setup when running on your own outside of the SeqShop Workshop ==
  −
''This section is specifically for running on your own outside of the SeqShop Workshop.''
  −
<div class="mw-collapsible" style="width:600px">
  −
''If you are running during the SeqShop Workshop, please skip this section.''
  −
<div class="mw-collapsible-content">
      +
=== Prerequisite Tutorials ===
 
This tutorial builds on the alignment tutorial, if you have not already, please first run that tutorial: [[SeqShop:_Sequence_Mapping_and_Assembly_Practical, May 2015|Alignment Tutorial]]
 
This tutorial builds on the alignment tutorial, if you have not already, please first run that tutorial: [[SeqShop:_Sequence_Mapping_and_Assembly_Practical, May 2015|Alignment Tutorial]]
    
It also uses the bam.list file created in the SnpCall Tutorial.  If you have not yet run that tutorial, please follow the directions at: [[SeqShop:_Variant_Calling_and_Filtering_for_SNPs_Practical, May 2015#GotCloud_BAM_List_File|GotCloud BAM List File]]
 
It also uses the bam.list file created in the SnpCall Tutorial.  If you have not yet run that tutorial, please follow the directions at: [[SeqShop:_Variant_Calling_and_Filtering_for_SNPs_Practical, May 2015#GotCloud_BAM_List_File|GotCloud BAM List File]]
      
{{SeqShopRemoteEnv}}
 
{{SeqShopRemoteEnv}}
</div>
  −
</div>
      
== Examining GotCloud Indel Input files ==
 
== Examining GotCloud Indel Input files ==
Line 64: Line 35:     
== Running GotCloud Indel ==
 
== Running GotCloud Indel ==
  ${GC}/gotcloud indel --conf ${SS}/gotcloud.conf --numjobs 6 --region 22:36000000-37000000 --base_prefix ${SS} --outdir ${OUT}
+
  ${GC}/gotcloud indel --conf ${SS}/gotcloud.conf --numjobs 3 --region 22:36000000-37000000 --base_prefix ${SS} --outdir ${OUT}
 
* <code>${GC}/gotcloud</code> runs GotCloud
 
* <code>${GC}/gotcloud</code> runs GotCloud
 
* <code>indel</code> tells GotCloud you want to run the indel calling pipeline.
 
* <code>indel</code> tells GotCloud you want to run the indel calling pipeline.
Line 328: Line 299:     
It is usually useful to examine the call sets against known data sets for the passed variants.
 
It is usually useful to examine the call sets against known data sets for the passed variants.
<div class="mw-collapsible mw-collapsed" style="width:500px">
+
 
''Command to use at SeqShop Workshop:''
+
Currently indel.reference.txt needs absolute paths to the reference files, so update indel.reference.txt to point to the reference files.
<div class="mw-collapsible-content">
  −
${GC}/bin/vt profile_indels -g ${SS}/ref22/indel.reference.txt -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "PASS"
  −
</div>
  −
</div>
  −
<div class="mw-collapsible" style="width:500px">
  −
''Commands outside of SeqShop Workshop:''
  −
<div class="mw-collapsible-content">
  −
Outside the workshop, you need to update indel.reference.txt to point to the reference files.
   
  cp ${SS}/ref22/indel.reference.txt ${OUT}/indel.reference.txt
 
  cp ${SS}/ref22/indel.reference.txt ${OUT}/indel.reference.txt
   Line 346: Line 309:     
  ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt  -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "PASS"
 
  ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt  -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f "PASS"
</div>
  −
</div>
        Line 387: Line 348:  
We perform the same analysis for the failed variants again, the relatively low overlap with known data sets imply a reasonable tradeoff in sensitivity and specificity.
 
We perform the same analysis for the failed variants again, the relatively low overlap with known data sets imply a reasonable tradeoff in sensitivity and specificity.
   −
<div class="mw-collapsible mw-collapsed" style="width:500px">
  −
''Command to use at SeqShop Workshop:''
  −
<div class="mw-collapsible-content">
  −
  ${GC}/bin/vt profile_indels -g ${SS}/ref22/indel.reference.txt  -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f  "~PASS"
  −
</div>
  −
</div>
  −
<div class="mw-collapsible" style="width:500px">
  −
''Command outside of SeqShop Workshop:''
  −
<div class="mw-collapsible-content">
   
   ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt  -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f  "~PASS"
 
   ${GC}/bin/vt profile_indels -g ${OUT}/indel.reference.txt  -r ${SS}/ref22/human.g1k.v37.chr22.fa ${OUT}/indel/final/all.genotypes.vcf.gz -i 22:36000000-37000000 -f  "~PASS"
</div>
  −
</div>
        Line 562: Line 512:  
UMICH's algorithm for normalization has been adopted by Petr Danecek in bcftools and is also used in GKNO.
 
UMICH's algorithm for normalization has been adopted by Petr Danecek in bcftools and is also used in GKNO.
    +
 +
== FEEDBACK! ==
 +
Please provide feedback on today:
 +
https://docs.google.com/forms/d/1xLrrEJ3XeZjxR_e5Fo-OTKHQvUFQLgX8MYwLZtxroig/viewform
    
== Return to Workshop Wiki Page ==
 
== Return to Workshop Wiki Page ==
Return to main workshop wiki page: [[SeqShop: December 2014]]
+
Return to main workshop wiki page: [[SeqShop: May 2015]]

Navigation menu