Changes

From Genome Analysis Wiki
Jump to navigationJump to search
Line 102: Line 102:  
</div>
 
</div>
   −
<div class="mw-collapsible" style="width:500px">
+
<div class="mw-collapsible mw-collapsed" style="width:500px">
    
== Thursday ==
 
== Thursday ==
Line 163: Line 163:  
</div>
 
</div>
   −
<div class="mw-collapsible mw-collapsed" style="width:1000px">
+
<div class="mw-collapsible" style="width:1000px">
    
== Friday ==
 
== Friday ==
<div class="mw-collapsible-content mw-collapsed">
+
<div class="mw-collapsible-content">
      Line 172: Line 172:  
[[SeqShop: Ancestry On Your Own Genome, May 2015]]
 
[[SeqShop: Ancestry On Your Own Genome, May 2015]]
   −
=== Return to SeqShop: Ancestry On Your Own Genome, May 2015 ===
+
=== Setup Variables ===
Return to [[SeqShop:_Ancestry_On_Your_Own_Genome,_May_2015#Checking_if_Pileup_finished]]
+
Set these values.  Also, be sure to specify your sample name instead of SampleXX
 +
export SAMPLE=SampleXX
 +
or
 +
export SAMPLE=NA12878
 +
 
 +
Point to your GotCloud & your output directory:
 +
export GC=~/seqshop/gotcloud
 +
export OUT=~/$SAMPLE/output
    
=== Reviewing Indel Results ===
 
=== Reviewing Indel Results ===
Set these values.  Also, be sure to specify your sample name (or NA12878) instead of SampleXX
  −
export SAMPLE=SampleXX
  −
source /net/seqshop-server/home/mktrost/seqshop/setupSS.txt
  −
   
Look in the output directory
 
Look in the output directory
 
  ls ~/$SAMPLE/output
 
  ls ~/$SAMPLE/output
Line 244: Line 247:  
The insertion deletion ratio increases from 0.91 to 0.92.   
 
The insertion deletion ratio increases from 0.91 to 0.92.   
    +
=== Return to SeqShop: Ancestry On Your Own Genome, May 2015 ===
 +
Return to [[SeqShop:_Ancestry_On_Your_Own_Genome,_May_2015#Checking_if_Pileup_finished]]
    
=== Friday: Reviewing SNPCALL Results ===
 
=== Friday: Reviewing SNPCALL Results ===
Line 253: Line 258:     
=== Friday : More SNP Analysis ===
 
=== Friday : More SNP Analysis ===
 +
In addition, set another environmental variable for locating the binaries for custom analysis
   −
==== Environmental Variables ====
+
export HK=/net/seqshop-server/home/hmkang/apigenome/bin
 
+
  export EPACTS=/net/seqshop-server/home/mktrost/seqshop/epacts/
If you didn't set the environmental variable, you can set it again
+
  export REF=/net/seqshop-server/home/mktrost/seqshop/singleSample/ref/gotcloud.ref
 
+
export GC=~/seqshop/gotcloud
  source /net/seqshop-server/home/mktrost/seqshop/setup.txt
  −
  export SAMPLE=SampleXX (MAKE SURE TO CHANGE XX to your number or use NA12878 instead)
  −
source /net/seqshop-server/home/mktrost/seqshop/setupSS.txt
     −
In addition, set another environmental variable for locating the binaries for custom analysis
     −
  export HK=/home/hmkang/apigenome/bin/
+
  export SAMPLE=SampleXX
 +
export OUT=~/$SAMPLE/output
    
==== Annotation / Lookup against dbSNP ====
 
==== Annotation / Lookup against dbSNP ====
Line 270: Line 273:  
If you want to add rsIDs to your variant files, you can do this by running the following command
 
If you want to add rsIDs to your variant files, you can do this by running the following command
   −
  $HK/vcf-add-rsid -vcf $OUT/vcfs/chr1/chr1.filtered.vcf.gz --db $HK/../data/dbSNP.b138/dbsnp_138.b37.vcf.gz --out $OUT/vcfs/chr1/chr1.filtered.rsid.vcf.gz
+
  $HK/vcf-add-rsid -vcf $OUT/vcfs/chr1/chr1.filtered.vcf.gz --db $HK/../data/dbsnp_142.b37.vcf.gz --out $OUT/vcfs/chr1/chr1.filtered.rsid.vcf.gz
 
   
 
   
 
If you want to run this command across all chromosomes in parallel, you can use the special script run-command-wgs
 
If you want to run this command across all chromosomes in parallel, you can use the special script run-command-wgs
   −
  $HK/run-command-wgs --cmd "$HK/vcf-add-rsid -vcf $OUT/vcfs/chr1/chr1.filtered.vcf.gz --db $HK/../data/dbSNP.b138/dbsnp_138.b37.vcf.gz --out $OUT/vcfs/chr1/chr1.filtered.rsid.vcf.gz" --numjobs 6
+
  $HK/run-make --repeat-chr --cmd "$HK/vcf-add-rsid -vcf $OUT/vcfs/chr1/chr1.filtered.vcf.gz --db $HK/../data/dbsnp_142.b37.vcf.gz --out $OUT/vcfs/chr1/chr1.filtered.rsid.vcf.gz" --numjobs 6 --out runmake.rsid
   −
Looking up SNPs by rsID is possible by (for example)
+
Looking up SNPs by rsID is possible by (for example, rs17766217) -- How can we find its position?
  $HK/vcf-lookup-rsid --vcf $OUT/vcfs/chr1/chr1.filtered.vcf.gz --sepchr --rs rs17766217
+
  $HK/tabix $OUT/vcfs/chr8/chr8.filtered.rsid.vcf.gz 8:128504497 | less
 
* Be sure to look at the QUAL & your sample's PL, and not just the GL field.  Check if QUAL is 0 or PL is 0,0,0 - NS is also probably 0; DP is probably 0.  That means you probably didn't have any copies, so your GT may not be correct/is unknown.
 
* Be sure to look at the QUAL & your sample's PL, and not just the GL field.  Check if QUAL is 0 or PL is 0,0,0 - NS is also probably 0; DP is probably 0.  That means you probably didn't have any copies, so your GT may not be correct/is unknown.
    
If you want to browse the rsIDs of known GWAS SNPs, you can do this by
 
If you want to browse the rsIDs of known GWAS SNPs, you can do this by
  cut -f 1,8,22 $HK/../data/gwascatalog/gwascatalog.txt | less
+
  cut -f 1,8,12,13,22 $HK/../data/gwascatalog/gwascatalog.txt | grep -w rs17766217
    
==== Annotating your genome ====
 
==== Annotating your genome ====
Line 289: Line 292:     
Or you can run multiple chromosomes in parallel in one command
 
Or you can run multiple chromosomes in parallel in one command
  $HK/run-command-wgs --cmd "$EPACTS/bin/epacts anno --in $OUT/vcfs/chr1/chr1.filtered.rsid.vcf.gz --out $OUT/vcfs/chr1/chr1.filtered.rsid.anno.vcf.gz" --numjobs 6  
+
  $HK/run-make --repeat-chr --cmd "$EPACTS/bin/epacts anno --in $OUT/vcfs/chr1/chr1.filtered.rsid.vcf.gz --out $OUT/vcfs/chr1/chr1.filtered.rsid.anno.vcf.gz" --numjobs 6 --out runmake.anno
+
 
 
==== Extracting only exonic SNPs ====
 
==== Extracting only exonic SNPs ====
    
If you want to look at the exonic SNPs, you can extract using the following command
 
If you want to look at the exonic SNPs, you can extract using the following command
  $HK/run-command-wgs --cmd "($HK/tabix -H $OUT/vcfs/chr1/chr1.filtered.rsid.anno.vcf.gz; zcat $OUT/vcfs/chr1/chr1.filtered.rsid.anno.vcf.gz | grep Exon;)| $HK/bgzip -c > $OUT/vcfs/chr1/chr1.filtered.rsid.anno.exon.vcf.gz" --numjobs 6
+
  $HK/run-make --repeat-chr --cmd "($HK/tabix -H $OUT/vcfs/chr1/chr1.filtered.rsid.anno.vcf.gz; zcat $OUT/vcfs/chr1/chr1.filtered.rsid.anno.vcf.gz | grep Exon;)| $HK/bgzip -c > $OUT/vcfs/chr1/chr1.filtered.rsid.anno.exon.vcf.gz" --numjobs 6 --out runmake.exome
    
And they can be combined as follows
 
And they can be combined as follows
 
  (zcat $OUT/vcfs/chr1/chr1.filtered.rsid.anno.exon.vcf.gz; zcat $OUT/vcfs/chr[2-9]/chr*.filtered.rsid.anno.exon.vcf.gz $OUT/vcfs/chr??/chr*.filtered.rsid.anno.exon.vcf.gz $OUT/vcfs/chrX/chrX.filtered.rsid.anno.exon.vcf.gz | grep -v ^#) | $HK/bgzip -c > $OUT/wgs.filtered.rsid.anno.exon.vcf.gz
 
  (zcat $OUT/vcfs/chr1/chr1.filtered.rsid.anno.exon.vcf.gz; zcat $OUT/vcfs/chr[2-9]/chr*.filtered.rsid.anno.exon.vcf.gz $OUT/vcfs/chr??/chr*.filtered.rsid.anno.exon.vcf.gz $OUT/vcfs/chrX/chrX.filtered.rsid.anno.exon.vcf.gz | grep -v ^#) | $HK/bgzip -c > $OUT/wgs.filtered.rsid.anno.exon.vcf.gz
 +
$HK/tabix -pvcf $OUT/wgs.filtered.rsid.anno.exon.vcf.gz
    
==== Exonic Variants NOT found by 1000G ====
 
==== Exonic Variants NOT found by 1000G ====
Line 321: Line 325:     
Want to see this from the BAM file?  Use samtools tview:
 
Want to see this from the BAM file?  Use samtools tview:
  $GC/bin/samtools tview $SAMPLE/output/bams/$SAMPLE.recal.bam $GC/gotcloud.ref/human.g1k.v37.fa
+
  $GC/bin/samtools tview $SAMPLE/output/bams/$SAMPLE.recal.bam $REF/hs37d5.fa
 
Use 'g' & enter the Chr:Pos
 
Use 'g' & enter the Chr:Pos
 
* Some patterns may indicate not real variants.
 
* Some patterns may indicate not real variants.
Line 330: Line 334:  
   
 
   
 
The phred score at the last column quantifies the degree of functional significance
 
The phred score at the last column quantifies the degree of functional significance
         
</div>
 
</div>
 
</div>
 
</div>
 +
 +
== OVERALL COURSE FEEDBACK! ==
 +
Please provide feedback:
 +
https://docs.google.com/forms/d/1pxfPXKwWfA71ZJM99Sevs3MwAUz2UbHAR8dnRI-kRNM/viewform

Navigation menu