Changes

From Genome Analysis Wiki
Jump to navigationJump to search
no edit summary
Line 1: Line 1:  
''(WARNING: Under Construction)''
 
''(WARNING: Under Construction)''
  −
==Introduction==
  −
See the [[Media:SeqShop - GotCloud svcall.pdf|introductory slides]] for an intro to this tutorial.
  −
      
== Goals of This Session ==
 
== Goals of This Session ==
Line 17: Line 13:     
This is the same setup you did for the previous tutorial, but you need to redo it each time you log in. It will setup some environment variables to point you to:
 
This is the same setup you did for the previous tutorial, but you need to redo it each time you log in. It will setup some environment variables to point you to:
 +
 
* GotCloud program
 
* GotCloud program
 
* Tutorial input files
 
* Tutorial input files
Line 36: Line 33:  
== Examining GotCloud/GenomeSTRiP Input files ==
 
== Examining GotCloud/GenomeSTRiP Input files ==
   −
=== Sequnce Alignment Files: BAM Files ===
+
=== Sequnce Alignment Files: BAM Files and Index Files===
   −
Per sample BAM files contain sequence reads that are mapped to positions in the genome.
+
The GotCloud Indel caller takes the same inputs as GotCloud snpcall.
 +
* BAMs->SVs rather than BAMs->SNPs
   −
For a reminder on how to look at/read BAM files, see: [[SeqShop:_Sequence_Mapping_and_Assembly_Practical#BAM_Files|SeqShop Aligment: BAM Files]]
     −
For this tutorial, we will use the 4 BAMs produced in the [[SeqShop: Sequence Mapping and Assembly Practical]] as well as with 58 BAMs that were pre-aligned to that 1MB region of chromosome 22.
+
 
 +
If you want a reminder, of what they look like, here is a link to the previous tutorial : [[SeqShop:_Variant_Calling_and_Filtering_for_SNPs_Practical#Examining_GotCloud_SnpCall_Input_files|GotCloud SnpCall Input Files]]
    
=== Reference Files ===
 
=== Reference Files ===
Line 49: Line 47:  
* See [[GotCloud: Genetic Reference and Resource Files]] for more information on downloading/generating reference files
 
* See [[GotCloud: Genetic Reference and Resource Files]] for more information on downloading/generating reference files
   −
For GotCloud/GenomeSTRiP, you need:
+
Similar to SNP and Indel calling, you need
 
# Reference genome FASTA file
 
# Reference genome FASTA file
 +
 +
For running GenomeSTRiP, you additionally need:
 
# Masked FASTA file to exclude hard-to-align regions
 
# Masked FASTA file to exclude hard-to-align regions
 
# PloidyMap file indicating the regions of genomes with unusual ploidy (e.g. chrX, chrY)
 
# PloidyMap file indicating the regions of genomes with unusual ploidy (e.g. chrX, chrY)
Line 65: Line 65:  
</div>
 
</div>
 
</ul>
 
</ul>
 +
 +
ls $REF
 +
 +
<ul>
 +
<div class="mw-collapsible mw-collapsed" style="width:200px">
 +
<li>View Results</li>
 +
<div class="mw-collapsible-content">
 +
1000G_omni2.5.b37.sites.PASS.chr22.vcf.gz            hapmap_3.3.b37.sites.chr22.vcf.gz.tbi  human.g1k.v37.chr22.fa.bwt
 +
1000G_omni2.5.b37.sites.PASS.chr22.vcf.gz.tbi        human.g1k.v37.chr22-bs.umfa            human.g1k.v37.chr22.fa.fai
 +
1kg.pilot_release.merged.indels.sites.hg19.chr22.vcf  human.g1k.v37.chr22.dict              human.g1k.v37.chr22.fa.pac
 +
dbsnp_135.b37.chr22.vcf.gz                            human.g1k.v37.chr22.fa                human.g1k.v37.chr22.fa.sa
 +
dbsnp_135.b37.chr22.vcf.gz.tbi                        human.g1k.v37.chr22.fa.amb            human.g1k.v37.chr22.winsize100.gc
 +
hapmap_3.3.b37.sites.chr22.vcf.gz                    human.g1k.v37.chr22.fa.ann
 +
</div>
 +
</div>
 +
</ul>
 +
 +
 +
Additional reference and parameters
 +
 +
ls $SV/ref
 +
 +
human_g1k_v37.chr22.mask.100.fasta  human_g1k_v37.chr22.mask.100.fasta.dict  human_g1k_v37.chr22.mask.100.fasta.fai
 +
 +
ls $SV/conf
 +
genstrip_parameters.txt  humgen_g1k_v37_ploidy.chr22.map  humgen_g1k_v37_ploidy.map
    
=== GotCloud BAM Index File ===
 
=== GotCloud BAM Index File ===

Navigation menu