SeqShop: Calling Your Own Genome, December 2014

From Genome Analysis Wiki
Revision as of 23:22, 8 December 2014 by Mktrost (talk | contribs)
Jump to navigationJump to search

Login instructions for seqshop-server

Login to the seqshop-server Linux Machine

This section will appear redundantly in each session. If you are already logged in or know how to log in to the server, please skip this section

  1. Login to the windows machine
  • The username/password for the Windows machine should be written on the right-hand monitor
  • Start xming so you can open external windows on our Linux machine
    • Start->Enter "Xming" in the search and select "Xming" from the program list
    • Nothing will happen, but Xming was started.
    • View Screenshot
    • Xming.png

  • Open putty
    • Start->Enter "putty" in the search and select "PuTTY" from the program list
    • View Screenshot
    • PuttyS.png

  • Configure PuTTY in the PuTTY Configuration window
    • Host Name: seqshop-server.sph.umich.edu
    • View Screenshot
    • Seqshop.png

    • Setup to allow you to open external windows:
      • In the left pannel: Connection->SSH->X11
        • Add a check mark in the box next to Enable X11 forwarding
        • View Screenshot
        • SeqshopX11.png

    • Click Open
    • If it prompts about a key, click OK
  • Enter your provided username & password as provided

  • You should now be logged into a terminal on the seqshop-server and be able to access the test files.

    • If you need another terminal, repeat from step 3.

    Login to the seqshop Machine

    So you can each run multiple jobs at once, we will have you run on 4 different machines within our seqshop setup.

    • You can only access these machines after logging onto seqshop-server

    3 users logon to:

    ssh -X seqshop1
    

    3 users logon to:

    ssh -X seqshop2
    

    2 users logon to:

    ssh -X seqshop3
    

    2 users logon to:

    ssh -X seqshop4
    

    Setup

    The snpcall pipeline will run overnight, but you'll want to log out.

    How do I leave something running on the server even if I log out?
    One solution is screen!
    How do I use screen?
    Before running your command, you need to start screen:
    screen

     

    As it says, press Space or Return.

    • It should now look basically the same as your normal command line.
    Scrolling problems when using screen?
    If you want to scroll and screen doesn't scroll like you normally would?
    • Type Ctrl-a Esc and you should be able to scroll up with your mouse wheel
      • Or at least that is what I do from my Linux machine - (sorry I'm typing this up/testing these commands from Linux and not windows, so can't test it out)


    Set these values. Also, be sure to specify your sample name instead of SampleXX

    export SAMPLE=SampleXX
    source /net/seqshop-server/home/mktrost/seqshop/setupSS.txt
    

    See the settings you just used:

    cat /net/seqshop-server/home/mktrost/seqshop/setupSS.txt
    

    Shows you:

    export GC=/net/seqshop-server/home/mktrost/seqshop/gotcloud
    export OUT=~/$SAMPLE/output

    SnpCall Step 1 (Day 1): Start SnpCall

    List of BAMs

    The list of BAMs has already been created (just 1 BAM, your sample).

    • But it is simply SAMPLE\tBAM_name, so easy to figure out
    cat ~/$SAMPLE/output/bam.list
    
    SampleXX SampleXX/output/bams/SampleXX.recal.bam
    • Relative path, so assumes running from your home directory (I prefer absolute paths, but for simplicity of the workshop, we just use relative path).

    Configuring SnpCall

    cat ~/$SAMPLE/gotcloud.conf
    

    You will see something like this:

    # Cluster Settings
    BATCH_TYPE = 
    BATCH_OPTS = 
    
    OUT_DIR = Sample13/output
    
    # Align Settings
    MAP_TYPE = BWA_MEM
    BWA_THREADS = -t 24
    FASTQ_LIST = fastq.list
    
    # SNP Call Settings
    UNIT_CHUNK = 20000000      # Chunk size of SNP calling : 20Mb
    VCF_EXTRACT = /net/seqshop-server/home/mktrost/seqshop/singleSample/snpOnly.vcf.gz
    MODEL_GLFSINGLE = TRUE
    MODEL_SKIP_DISCOVER = FALSE
    MODEL_AF_PRIOR = TRUE
    
    EXT_DIR = /net/seqshop-server/home/mktrost/seqshop/singleSample/ext
    EXT = $(EXT_DIR)/ALL.chrCHR.phase3.combined.sites.unfiltered.vcf.gz $(EXT_DIR)/chrCHR.filtered.sites.vcf.gz
    

    Configuration Updates

    In order to complete SnpCall overnight, we are going to tell GotCloud to only call SNPs for the EXOME regions.

    Edit

    nedit $SAMPLE/gotcloud.conf&
    
    • Or you can use vi or emacs or your favorite editor

    Specify the target region:

    UNIFORM_TARGET_BED = /net/seqshop-server/home/mktrost/seqshop/singleSample/20130108.exome.targets.bed
    
    • See Sequencing Settings for more information on the GotCloud configuration settings for running Targeted/Exome runs.

    Running SnpCall

    Run GotCloud snpcall with 6 jobs running in parallel

    • Why 6?
      • You want to run as many as you can.
      • 5 of you on the machine - 5*6 = 30 jobs will be running in parallel on that machine
    ${GC}/gotcloud snpcall --conf $SAMPLE/gotcloud.conf --numjobs 6
    
    • Only need the configuration & number of threads, rest is specified within the configuration.

    This will run overnight. We will check if it completed at the practical in the morning.

    Log Out

    Want to log out and leave your job running?

    In the screen window, type:

    Ctrl-a d
    

    (Hold down Ctrl and type 'a', let go of both and type 'd')

    • This will "detach" from your screen session while your alignment continues to run.

    If you have not detached from screen:

    Ctrl-a d
    

    exit PuTTY

    Tuesday FEEDBACK!

    Please provide feedback on the lectures/tutorials from today:

    https://docs.google.com/forms/d/1n8xYxvsOq-HsabpDfGcHvwD84BYIRDx8_b-H5N3d-D8/viewform

    SnpCall Step 2 (Day 2): Checking SnpCall

    Logging Back in to Check Jobs

    How do you log back into screen tomorrow?
    screen -r
    

    This will resume an already running screen.

    Indel Step 1 (Day 1): Start Indel Calling

    List of BAMs

    The list of BAMs has already been created (just 1 BAM, your sample).

    • But it is simply SAMPLE\tBAM_name, so easy to figure out
    cat ~/$SAMPLE/output/bam.list
    
    SampleXX SampleXX/output/bams/SampleXX.recal.bam
    • Relative path, so assumes running from your home directory (I prefer absolute paths, but for simplicity of the workshop, we just use relative path).

    Configuring Indel

    No special Configuration settings for Indel calling

    cat ~/$SAMPLE/gotcloud.conf
    

    You will see something like this:

    # Cluster Settings
    BATCH_TYPE = 
    BATCH_OPTS = 
    
    OUT_DIR = Sample13/output
    
    # Align Settings
    MAP_TYPE = BWA_MEM
    BWA_THREADS = -t 24
    FASTQ_LIST = fastq.list
    
    # SNP Call Settings
    UNIT_CHUNK = 20000000      # Chunk size of SNP calling : 20Mb
    VCF_EXTRACT = /net/seqshop-server/home/mktrost/seqshop/singleSample/snpOnly.vcf.gz
    MODEL_GLFSINGLE = TRUE
    MODEL_SKIP_DISCOVER = FALSE
    MODEL_AF_PRIOR = TRUE
    
    EXT_DIR = /net/seqshop-server/home/mktrost/seqshop/singleSample/ext
    EXT = $(EXT_DIR)/ALL.chrCHR.phase3.combined.sites.unfiltered.vcf.gz $(EXT_DIR)/chrCHR.filtered.sites.vcf.gz
    

    Running Indel Calling

    Run GotCloud indel with 6 jobs running in parallel

    • Why 6?
      • You want to run as many as you can.
      • 5 of you on the machine - 5*6 = 30 jobs will be running in parallel on that machine
    ${GC}/gotcloud indel --conf $SAMPLE/gotcloud.conf --numjobs 6
    
    • Only need the configuration & number of threads, rest is specified within the configuration.

    This will run overnight. We will check if it completed at the practical in the morning.

    Log Out

    Want to log out and leave your job running?

    In the screen window, type:

    Ctrl-a d
    

    (Hold down Ctrl and type 'a', let go of both and type 'd')

    • This will "detach" from your screen session while your alignment continues to run.

    If you have not detached from screen:

    Ctrl-a d
    

    exit PuTTY

    Tuesday FEEDBACK!

    Please provide feedback on the lectures/tutorials from today:

    https://docs.google.com/forms/d/1n8xYxvsOq-HsabpDfGcHvwD84BYIRDx8_b-H5N3d-D8/viewform

    Step 2 (Day 2): Checking Indel

    Logging Back in to Check Jobs

    How do you log back into screen tomorrow?
    screen -r
    

    This will resume an already running screen.