Changes

From Genome Analysis Wiki
Jump to navigationJump to search
536 bytes removed ,  00:34, 14 April 2010
Line 57: Line 57:  
|}
 
|}
   −
== Uploading your own file and use the batch mode ==
+
== Using Batch Mode ==
   −
#Uploading a file<br>Association results can be uploaded to our web server using the LocusZoom webpage, which will accept a typical meta-analysis file for ~2.5 million SNPs provided the user selects only the required columns (SNP Name, p-value, and optionally, N) and gzips the file before uploading. In our tests, this results in a file ~17 Mb which is below the 20 Mb file size limit. This allows users to draw multiple plots from the LocusZoom website while only uploading the meta-analysis results file one time. Alternatively, for faster viewing of a single region, users can upload a file that contains only the rows corresponding to SNPs in the region of interest or a particular chromosome.
+
To start batch mode, first upload your results file just as you would in interactive mode. The same file size restrictions apply.
#Uploading the specification file<br>Users can upload a specification file which allows for the easy generation of dozens of plots, where each plot can be customized for even more features than available on the web interface for LocusZoom. The file is required to have 7 white space-delimited columns, where the last column can be blank. The header is not important, but LocusZoom expects a header to exist. To define a region to plot, users may specify either i) a SNP name in the first column and the appropriate flanking region (e.g. 200kb, 500kb, 1Mb) in the fifth column, or ii) a gene name in the first column and the appropriate flanking region in the fifth column, or iii) a chromosome number, start and stop positions in the 2nd, 3rd and 4th columns respectively. If option ii) is selected, LocusZoom will select the most significant SNP in the region as the index SNP. If option iii) is chosen, the index SNP for the plot must be specified in the first column. For distances &lt;= 500kb where the lead SNP is a HapMap SNP and LD from CEU is requested, the plots will be generated very quickly because we have pre-computed LD for all HapMap SNPs in the CEU. The sixth column is used to select which regions should be plotted. For example, if you run a specification file with 15 plots and 14 of them turn out very nicely, you may wish to re-run the specification file with some modifications to a single row of the specification file to tweak the last remaining plot. In this case, you could change the 14 plots you don’t need to re-run to “no” under the 6th column (“run” column) and leave the 15th plot you’d like to re-run as “yes” in this column. The 7th and final column contains optional LocusZoom arguments (see the 2nd column of Table 1.3 above). As many options as the user wishes to change can be specified in 7th column and LocusZoom options should be separated by spaces.
+
 
 +
=== Generating a Hit Spec File ===
 +
 
 +
Users can upload a specification file which allows for the easy generation of dozens of plots, where each plot can be customized for even more features than available on the web interface for LocusZoom. The file is required to have 7 white space-delimited columns, where the last column can be blank. The header is not important, but LocusZoom expects a header to exist. To define a region to plot, users may specify either i) a SNP name in the first column and the appropriate flanking region (e.g. 200kb, 500kb, 1Mb) in the fifth column, or ii) a gene name in the first column and the appropriate flanking region in the fifth column, or iii) a chromosome number, start and stop positions in the 2nd, 3rd and 4th columns respectively. If option ii) is selected, LocusZoom will select the most significant SNP in the region as the index SNP. If option iii) is chosen, the index SNP for the plot must be specified in the first column. For distances &lt;= 500kb where the lead SNP is a HapMap SNP and LD from CEU is requested, the plots will be generated very quickly because we have pre-computed LD for all HapMap SNPs in the CEU. The sixth column is used to select which regions should be plotted. For example, if you run a specification file with 15 plots and 14 of them turn out very nicely, you may wish to re-run the specification file with some modifications to a single row of the specification file to tweak the last remaining plot. In this case, you could change the 14 plots you don’t need to re-run to “no” under the 6th column (“run” column) and leave the 15th plot you’d like to re-run as “yes” in this column. The 7th and final column contains optional LocusZoom arguments (see the 2nd column of Table 1.3 above). As many options as the user wishes to change can be specified in 7th column and LocusZoom options should be separated by spaces.
    
Example of a specification file (must include a header);  
 
Example of a specification file (must include a header);  
<pre>specfile.txt
+
 
snp       chr start end flank run m2zargs
+
  <-- Contents of specfile.txt -->
rs1 NA NA NA 500kb yes rfrows=3 weightCol=”N” snpset=”HapMap” metalRug=”Our SNPs”  
+
  snp   chr     start end flank run m2zargs
rs2 1 540000 580000 NA yes rfrows=4 legend=”right” showAnnot=T  
+
  rs1 NA NA   NA   500kb yes     rfrows=3 weightCol=”N” snpset=”HapMap” metalRug=”Our SNPs”  
CETP NA NA NA 200kb yes rfrows=6 showAnnot=T annotPch=”1,24,24,25,22,21,8,7”</pre>  
+
  rs2 1 540000   580000   NA         yes     rfrows=4 legend=”right” showAnnot=T  
 +
  CETP NA NA   NA   200kb yes     rfrows=6 showAnnot=T annotPch=”1,24,24,25,22,21,8,7”</pre>
    
== Generate single plots using our publicly-available lipids GWAS data  ==
 
== Generate single plots using our publicly-available lipids GWAS data  ==

Navigation menu