Changes

From Genome Analysis Wiki
Jump to navigationJump to search
136 bytes removed ,  17:06, 11 October 2012
Line 130: Line 130:  
<pre>epacts2.1/epacts single -man
 
<pre>epacts2.1/epacts single -man
 
</pre>  
 
</pre>  
There are three separate association analyses to be completed:  
+
To simplify the reporting process, o
 +
 
 +
There are '''4''' separate association analyses to be completed (score and Firth, with and without adjustment for BMI):  
    
{| width="1500" border="1" align="left" cellpadding="1" cellspacing="1"
 
{| width="1500" border="1" align="left" cellpadding="1" cellspacing="1"
Line 151: Line 153:  
|-
 
|-
 
|  
 
|  
[http://genome.sph.umich.edu/wiki/EPACTS_for_DIAGRAM#A._Typical_DIAGRAM_analysis_using_existing_association_pipeline A. &nbsp;Typical DIAGRAM analysis using existing association pipeline]  
+
[http://genome.sph.umich.edu/wiki/EPACTS_for_DIAGRAM#A._Typical_DIAGRAM_analysis_using_existing_association_pipeline 1. &nbsp;Typical DIAGRAM analysis using existing association pipeline]  
    
|  
 
|  
Line 163: Line 165:     
|  
 
|  
DIAGRAMv4_QQQ_XXX_1000G_KKK_TTT_YYY_ZZZ.txt  
+
DIAGRAMv4_iSNPs_XXX_1000G_KKK_TTT_YYY_ZZZ.txt  
 
  −
|-
  −
|
  −
[http://genome.sph.umich.edu/wiki/EPACTS_for_DIAGRAM#B._Analysis_of_all_SNPs_using_logistic_regression_score_test B. &nbsp;Analysis of all SNPs using logistic regression score test]
  −
 
  −
|
  −
Score test
  −
 
  −
|
  −
All SNPs with MAC &gt;= 1
  −
 
  −
|
  −
EPACTS output file
     −
|
+
DIAGRAMv4_iSNPs_XXX_adjBMI_1000G_KKK_TTT_YYY_ZZZ.txt
DIAGRAMv4_QQQ_XXX_1000G_KKK_SCR_YYY_ZZZ.epacts
      
|-
 
|-
 
|  
 
|  
[http://genome.sph.umich.edu/wiki/EPACTS_for_DIAGRAM#C._Analysis_of_low_frequency_variants_using_Firth_bias-corrected_logistic_regression C. &nbsp;Analysis of low frequency variants using Firth bias-corrected logistic regression]  
+
[http://genome.sph.umich.edu/wiki/EPACTS_for_DIAGRAM#C._Analysis_of_low_frequency_variants_using_Firth_bias-corrected_logistic_regression 2. &nbsp;Analysis of low frequency variants using Firth bias-corrected logistic regression]  
    
|  
 
|  
Line 189: Line 177:     
|  
 
|  
SNPs with 200 &gt;= MAC &gt;= 1  
+
SNPs with
 +
 
 +
200 &gt;= MAC &gt;= 1  
    
|  
 
|  
Line 195: Line 185:     
|  
 
|  
DIAGRAMv4_QQQ_XXX_1000G_KKK_FBC_YYY_ZZZ.epacts  
+
DIAGRAMv4_iSNPs_XXX_1000G_KKK_FBC_YYY_ZZZ.epacts
 +
 
 +
DIAGRAMv4_iSNPs_XXX_adjBMI_1000G_KKK_FBC_YYY_ZZZ.epacts
    
|}
 
|}
Line 226: Line 218:  
'''YYY '''indicates the DATE of file generation (MMDDYY format, e.g. 021710 – apologies in advance to our&nbsp;European colleagues)<br>'''ZZZ '''indicates the name + other initials of the uploader (e.g., BFV, LJS, ABC, etc.)  
 
'''YYY '''indicates the DATE of file generation (MMDDYY format, e.g. 021710 – apologies in advance to our&nbsp;European colleagues)<br>'''ZZZ '''indicates the name + other initials of the uploader (e.g., BFV, LJS, ABC, etc.)  
   −
=== A. Typical DIAGRAM analysis using existing association pipeline<br>  ===
+
=== 1. Typical DIAGRAM analysis using existing association pipeline<br>  ===
   −
This is the typical DIAGRAM analysis using your current association pipeline and software. &nbsp; [[File:1000Genomes_march2012_imputation_analysis_plan_08312012.pdf]]
+
This is the typical DIAGRAM analysis using your current association pipeline and software. &nbsp; [[Image:1000Genomes march2012 imputation analysis plan 08312012.pdf]]  
   −
=== B. Analysis of all SNPs using logistic regression score test ===
+
=== 2. Analysis of low frequency variants using Firth bias-corrected logistic regression  ===
   −
The score test has well-controlled type I error rate and good power for meta-analysis of balanced (equal numbers of cases and controls) studies. &nbsp;It is also very computationally efficient. &nbsp;Please run the score test using the EPACTS software.  
+
The Firth bias-corrected test has well-controlled type I error rate and good power for analysis of balanced and unbalanced studies. &nbsp;However, it is more computationally intensive. &nbsp;We only run Firth on&nbsp;the subset of variants with 1&lt;= MAC &lt;= 200.  
   −
The EPACTS command for the score test analysis of all variants is:  
+
To run the Firth test using the EPACTS software:
<pre>epacts2.1/epacts single -vcf [INPUT VCF FILENAME] -ped [INPUT PED FILENAME] -out [OUTPUT FILENAME PREFIX] \
+
 
 +
epacts2.1/epacts single -vcf [INPUT VCF FILENAME] -ped [INPUT PED FILENAME] -out [OUTPUT FILENAME PREFIX] \<br>-test b.firth -pheno DISEASE -cov AGE -sepchr -anno -min-mac 1 -max-mac 200 -run 10<br>B. Analysis of all SNPs using logistic regression score test<br>The score test has well-controlled type I error rate and good power for meta-analysis of balanced (equal numbers of cases and controls) studies.&nbsp; It is also very computationally efficient.&nbsp; Please run the score test using the EPACTS software.
 +
<pre>
 +
The EPACTS command for the score test analysis of all variants is:
 +
epacts2.1/epacts single -vcf [INPUT VCF FILENAME] -ped [INPUT PED FILENAME] -out [OUTPUT FILENAME PREFIX] \
 
-test b.score -pheno DISEASE -cov AGE -sepchr -anno -min-mac 1 -run 10
 
-test b.score -pheno DISEASE -cov AGE -sepchr -anno -min-mac 1 -run 10
</pre>
+
This command will run single variant analysis using the score test logistic regression on the DISEASE phenotype adjusting for AGE. Add the relevant additional covariates with additional "-cov" options. This assumes that the VCF files are separated by chromosomes (option -sepchr). All variants with at least one minor allele count will be analyzed (option -min-mac 1). It will annotate results by functional category (option -anno) and run the analysis on 10 parallel CPUs (option -run 10).
This command will run single variant analysis using the score test logistic regression on the DISEASE phenotype adjusting for AGE. &nbsp;Add the relevant additional covariates with additional "-cov" options. &nbsp;This assumes that the VCF files are separated by chromosomes (option -sepchr). &nbsp;All variants with at least one minor allele count will be analyzed (option -min-mac 1). &nbsp;It will annotate results by functional category (option -anno) and run the analysis on 10 parallel CPUs (option -run 10).  
+
</pre>
   −
=== C. Analysis of low frequency variants using Firth bias-corrected logistic regression  ===
  −
  −
The Firth bias-corrected test has well-controlled type I error rate and good power for analysis of balanced and unbalanced studies. &nbsp;However, it is more computationally intensive. &nbsp;We only run Firth on&nbsp;the subset of variants with 1&lt;= MAC &lt;= 200.
  −
  −
To run the Firth test using the EPACTS software:
  −
<pre>epacts2.1/epacts single -vcf [INPUT VCF FILENAME] -ped [INPUT PED FILENAME] -out [OUTPUT FILENAME PREFIX] \
  −
-test b.firth -pheno DISEASE -cov AGE -sepchr -anno -min-mac 1 -max-mac 200 -run 10</pre>
   
== 5. &nbsp;Report EPACTS results<br>  ==
 
== 5. &nbsp;Report EPACTS results<br>  ==
  
216

edits

Navigation menu