Changes

From Genome Analysis Wiki
Jump to navigationJump to search
no edit summary
Line 222: Line 222:  
<br>  
 
<br>  
   −
<br>
+
<br>  
    
=== Analysis for QC  ===
 
=== Analysis for QC  ===
Line 249: Line 249:  
|}
 
|}
   −
<br>
+
<br>  
    
=== Secondary analyses (with BMI)  ===
 
=== Secondary analyses (with BMI)  ===
Line 364: Line 364:  
----
 
----
   −
=== 1. Typical DIAGRAM analysis using existing association pipeline<br>  ===
+
=== A. Typical DIAGRAM analysis using existing association pipeline<br>  ===
    
This is the typical DIAGRAM analysis using your current association pipeline and software. &nbsp; [[Image:1000Genomes march2012 imputation analysis plan 08312012.pdf]]  
 
This is the typical DIAGRAM analysis using your current association pipeline and software. &nbsp; [[Image:1000Genomes march2012 imputation analysis plan 08312012.pdf]]  
Line 376: Line 376:  
'''Important:''' To analyze dosages (not genotypes), you must specify the dosage field with the "--field EC" option. Without this option, you will be analyzing the hard genotypes (i.e. --field option defaults to "GT" or "genotypes")!  
 
'''Important:''' To analyze dosages (not genotypes), you must specify the dosage field with the "--field EC" option. Without this option, you will be analyzing the hard genotypes (i.e. --field option defaults to "GT" or "genotypes")!  
   −
=== 2. Analysis of low frequency variants using Firth bias-corrected logistic regression  ===
+
=== B. Analysis of low frequency variants using Firth bias-corrected logistic regression  ===
    
The Firth bias-corrected test has well-controlled type I error rate and good power for analysis of balanced and unbalanced studies. &nbsp;However, it is more computationally intensive. &nbsp;We only run Firth on&nbsp;the subset of variants with 1&lt;= MAC &lt;= 200.  
 
The Firth bias-corrected test has well-controlled type I error rate and good power for analysis of balanced and unbalanced studies. &nbsp;However, it is more computationally intensive. &nbsp;We only run Firth on&nbsp;the subset of variants with 1&lt;= MAC &lt;= 200.  
Line 386: Line 386:  
'''Important:''' &nbsp;To analyze dosages (not genotypes), you must specify the dosage field with the "--field EC" option. &nbsp;Without this option, you will be analyzing the hard genotypes (i.e. --field option defaults to "GT" or "genotypes")!  
 
'''Important:''' &nbsp;To analyze dosages (not genotypes), you must specify the dosage field with the "--field EC" option. &nbsp;Without this option, you will be analyzing the hard genotypes (i.e. --field option defaults to "GT" or "genotypes")!  
   −
=== 3. Analysis of chromosome 20 using logistic regression score test  ===
+
=== C. Analysis of chromosome 20 using logistic regression score test  ===
    
The score test has well-controlled type I error rate and good power for meta-analysis of balanced (equal numbers of cases and controls) studies.&nbsp; It is also very computationally efficient.&nbsp; Please run the score test using the EPACTS software.  
 
The score test has well-controlled type I error rate and good power for meta-analysis of balanced (equal numbers of cases and controls) studies.&nbsp; It is also very computationally efficient.&nbsp; Please run the score test using the EPACTS software.  
Line 396: Line 396:  
This command will run single variant analysis using the score test logistic regression on the DISEASE phenotype adjusting for AGE. Add the relevant additional covariates with additional "-cov" options. This assumes that the VCF files are separated by chromosomes (option -sepchr). All variants with at least one minor allele count will be analyzed (option -min-mac 1). It will annotate results by functional category (option -anno) and run the analysis on 10 parallel CPUs (option -run 10).  
 
This command will run single variant analysis using the score test logistic regression on the DISEASE phenotype adjusting for AGE. Add the relevant additional covariates with additional "-cov" options. This assumes that the VCF files are separated by chromosomes (option -sepchr). All variants with at least one minor allele count will be analyzed (option -min-mac 1). It will annotate results by functional category (option -anno) and run the analysis on 10 parallel CPUs (option -run 10).  
   −
=== 4. Typical DIAGRAM analysis using existing association pipeline (with BMI)<br>  ===
+
=== D. Typical DIAGRAM analysis using existing association pipeline (with BMI)<br>  ===
    
This is the typical DIAGRAM analysis using your current association pipeline and software including BMI adjustment. &nbsp; [[Image:1000Genomes march2012 imputation analysis plan 08312012.pdf]]  
 
This is the typical DIAGRAM analysis using your current association pipeline and software including BMI adjustment. &nbsp; [[Image:1000Genomes march2012 imputation analysis plan 08312012.pdf]]  
Line 404: Line 404:  
<br>  
 
<br>  
   −
=== 5. Analysis of low frequency variants using Firth bias-corrected logistic regression (with BMI)  ===
+
=== E. Analysis of low frequency variants using Firth bias-corrected logistic regression (with BMI)  ===
    
Again use the Firth test on EPACTS for your analysis with BMI
 
Again use the Firth test on EPACTS for your analysis with BMI
216

edits

Navigation menu