Difference between revisions of "RAREMETAL METHOD"

From Genome Analysis Wiki
Jump to navigationJump to search
Line 56: Line 56:
<math> \left(T_{b\left(f_1\right)},T_{b\left(f_2\right)},\dots,T_{b\left(f_m\right)}\right)</math><math>\sim\mathbf{MVN}\left(\mathbf{0},\boldsymbol{\Omega}\right) </math>,  
<math> \left(T_{b\left(f_1\right)},T_{b\left(f_2\right)},\dots,T_{b\left(f_m\right)}\right)</math><math>\sim\mathbf{MVN}\left(\mathbf{0},\boldsymbol{\Omega}\right) </math>,  
where <math>\boldsymbol{\Omega_{ij}}=\frac{\boldsymbol{\phi}_{f_i}^T\left(\sum_{i=1}^n{\mathbf{V_i}}\right)\boldsymbol{\phi}_{f_j}}{\sqrt{\boldsymbol{\phi}_{f_i}^T\left(\sum_{i=1}^n{\mathbf{V_i}}\right)\boldsymbol{\phi}_{f_i}}\sqrt{\boldsymbol{\phi}_{f_j}^T\left(\sum_{i=1}^n{\mathbf{V_i}}\right)\boldsymbol{\phi}_{f_j}}}</math>
where <math>\boldsymbol{\Omega_{ij}}=\frac{\boldsymbol{\phi}_{f_i}^T\mathbf{V_{meta}}\boldsymbol{\phi}_{f_j}}{\sqrt{\boldsymbol{\phi}_{f_i}^T\mathbf{V_{meta}}\boldsymbol{\phi}_{f_i}}\sqrt{\boldsymbol{\phi}_{f_j}^T\mathbf{V_{meta}}\boldsymbol{\phi}_{f_j}}}</math>

Revision as of 23:08, 8 April 2014


The key idea behind meta-analysis with RAREMETAL is that various gene-level test statistics can be reconstructed from single variant score statistics and that, when the linkage disequilibrium relationships between variants are known, the distribution of these gene-level statistics can be derived and used to evaluate signifi-cance. Single variant statistics are calculated using the Cochran-Mantel-Haenszel method. The main formulae are tabulated in the following:



We denote the following to describe our methods:

is the score statistic for the variant from the study

is the covariance of the score statistics between the and the variant from the study

and are described in detail in RAREMETALWORKER method.

is the vector of score statistics of rare variants in a gene from the study.

is the variance-covariance matrix of score statistics of rare variants in a gene from the study, or

is the number of studies

is the vector of weights for rare variants in a gene.


Single variant meta-analysis score statistic can be reconstructed from score statistics and their variances generated by each study, assuming that samples are unrelated across studies. Define meta-analysis score statistics as

and its variance

Then the score test statistics for the variant asymptotically follows standard normal distribution


Burden test has been shown to be powerful detecting a group of rare variants that are unidirectional in effects. Once single variant meta analysis statistics are constructed, burden test score statistic can be easily reconstructed as



Including variants that are not associated to phenotype can hurt power. Variable threshold test is designed to choose the optimal allele frequency threshold amongst rare variants in a gene, to gain power. The test statistic is defined as the maximum burden score statistic calculated using every possible frequency threshold


where the burden test statistic under any allele frequency threshold can be constructed from single variant meta-analysis statistics using


where represents any allele frequency in a group of rare variants, is a vector of 0 and 1, indicating if a variant is included in the analysis using frequency threshold .

As described by Lin et. al, the p-value of this test can be calculated analytically using the fact that the burden test statistics together follow a multivariate normal distribution with mean and covariance , written as




SKAT is most powerful when detecting genes with rare variants having opposite directions in effect sizes. Meta-analysis statistic can also be re-constructed using single variant meta-analysis scores and their covariances


where is a diagonal matrix of weights of rare variants included in a gene.

As shown in Wu et. al, the null distribution of the statistic follows a mixture chi-sqaured distribution described as

where are eigen values of .

Formulae for RAREMETAL
Test Statistics Null Distribution Notation
Single Variant
un-weighted Burden
Weighted Burden