Difference between revisions of "RareMETALS2"

From Genome Analysis Wiki
Jump to navigationJump to search
Line 23: Line 23:
 
== Exemplar Datasets ==
 
== Exemplar Datasets ==
  
The following exemplar datasets [[Media:ExampleDataaset.zip | ExampleDataset.zip]]can be downloaded and tested with rareMETALS2 package.
+
The following exemplar datasets [[Media:ExampleDataaset.zip | ExampleDataset.zip]] can be downloaded and tested with rareMETALS2 package.
 
* ''Score Statistics File''
 
* ''Score Statistics File''
 
* ''Covariance Matrix File''
 
* ''Covariance Matrix File''

Revision as of 23:25, 16 June 2015

The R package rareMETALS2 was an extension of the R package rareMETALS. It was designed to meta-analyze gene-level association tests for binary trait . While rareMETALS offers a near-complete solution for meta-analysis of gene-level tests for quantitative trait, it does not offer the optimal solution for binary trait. The package rareMETALS2 offers improved features for analyzing gene-level association tests in meta-analyses for binary trait. If you have any questions for using rareMETALS2 or rvtests, please post your questions to our google group https://groups.google.com/forum/#!forum/raremetals

The package rareMETALS2 is under development. It takes summary association statistics generated by rvtests as input. It offers the following unique features

  • 1.) It allows the meta-analysis of samples with related individuals and samples with unrelated individuals, and allows locally efficient estimate of genetic effects.
  • 2.) It allows the adjustment of covariates in meta-analysis.
  • 3.) It allows conditional meta-analysis of single variant and gene-level associations.

Change Log

June, 14, 2015 0.1 Version released

Download

The R package can be downloaded from rareMETALS2_0.1.tar.gz. It will be eventually released on the Comprehensive R-archive Network.

How to install

To install the package, please use "R CMD INSTALL rareMETALS2_XXX.tar.gz" command, where XXX is the version number for rareMETALS2

Forum to Ask Questions

I have created a google group for discussion on the usage and for bug reports etc. If you find any issues to the package and think that the discussions may also benefit others, please post them to the user group. Here is the link to the discussion group https://groups.google.com/forum/#!forum/raremetals

Exemplar Datasets

The following exemplar datasets ExampleDataset.zip can be downloaded and tested with rareMETALS2 package.

  • Score Statistics File
  • Covariance Matrix File
  • Tabix index file

These files are all automatically generated by rvtests.

Meta-analysis of Single Variant Associations

 rareMETALS2.single <- function(score.stat.file,range,alternative=c('two.sided','greater','less'),ix.gold=1,callrate.cutoff=0,hwe.cutoff=0,hwe.ctrl.cutoff=0)

Relevant Parameters:

  • score.stat.file files of score statistics
  • range tabix range of variants to be analyzed
  • alternative alternative hypothesis to be specified
  • ix.gold Gold standard population to align reference allele to.
  • callrate.cutoff Cutoffs of call rate, lower than which will NOT be analyzed (labelled as missing)
  • hwe.cutoff Cutoffs of HWE p-values; Variants with HWE p-value smaller than the cutoffs are removed from subsequent analysis and labelled as missing;
  • hwe.ctrl.cutoff Cutoffs of HWE p-values using controls; Variants with HWE p-value smaller than the cutoffs are removed from subsequent analysis and labelled as missing; In case control studies, it is recommended to use hwe.ctrl.cutoff, since large effect variants may violate HWE.

Meta-analysis of Gene-level Association

 rareMETALS2.range <- function(score.stat.file,cov.file,range,range.name,test='GRANVIL',maf.cutoff=1,alternative=c('two.sided','greater','less'),
 ix.gold=1,out.digits=4,callrate.cutoff=0,hwe.cutoff=0,hwe.ctrl.cutoff=0,max.VT=NULL)
  • score.stat.file files of score statistics
  • cov.file covariance matrix files
  • range tabix range for each gene/region
  • range.name The name of the range,e.g. gene names can be used
  • test rare variant tests to be used
  • maf.cutoff MAF cutoff used to analyze variants
  • alternative alternative hypothesis to be specified
  • ix.gold Gold standard population to align reference allele to
  • out.digits Number of digits used in the output
  • callrate.cutoff Cutoffs of call rate, lower than which will NOT be analyzed (labelled as missing)
  • hwe.cutoff Cutoffs of HWE p-values; Variants with HWE p-value smaller than the cutoffs are removed from subsequent analysis and labelled as missing;
  • hwe.ctrl.cutoff Cutoffs of HWE p-values using controls; Variants with HWE p-value smaller than the cutoffs are removed from subsequent analysis and labelled as missing; In case control studies, it is recommended to use hwe.ctrl.cutoff, since large effect variants may violate HWE.
  • max.VT The maximum number of thresholds used in VT; Setting max.VT to 10 can improve the speed for calculation without affecting the power too much. The default parameter is NULL, which does not set upper limit on the number of variable frequency threhsold.

Conditional Meta-analysis

 conditional.rareMETALS2.single <- function(candidate.variant.vec,score.stat.file,cov.file,known.variant.vec,maf.cutoff,no.boot=10000,alternative=c('two.sided','greater','less'),
 ix.gold=1,out.digits=4,callrate.cutoff=0,hwe.cutoff=0,hwe.ctrl.cutoff=0,p.value.known.variant.vec=NA,anno.known.variant.vec=NA,anno.candidate.variant.vec=NA)
  • candidate.variant Position of candidate variant
  • score.stat.file Files of score statistics
  • cov.file Covariance matrix files
  • known.variant.vec Range of candidate variant, expressed in a vector, e.g. c("1:12345","1:234567");
  • test test of rare variant tests
  • maf.cutoff Cutoffs of MAF used for determining rare variants
  • alternative Alternative hypothesis to be tested
  • out.digits The number of digits used in the output
  • callrate.cutoff Cutoff of call rates. Sites with callrates lower than the cutoff will be labeled as missing
  • hwe.cutoff Cutoff of HWE p-values. Sites with HWE pvalues lower than the cutoff will be labeled as missing