RAREMETALWORKER command reference
From Genome Analysis Wiki
Latest revision as of 17:47, 16 March 2018
- 1 Useful Links
- 2 Input Files
- 3 Output Files
- 4 VC Options
- 5 Trait Options
- 6 Model Options
- 7 Kinship Source
- 8 Kinship Options
- 9 Chromosome X
- 10 Others
- 11 PhoneHome
Here are some useful links to key pages:
- The RAREMETALWORKER documentation
- The RAREMETALWORKER method
- The RAREMETALWORKER special topics
- The RAREMETALWORKER quick start tutorial
- The FAQ
Input Files : --ped, --dat, --vcf, --dosage, --flagDosage [DS], --noeof
Output Files : --prefix, --LDwindow, --zip, --thin, --labelHits
VC Options : --vcX, --separateX
Trait Options : --makeResiduals, --inverseNormal, --traitName
Model Options : --recessive, --dominant
Kinship Source : --kinPedigree, --kinGeno, --kinFile, --kinxFile, --kinSave
Kinship Options : --kinMaf [0.05], --kinMiss [0.05]
Chromosome X : --xLabel [X], --xStart, --xEnd, --maleLabel, --femaleLabel
Others : --cpu, --kinOnly, --geneMap [../data/refFlat_hg19.txt], --mergedVCFID
PhoneHome : --noPhoneHome, --phoneHomeThinning
- --ped takes the name of your MERLIN format PED file.
- --dat takes the name of your MERLIN format DAT file.
- --vcf takes the name of your VCF file.
- When --dosage is issued in command line, RAREMETALWORKER reads dosage from your VCF file.
- --dosage must be used with --vcf option.
- Description of dosage format in a VCF file can be found in dosage.
- --flagDosage lets the user customize the name of the field in the VCF file that labels dosage data.
- The default is "DS".
- If your VCF file does not have the BGZF EOF marker, use the --noeof option so that RAREMETALWORKER skips checking for the BGZF EOF marker at the end of the file.
- Please see BGZF EOF for more details.
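The input options above can be combined as in the following minimal sketch, which assembles an argument list and enforces the stated rule that --dosage must be used with --vcf. The executable name `raremetalworker`, the helper name `build_input_args`, and the file names are assumptions for illustration only; only the option names come from this page.

```python
def build_input_args(ped, dat, vcf=None, dosage=False, flag_dosage=None, noeof=False):
    """Assemble RAREMETALWORKER input options as a list (hypothetical helper)."""
    # Rule stated on this page: --dosage must be used with --vcf.
    if dosage and vcf is None:
        raise ValueError("--dosage must be used with --vcf")
    args = ["--ped", ped, "--dat", dat]
    if vcf is not None:
        args += ["--vcf", vcf]
    if dosage:
        args.append("--dosage")
    if flag_dosage is not None:
        # Name of the VCF field holding dosages; the documented default is "DS".
        args += ["--flagDosage", flag_dosage]
    if noeof:
        args.append("--noeof")
    return args


# The executable name and file names below are assumptions; adjust to your setup.
command = ["raremetalworker"] + build_input_args(
    "study.ped", "study.dat", vcf="study.vcf.gz", dosage=True
)
print(" ".join(command))
```

Building the command as a list rather than a single string keeps file names with spaces intact if the list is later passed to a process launcher.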
- --prefix takes a string that is used as the prefix of your output files.
- For a full list of output files generated by RAREMETALWORKER, please refer to output.
- --LDwindow takes an integer value as the size of the moving window.
- RAREMETALWORKER generates LD matrices between a current marker that it is working on and all markers within this window.
- The default size is 1 million bases.
- For more information about the LD matrix, please refer to LD matrix.
- If --zip is issued, RAREMETALWORKER automatically compresses the generated summary statistics and LD matrices using gzip, and the compressed output files are indexed using tabix.
- If --thin is issued, then RAREMETALWORKER generates QQ plots and Manhattan plots at lower resolution (fewer points), to make the pdf files smaller in size.
- If --labelHits is issued, then RAREMETALWORKER automatically labels the loci that are above a threshold.
- The threshold is calculated using Bonferroni correction (0.05/N, where N is the total number of polymorphic markers).
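As a worked illustration of the Bonferroni threshold described above, the sketch below computes 0.05/N and picks out the markers that would pass it. The function names are hypothetical, and the sketch assumes that "above a threshold" refers to the plot scale, that is, a p-value smaller than 0.05/N.

```python
def bonferroni_threshold(n_polymorphic_markers, alpha=0.05):
    """Return alpha / N, where N is the number of polymorphic markers."""
    if n_polymorphic_markers <= 0:
        raise ValueError("the number of polymorphic markers must be positive")
    return alpha / n_polymorphic_markers


def hits_to_label(pvalues):
    """Return indices of markers whose p-value is below the threshold.

    Assumption: every entry in pvalues belongs to a polymorphic marker,
    so N is simply len(pvalues).
    """
    threshold = bonferroni_threshold(len(pvalues))
    return [i for i, p in enumerate(pvalues) if p < threshold]


# With one million polymorphic markers the threshold is about 5e-8.
print(bonferroni_threshold(1_000_000))
```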
- The --vcX option has to be used with --kinPedigree (when pedigree kinship is used), --kinGeno (when a genomic relationship matrix is estimated), or --kinFile (when the GRM is read from a file).
- Using the --vcX option lets RAREMETALWORKER fit a linear mixed model to analyze chromosome X, using both autosomal kinship and chromosome X kinship.
- --separateX option must be used with --vcX option.
- Using --separateX option requests RAREMETALWORKER to fit a linear mixed model using only chromosome X kinship for analyses of chromosome X markers.
- If --makeResiduals is used, then the trait is first adjusted for covariates, and the resulting residuals are used to fit linear models.
- If --inverseNormal is used, but not with --makeResiduals, then trait values are inverse normalized before fitting linear models.
- If --inverseNormal and --makeResiduals are used together, then covariates are adjusted and inverse normalized residuals are used to fit linear models.
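To make "inverse normalized" concrete, here is a minimal sketch of a rank-based inverse normal transformation that could be applied either to raw trait values or to covariate-adjusted residuals. This page does not state the exact formula RAREMETALWORKER uses, so the rank offset of 0.5, the average-rank handling of ties, and the function name are all assumptions.

```python
from statistics import NormalDist


def inverse_normal(values, offset=0.5):
    """Map values to normal quantiles by rank (assumed formula: (rank - offset) / n)."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of tied values starting at sorted position i.
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1  # 1-based average rank for the tied run
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    standard_normal = NormalDist()
    return [standard_normal.inv_cdf((r - offset) / n) for r in ranks]


# Hypothetical trait values; the middle value maps to the normal median.
print(inverse_normal([10.0, 30.0, 20.0]))
```

The transformation depends only on the ordering of the values, so it removes skew and outliers at the cost of discarding the original scale.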
- --traitName takes a string of the trait name that you want to analyze.
- If this option is not used, then all traits included in PED/DAT files are analyzed.
- If --recessive is used, then RAREMETALWORKER generates recessive results in addition to the additive results.
- The set of association results generated by default can be found in recessive output.
- A separate pdf file with QQ and Manhattan plots based on recessive results is generated with name yourprefix.traitname.recessive.plots.pdf.
- If --dominant is used, then RAREMETALWORKER generates dominant results in addition to the additive results.
- The set of association results generated by default can be found in dominant output.
- A separate pdf file with QQ and Manhattan plots based on dominant results is generated with name yourprefix.traitname.dominant.plots.pdf.
- If --kinPedigree is used, the pedigree structure coded in the PED file is used to generate a kinship matrix, which is then used to fit the linear mixed model before association testing.
- If --kinGeno is used, then a genomic relationship matrix (GRM) is estimated from genotypes.
- If --vcX option is used, then a separate genomic relationship matrix for chromosome X is also estimated.
- For details about how the GRM is estimated, please refer to methods.
- --kinFile takes a string of the file name of previously saved GRM with format described in format.
- This option reads the GRM from the file and then extracts the sub-matrix for the samples to be analyzed according to your specifications, such as traits to be analyzed, missing covariates and genotypes (please refer to missing data for more details).
- --kinFile cannot be used together with --kinGeno.
- --kinxFile must be used with --kinFile and --vcX.
- --kinxFile takes a string of file name of the previously saved GRM for chromosome X.
- If --kinxFile is not used, but --kinFile your.autosomal.Empirical.Kinship.gz --vcX are issued in a command line, then RAREMETALWORKER will look for a kinship X file named your.autosomal.Empirical.KinshipX.gz. If this file is still not found, a FATAL ERROR will occur.
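The fallback lookup described above can be sketched as a simple file-name derivation. The rule used here (replace a trailing ".Kinship.gz" with ".KinshipX.gz") is inferred from the single example on this page and is an assumption, as is the function name; the actual lookup logic in RAREMETALWORKER may differ.

```python
def default_kinx_path(kin_file):
    """Derive the chromosome X kinship file name from the autosomal one (assumed rule)."""
    suffix = ".Kinship.gz"
    if not kin_file.endswith(suffix):
        raise ValueError("expected a file name ending in " + suffix)
    return kin_file[: -len(suffix)] + ".KinshipX.gz"


print(default_kinx_path("your.autosomal.Empirical.Kinship.gz"))
```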
- --kinSave must be used with --kinGeno.
- Issuing --kinSave requests RAREMETALWORKER to store the estimated GRM in a file named yourprefix.Empirical.Kinship.gz.
- If --vcX is also issued in the command line, then a separate file named yourprefix.Empirical.KinshipX.gz will be generated where the GRM of chromosome X is saved.
- For formats of the saved genomic relationship matrix, please refer to format.
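The combination rules stated on this page for --vcX, --separateX and the kinship source options can be collected into a small pre-flight check, sketched below. The helper name is hypothetical and the check covers only the rules written on this page; RAREMETALWORKER itself may enforce additional constraints.

```python
def check_kinship_options(options):
    """Return a list of rule violations for a collection of option names."""
    opts = set(options)
    problems = []
    kinship_sources = {"--kinPedigree", "--kinGeno", "--kinFile"}
    if "--vcX" in opts and not (opts & kinship_sources):
        problems.append("--vcX requires --kinPedigree, --kinGeno, or --kinFile")
    if "--separateX" in opts and "--vcX" not in opts:
        problems.append("--separateX requires --vcX")
    if {"--kinFile", "--kinGeno"} <= opts:
        problems.append("--kinFile cannot be used together with --kinGeno")
    if "--kinxFile" in opts and not {"--kinFile", "--vcX"} <= opts:
        problems.append("--kinxFile requires --kinFile and --vcX")
    if "--kinSave" in opts and "--kinGeno" not in opts:
        problems.append("--kinSave requires --kinGeno")
    return problems


# A valid combination produces no problems; an invalid one lists each violation.
print(check_kinship_options(["--kinGeno", "--kinSave", "--vcX"]))
print(check_kinship_options(["--separateX", "--kinSave"]))
```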
- --kinMaf takes a value that specifies the MAF cutoff for variants to be used to estimate GRMs.
- The default is 0.05, which means variants with MAF<0.05 are not used for estimating GRMs.
- --kinMiss takes a value that specifies the missing genotype cutoff for variants to be used to estimate GRMs.
- The default is 0.05, which means variants with genotype call rate <0.95 are not used for estimating GRMs.
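The two cutoffs above can be illustrated with a per-variant filter. This is a sketch only: the genotype encoding (0, 1 or 2 copies of one allele, `None` for a missing call) and the function name are assumptions, and the page does not say how values exactly at a cutoff are treated, so the sketch keeps them.

```python
def passes_kinship_filters(genotypes, kin_maf=0.05, kin_miss=0.05):
    """Decide whether a variant would be used for GRM estimation (assumed encoding)."""
    total = len(genotypes)
    called = [g for g in genotypes if g is not None]
    if total == 0 or not called:
        return False
    # Variants with call rate below 1 - kin_miss are not used.
    call_rate = len(called) / total
    if call_rate < 1 - kin_miss:
        return False
    # Variants with minor allele frequency below kin_maf are not used.
    allele_freq = sum(called) / (2 * len(called))
    maf = min(allele_freq, 1 - allele_freq)
    return maf >= kin_maf


common = [1] * 4 + [0] * 16          # MAF 0.10, no missing calls
rare = [1] + [0] * 19                # MAF 0.025
sparse = [None] * 2 + [1] * 4 + [0] * 14  # call rate 0.90
print(passes_kinship_filters(common), passes_kinship_filters(rare), passes_kinship_filters(sparse))
```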
- --xLabel takes a string that is used as the label for chromosome X in your file.
- The default is "X".
- --xStart takes an integer that specifies the start position of the nonPAR region on chromosome X.
- The default is 2699520 based on Human Genome build 19.
- --xEnd takes an integer that specifies the end position of the nonPAR region on chromosome X.
- The default is 154931044 based on Human Genome build 19.
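As a small illustration of how the chromosome X label and the nonPAR boundaries fit together, the sketch below tests whether a marker lies in the nonPAR region using the build 19 defaults given above. The function name is hypothetical, and treating both boundary positions as inside the region is an assumption.

```python
def in_nonpar_region(chrom, pos, x_label="X", x_start=2699520, x_end=154931044):
    """Return True if a marker is on chromosome X within the nonPAR region."""
    # Boundaries are treated as inclusive here; this page does not specify.
    return chrom == x_label and x_start <= pos <= x_end


print(in_nonpar_region("X", 5_000_000))    # inside the nonPAR region
print(in_nonpar_region("X", 1_000_000))    # before the default start position
print(in_nonpar_region("23", 5_000_000, x_label="23"))  # custom chromosome X label
```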
- --cpu takes an integer that specifies the number of CPUs to use for estimating the kinship matrix from genotypes.
- --kinOnly allows users to estimate the kinship matrix without running association analysis on any traits included in the data set.
- To also estimate chromosome X kinship, --vcX option should be added in command line.
- --geneMap takes a string giving the path to the gene mapping file used for Manhattan plot annotation.
- The default is human genome build 19, saved in raremetal/data/refFlat_hg19.txt.
- --mergedVCFID allows RAREMETALWORKER to recognize VCF sample IDs in "FAMID_PID" format.
- The default value is OFF, which means VCF sample IDs are consistent with PID field in PED file.
- See PhoneHome for more information on how PhoneHome works and what it does.
- --noPhoneHome disables PhoneHome.
- PhoneHome is enabled by default based on the thinning parameter.
- --phoneHomeThinning (0-100) adjusts the frequency of PhoneHome.
- The default is 100, running 100% of the time.