Line 1: |
Line 1: |
− | '''RAREMETALWORKER''' is a tool for generating summary statistics for rare variants and gene level meta analyses using [http://genome.sph.umich.edu/wiki/RAREMETAL '''RAREMETAL''']. | + | [[Category:RAREMETALWORKER]] |
| + | |
| + | '''RAREMETALWORKER''' is a tool for single variant analysis, generating summary statistics for gene level meta analyses in [http://genome.sph.umich.edu/wiki/RAREMETAL '''RAREMETAL''']. |
| | | |
| If you feel this program is useful, please tell us your name and contact in this [https://docs.google.com/spreadsheet/ccc?key=0AuYjznTeEDYudFpqUk9sQ2pkN3d3endjYldqMEp6ZUE&usp=sharing '''registration''']. | | If you feel this program is useful, please tell us your name and contact in this [https://docs.google.com/spreadsheet/ccc?key=0AuYjznTeEDYudFpqUk9sQ2pkN3d3endjYldqMEp6ZUE&usp=sharing '''registration''']. |
| | | |
− | If you have any questions, please contact [[Shuang Feng|'''Shuang Feng''']] sfengsph at umich dot edu or [[Goncalo_Abecasis | '''Goncalo Abecasis''']] goncalo at umich dot edu. | + | If you have any questions, please contact Sai Chen (saichen at umich dot edu) or [[Goncalo_Abecasis | '''Goncalo Abecasis''']] (goncalo at umich dot edu). |
| | | |
| | | |
Line 15: |
Line 17: |
| * The [[RAREMETAL_Documentation | '''RAREMETAL documentation''']] | | * The [[RAREMETAL_Documentation | '''RAREMETAL documentation''']] |
| * The [[RAREMETAL FAQ | '''FAQ''']] | | * The [[RAREMETAL FAQ | '''FAQ''']] |
| + | * The [[RAREMETAL_Change_Log | '''Change Log''']] |
| | | |
| == Key Features == | | == Key Features == |
Line 46: |
Line 49: |
| | | |
| Method description and key formulae can be found in [http://genome.sph.umich.edu/wiki/RAREMETALWORKER_method '''RAREMETALWORKER METHOD''']. | | Method description and key formulae can be found in [http://genome.sph.umich.edu/wiki/RAREMETALWORKER_method '''RAREMETALWORKER METHOD''']. |
| + | |
| + | ==For Binary Traits== |
| + | |
| + | RAREMETALWORKER currently treat all traits as quantitative. If your trait is binary, the odds ratio can be approximated from effect size estimates generated by RAREMETALWORKER. The installation/source package has a script included to augment the odds ratio estimates to the last column of the RAREMETALWORKER output. For details, please refer to [[RAREMETAL_DOWNLOAD_%26_BUILD#Calculating_Odds_Ratio_from_RAREMETALWORKER_output | '''Calculate Odds Ratio from RAREMETALWORKER output''']]. |
| | | |
| == Software Specifications == | | == Software Specifications == |
Line 55: |
Line 62: |
| For detailed description of command options, please go to [[RAREMETALWORKER_command_reference | '''command reference''']]. | | For detailed description of command options, please go to [[RAREMETALWORKER_command_reference | '''command reference''']]. |
| | | |
− | RAREMETALWORKER 0.4.9 -- A Forerunner of RareMetal
| |
− | (c) 2012-2014 Shuang Feng, Dajiang Liu, Goncalo Abecasis
| |
− |
| |
− | Please go to "http://genome.sph.umich.edu/wiki/RAREMETALWORKER#Where_to_Download" for the latest version.
| |
− |
| |
| Options: | | Options: |
| Input Files : --ped [], --dat [], --vcf [], --dosage, --noeof | | Input Files : --ped [], --dat [], --vcf [], --dosage, --noeof |
Line 72: |
Line 74: |
| Chromosome X : --xLabel [X], --xStart [2699520], --xEnd [154931044], | | Chromosome X : --xLabel [X], --xStart [2699520], --xEnd [154931044], |
| --maleLabel [1], --femaleLabel [2] | | --maleLabel [1], --femaleLabel [2] |
− | others : --cpu [1] | + | others : --cpu [1], --kinOnly, |
| + | --geneMap [../data/refFlat_hg19.txt] |
| PhoneHome : --noPhoneHome, --phoneHomeThinning [100] | | PhoneHome : --noPhoneHome, --phoneHomeThinning [100] |
| | | |
Line 82: |
Line 85: |
| * When PED file has genotypes saved, there is no need for a VCF file as input. | | * When PED file has genotypes saved, there is no need for a VCF file as input. |
| * RMW takes PED/DAT file in Merlin format. Please refer to [http://www.sph.umich.edu/csg/abecasis/merlin/tour/input_files.html PED/DAT format description] for details. | | * RMW takes PED/DAT file in Merlin format. Please refer to [http://www.sph.umich.edu/csg/abecasis/merlin/tour/input_files.html PED/DAT format description] for details. |
| + | * PED file requires "dummy" parents to be included in the pedigree file. To check the integrity of your PED/DAT file, please use [http://www.sph.umich.edu/csg/abecasis/PedStats '''pedstats''']. To add dummy parents into the pedigree, please use the [[Media:Script.tgz | '''perl script''']]. |
| * An example PED file is in the following: | | * An example PED file is in the following: |
| 1 1 0 0 1 1.5 1 23 A A A A A A A A A A | | 1 1 0 0 1 1.5 1 23 A A A A A A A A A A |
Line 137: |
Line 141: |
| * If --zip option is used, then the following will be generated automatically: | | * If --zip option is used, then the following will be generated automatically: |
| prefix.traitName.singlevar.score.txt.gz | | prefix.traitName.singlevar.score.txt.gz |
| + | prefix.traitName.singlevar.score.txt.gz.tbi |
| prefix.traitName.singlevar.cov.txt.gz | | prefix.traitName.singlevar.cov.txt.gz |
| + | prefix.traitName.singlevar.cov.txt.gz.tbi |
| prefix.singlevar.log | | prefix.singlevar.log |
| | | |
Line 157: |
Line 163: |
| * In the file with summary statistics named prefix.traitName.singlevar.score.txt contains summary statistics that are needed by Rare-Metal. An example is shown in below: | | * In the file with summary statistics named prefix.traitName.singlevar.score.txt contains summary statistics that are needed by Rare-Metal. An example is shown in below: |
| | | |
− | LDL mean= -0.00, variance= 1.00, heritability= 34.30
| + | LDL mean= -0.00, variance= 1.00, heritability= 34.30 |
− | CHR POS REF_ALLELE ALT_ALLELE INFORMATIVE_N FOUNDER_AF ALL_AF INFORMATIVE_AC HWE_PVALUE STAT ALT_ALLELE_EFFSIZE PVALUE
| + | CHR POS REF_ALLELE ALT_ALLELE INFORMATIVE_N FOUNDER_AF ALL_AF INFORMATIVE_AC HWE_PVALUE STAT ALT_ALLELE_EFFSIZE PVALUE |
− | 10 45410002 G A 6103 0.0341589 0.0341589 410 0.165893 126.205 0.309798 4.03074e-10 | + | 10 45410002 G A 6103 0.034159 0.034159 410 0.165893 126.2050 0.309798 4.030740e-10 |
− | 19 45412079 G A 6103 0.0368124 0.0368124 434 0.714645 -265.84 -0.587356 7.87851e-36 | + | 19 45412079 G A 6103 0.036812 0.036812 434 0.714645 -265.8400 -0.587356 7.878510e-36 |
− | 19 45414451 G A 6103 0.444989 0.444989 5312 0.0759271 -26.1212 -0.00837122 0.640058 | + | 19 45414451 G A 6103 0.444989 0.444989 5312 0.075927 -26.1212 -0.008371 6.400580e-01 |
| + | |
| | | |
| * pvalues from the above output are from the family-based single variant score test. | | * pvalues from the above output are from the family-based single variant score test. |
Line 168: |
Line 175: |
| * prefix.traitName.singlevar.cov.txt contains the LD matrix among a variant and the adjacent markers within a prefixed-sized window. The default window size is 1MB. It has the following format: | | * prefix.traitName.singlevar.cov.txt contains the LD matrix among a variant and the adjacent markers within a prefixed-sized window. The default window size is 1MB. It has the following format: |
| | | |
− | CHR POS VAR_POS_IN_WINDOW LD_MATRIX
| + | CHR POS VAR_POS_IN_WINDOW LD_MATRIX |
− | 1 762320 762320,865628,865665,878744,879381,1560000 0.0359084,-0.000242112,-0.00125797,-0.000993422,-0.000344509,-0.00017077,
| + | 1 762320 762320,865628,865665,878744,879381,1560000 0.0359084,-0.000242112,-0.00125797,-0.000993422,-0.000344509,-0.00017077, |
− | 1 865628 865628,865665,878744,879381,1560000,1864659 0.419804,-0.0103663,-0.00635265,0.0594056,0.0534505,-0.00462183,
| + | 1 865628 865628,865665,878744,879381,1560000,1864659 0.419804,-0.0103663,-0.00635265,0.0594056,0.0534505,-0.00462183, |
− | 1 878744 878744,879381,1560000,1864659,1877659 0.000404537,-0.000235215,-1.4455e-05,-8.69137e-06,-3.1027e-05,
| + | 1 878744 878744,879381,1560000,1864659,1877659 0.000404537,-0.000235215,-1.4455e-05,-8.69137e-06,-3.1027e-05, |
| | | |
| =====Genomic Relationship Matrix (GRM)===== | | =====Genomic Relationship Matrix (GRM)===== |
Line 325: |
Line 332: |
| | | |
| [http://genome.sph.umich.edu/wiki/Tutorial:_RareMETAL '''RAREMETAL and RAREMETALWORKER Tutorial'''] | | [http://genome.sph.umich.edu/wiki/Tutorial:_RareMETAL '''RAREMETAL and RAREMETALWORKER Tutorial'''] |
− |
| |
− | == Change Log ==
| |
− | * Version 0.0.1 was released on 11/13/2012.
| |
− | * Modified Rare-Metal-Worker to let it output LD matrix by a sliding window. (11/14/2012)
| |
− | * Uploaded to public wiki. (11/16/2012)
| |
− | * Enabled writing log file by defalut. (11/18/2012)
| |
− | * Forced sample IDs to be matched when reading in kinship from a file. Perform a sanity check before reading in kinship file. If a sample of interest is not included in kinship file, then fatal error will occur. (11/19/2012)
| |
− | * Added HWE pvalue and call rate in summary statistics output. (11/27/2012)
| |
− | * Bugs fixed to solve compiling errors on some machines (Thank you Mary Kate!). Version 0.0.2 released. (11/30/2012)
| |
− | * Updated output format. Version 0.0.3 released. (12/3/2012)
| |
− | * More messages coded into log file. (12/4/2012)
| |
− | * Version 0.0.4 released. (12/5/2012)
| |
− | * Bug fixed for empirical kinship calculation when genotypes are read from VCF file. Version 0.0.5 released. (12/6/2012)
| |
− | * Version 0.0.6 released. (12/6/2012)
| |
− | * Updated output format for monomorphic sites. (12/7/2012)
| |
− | * Changed executable name into bin/raremetalworker. Version 0.0.7 released. (12/10/2012)
| |
− | * Fixed a bug when reading vcf file with ref or alt allele is missing. (2/5/2013)
| |
− | * Fixed a bug when there is missing genotype from VCF file. (2/2013)
| |
− | * Fixed a bug when handling chromosome X. Added sex labels option. (3/2/2013)
| |
− | * Optimized code to speed up the process of calculating empirical kinship. (3/3/2013)
| |
− | * Updated code to report allele frequencies calculated only from selected samples. (3/3/2013)
| |
− | * Fixed bug in handling chromosome X. Added sanity checking steps before analysis. Added graphic support by generating QQ and manhattan plots automatically. Upgraded tool to version 2.8. (till 8/12/2013)
| |
− | * Added support for analyzing dosages from VCF in version 2.9. (8/27/2013)
| |
− | * Fixed the bug which causes crash when writing PDF when all variants are monomorphic. (10/6/2013)
| |
− | * Fixed a few bugs handling chromosome X. Generated warning messages when male genotypes are coded wrong in VCF file. (11/25/2013)
| |
− | * Released version 0.3.6 and fixed a minor bug that caused by code upgrades from version 0.3.5. (12/4/2013)
| |
− | * Released version 0.3.7. Added dominant and recessive models as options. The default model is additive. (1/7/2014)
| |
− | * Released version 0.4.0. Added phone home function. Saved Recessive and dominant results in separate files.
| |
− | * Released version 0.4.1. Fixed a bug handling variants in nonPAR region on chromosome X when all samples are male.
| |
− | * Released version 0.4.2. Fixed a bug that could possibly cause compiling error in some Linux system. Also, in this version, male heterozygous genotypes on chromosome X are considered missing. (3/10/2014)
| |
− | * Released version 0.4.3. Fixed a few typo in messages. Added --noeof option for VCF files that does not have a BGZF EOF marker.
| |
− | * Released version 0.4.4. Fixed hwe=0.0 issue for monomorphic sites. (3/17/2014)
| |
− | * Released version 0.4.5. Fixed a bug when generating plots for recessive results. (3/18/2014)
| |
− | * Released version 0.4.6 binary. Fixed a bug in recessive and dominant results. (4/1/2014)
| |
− | * Released version 0.4.7 binary and source code. Optimized code to increase analysis efficiency and reduce memory use. Added --separateX option to provide another choice of analyzing chromosome X. Added --kinxFile option to allow kinship X matrix file with different prefix from the autosomal kinship.(4/2/2014)
| |