From Genome Analysis Wiki
712 bytes added
, 10:28, 2 February 2017
Line 1: |
Line 1: |
| = Download EPACTS = | | = Download EPACTS = |
| | | |
− | EPACTS is available for download [http://www.sph.umich.edu/csg/kang/epacts/download/epacts_v2.12.noref_binary.2012_10_01.tar.gz here (100Mb) ]. | + | EPACTS is available for download [http://csg.sph.umich.edu//kang/epacts/download/epacts_v2.12.noref_binary.2012_10_01.tar.gz here (100Mb) ]. |
| | | |
| Requirements | | Requirements |
Line 11: |
Line 11: |
| | | |
| Uncompress the EPACTS package into the directory where you would like to install it | | Uncompress the EPACTS package into the directory where you would like to install it |
− | <pre> tar xzvf epacts_v2_12.noref_binary.2012_10_01.tar.gz</pre> | + | <pre> tar xzvf epacts_v2_12.noref_binary.2012_10_01.tar.gz</pre> |
− | <br>Download the reference FASTA files by running the following commands
| + | Download the reference FASTA files automatically from the 1000 Genomes FTP site by running the following commands<br> |
| <pre>cd epacts2.1/ | | <pre>cd epacts2.1/ |
− | ./ref_download.sh (Or copy the FASTA and index file locally you have to ${EPACTS_DIR}/ext/ref/) | + | ./ref_download.sh |
− | | + | (Advanced users: to avoid downloading the FASTA files (~900MB), you may instead copy a local GRCh37 FASTA file and its index file to ${EPACTS_DIR}/ext/ref/) |
| </pre> | | </pre> |
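The manual alternative mentioned above (copying a local GRCh37 FASTA and its index into ${EPACTS_DIR}/ext/ref/) could be sketched as follows. This is a minimal sketch, not part of EPACTS: the function name `copy_local_ref` is hypothetical, and it assumes the index sits next to the FASTA with a `.fai` suffix (the samtools faidx convention). EPACTS may expect specific file names in ext/ref/, so check ref_download.sh for the names it uses before relying on this.

```shell
# Hypothetical helper (not shipped with EPACTS): copy a local GRCh37 FASTA
# and its index into the EPACTS reference directory instead of downloading.
# Assumption: the index is named <fasta>.fai and sits next to the FASTA.
# Usage: copy_local_ref /path/to/reference.fasta /path/to/epacts_dir
copy_local_ref() {
    src_fasta="$1"
    epacts_dir="$2"
    # Refuse to continue unless both the FASTA and its index exist
    if [ ! -f "$src_fasta" ] || [ ! -f "$src_fasta.fai" ]; then
        echo "missing FASTA or index: $src_fasta" >&2
        return 1
    fi
    # Create the reference directory if needed, then copy both files
    mkdir -p "$epacts_dir/ext/ref" &&
    cp "$src_fasta" "$src_fasta.fai" "$epacts_dir/ext/ref/"
}
```

The function returns a non-zero status when either file is missing, so it can be chained with `&&` before running EPACTS.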
| | | |
Line 89: |
Line 89: |
| | | |
| </pre> | | </pre> |
− | Note: For variants that do not meet the minimum MAF threshold (MAF >= 0.001), the number of cases and controls (N.CASE, N.CTRL) are not outputted (listed as "NA").
| + | The columns in the results file are: |
| + | |
| + | #CHROM: chromosome |
| + | #BEGIN: starting position |
| + | #END: ending position (same as BEGIN if a SNP) |
| + | #MARKER_ID: name of variant
| + | #NS: Number of samples (cases + controls) |
| + | #AC: Total allele count in sample |
| + | #CALLRATE: call rate |
| + | #MAF: minor allele frequency in full sample |
| + | #PVALUE: score test association p-value |
| + | #SCORE: test statistic for score test |
| + | #N.CASE: number of cases |
| + | #N.CTRL: number of controls |
| + | #AF.CASE: allele frequency in cases only |
| + | #AF.CTRL: allele frequency in controls only |
| | | |
| + | Note: For variants below the minimum MAF threshold (min-maf = 0.001), the numbers of cases and controls (N.CASE, N.CTRL) are not reported (they are listed as "NA"). |
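As an illustration of working with these columns, the sketch below prints the MARKER_ID and PVALUE of variants whose p-value falls under a cutoff. It is only a sketch under stated assumptions: the function name `filter_epacts` is hypothetical, the results file is assumed to be uncompressed and tab-delimited with the column names on its first line (possibly prefixed with "#"), and rows whose PVALUE is "NA" are skipped.

```shell
# Hypothetical helper (not part of EPACTS): list variants below a p-value cutoff.
# Assumptions: tab-delimited, uncompressed input; first line is the header.
# Usage: filter_epacts results_file cutoff
filter_epacts() {
    awk -F'\t' -v cutoff="$2" '
        NR == 1 {
            sub(/^#/, "", $1)                      # header may start with "#"
            for (i = 1; i <= NF; i++) col[$i] = i  # map column name -> index
            if (!("PVALUE" in col) || !("MARKER_ID" in col)) {
                print "missing PVALUE or MARKER_ID column" > "/dev/stderr"
                exit 1
            }
            next
        }
        # Skip rows without a p-value, then compare numerically
        $col["PVALUE"] != "NA" && $col["PVALUE"] + 0 < cutoff + 0 {
            print $col["MARKER_ID"] "\t" $col["PVALUE"]
        }
    ' "$1"
}
```

Looking columns up by name rather than by position keeps the filter working if the column order differs between EPACTS tests. For a compressed results file, decompress it first (for example with `gzip -dc`) and pass the resulting file.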
| | | |
| + | <br> |
| | | |
| 2. '''test.epacts.top5000''' contains the top 5000 associated variants ordered by p-value. | | 2. '''test.epacts.top5000''' contains the top 5000 associated variants ordered by p-value. |