Changes

Minimac (view source)

Revision as of 03:32, 12 October 2010

399 bytes added , 03:32, 12 October 2010

→‎Parameters

Line 42: Line 42:

==== Parameters ====

−

~~-p sample.ped~~

−

~~pedigree file in [http://www.sph.umich.edu/csg/abecasis/Merlin/tour/input_files.html Merlin] format~~

−

-d sample.dat

+

{| class="wikitable" border="1" cellpadding="2"

−

~~dat~~ file in [http://www.sph.umich.edu/csg/abecasis/Merlin/tour/input_files.html Merlin] ~~format~~

+

|- bgcolor="lightgray"

−

+

! Parameter

−

-~~-rounds R~~

+

! Description

−

~~how many iterations of the Markov sampler~~ should be ~~run~~.

+

|-

−

+

| <code>-d sample.dat</code>

−

--states ST

+

| Data file in [http://www.sph.umich.edu/csg/abecasis/Merlin/tour/input_files.html Merlin format]. It is important that markers should be listed according to their order along the chromosome.

−

~~use a random subset~~ of ST haplotypes ~~as reference. We recommend values between 200 - 500~~. ~~More states result in more accurate~~ haplotypes, but ~~are computational more expensive~~ and ~~require more~~ memory.

+

|-

−

+

| <code>-p sample.ped</code>

−

--~~interim I~~

+

| Pedigree file in [http://www.sph.umich.edu/csg/abecasis/Merlin/tour/input_files.html Merlin format]. It is important that alleles should be labeled on the forward strand.

−

~~output a set~~ of ~~best~~-~~guess haplotypes every I~~ rounds ~~by building consensus from all previous Markov iterations~~. ~~These intermediate haplotypes can be used for imputation~~.

+

|-

−

+

| <code>--states 200</code>

−

--~~sample SA~~

+

| Number of haplotypes to consider during each update. Increasing this value will typically lead to better haplotypes, but can also dramatically increase computing time and memory requirements. A value of 100 - 400 is typical.

−

~~output a set of haplotypes every SA rounds based on random sampling from the last Markov iteration. These~~ intermediate results ~~can~~ be ~~combined and used as input for the imputation process~~.

+

|-

−

+

| <code>--rounds 50</code>

−

--phase

+

| Iterations of the Markov sampler to use for haplotyping. Typically, using 20 - 100 rounds should give good results. To obtain better results, it is usually better to increase the <code>--states</code> parameter.

−

~~enables~~ [[MaCH]] ~~phasing mode~~.

+

|-

−

+

| <code>--interim 5</code>

−

--compact

+

| Request that intermediate results should be saved to disk periodically.

−

~~reduces~~ the ~~amount~~ of ~~memory needed dramatically, but doubles execution time~~.

+

|-

+

| <code>--phase</code>

+

| Tell [[MaCH]] to estimate phased haplotypes for each individual.

+

|-

+

| <code>--compact</code>

+

| Tell [[MaCH]] to reduce memory use at the cost of approximately doubling runtime. This option is recommended for most GWAS scale datasets and computing platforms.

+

|}

=== Step 2: Imputation ===

Goncalo

Bureaucrats, Administrators

1,555

edits

Changes

Minimac (view source)

Revision as of 03:32, 12 October 2010

Navigation menu

Page actions

Page actions

Personal tools

quick links

teaching

Navigation

Search

Tools