Line 25: |
Line 25: |
| | This option does some extra verification while reading M3VCF files. If your program crashed for some reason, try re-running it with this option ON and see if it gives an error message. | | | This option does some extra verification while reading M3VCF files. If your program crashed for some reason, try re-running it with this option ON and see if it gives an error message. |
| |} | | |} |
| + | |
| + | |
| | | |
| ===OUTPUT FILE OPTIONS=== | | ===OUTPUT FILE OPTIONS=== |
Line 39: |
Line 41: |
| | This option is used for converting input <font face=Courier>VCF</font> files to output <font face=Courier>M3VCF</font> files. It is also used when you import a <font face=Courier>M3VCF</font> file on a subset of variants or subset of samples and would wish to re-compress the sub-set file for efficient analysis in the future. | | | This option is used for converting input <font face=Courier>VCF</font> files to output <font face=Courier>M3VCF</font> files. It is also used when you import a <font face=Courier>M3VCF</font> file on a subset of variants or subset of samples and would wish to re-compress the sub-set file for efficient analysis in the future. |
| |} | | |} |
| + | |
| + | |
| | | |
| ==SITE FILTERING OPTIONS== | | ==SITE FILTERING OPTIONS== |
Line 53: |
Line 57: |
| | This option includes all sites with identifiers matching the value of <font face=Courier><chromosome></font face=Courier>. (m3vcftools only handles single chromosome files) | | | This option includes all sites with identifiers matching the value of <font face=Courier><chromosome></font face=Courier>. (m3vcftools only handles single chromosome files) |
| |- | | |- |
− | | <font face=Courier>--'''from-bp''' <integer></font> | + | | <font face=Courier>--'''from-bp''' <integer></font> <br> <font face=Courier>--'''to-bp''' <integer></font> |
− | | This option specifies a '''lower bound (inclusive)''' for the range of sites to be processed. This option must be used in conjunction with <font face=Courier>--'''chr''' </font>. | + | | This option specifies a '''lower / upper bound (inclusive)''' for the range of sites to be processed. This option must be used in conjunction with <font face=Courier>--'''chr''' </font>. |
| + | |- |
| + | | <font face=Courier>--'''positions''' <filename></font> <br> <font face=Courier>--'''exclude-positions''' <filename></font> |
| + | | This option is used to '''include / exclude''' a set of sites on the basis of list of positions in the given file. Each line of the input file should contain a (tab-separated) chromosome and position. The file can have comment lines that start with a "#", they will be ignored. Thus, <font face=Courier>M3VCF</font face=Courier> or <font face=Courier>VCF</font face=Courier> files can be used as <font face=Courier><filename></font face=Courier>. |
| + | |
| + | |} |
| + | |
| + | |
| + | ==OUTPUT OPTIONS== |
| + | |
| + | |
| + | ===OUTPUT ALLELE STATISTICS=== |
| + | |
| + | {| border="1" cellpadding="2" |
| + | |- bgcolor="lightgray" |
| + | ! scope="col" width="350px" | Parameter |
| + | ! scope="col" width="1225px" | Description |
| + | |- |
| + | | <font face=Courier>--'''freq '''</font> |
| + | | This option outputs the allele frequency for each site in a file with suffix <font face=Courier>".frq"</font face=Courier>. |
| + | |- |
| + | | <font face=Courier>--'''counts '''</font> |
| + | | This option outputs the raw allele count for each site in a file with suffix <font face=Courier>".frq.count"</font face=Courier>. |
| + | |} |
| + | |
| + | |
| + | |
| + | ===OUTPUT LINKAGE (LD) STATISTICS=== |
| + | |
| + | {| border="1" cellpadding="2" |
| + | |- bgcolor="lightgray" |
| + | ! scope="col" width="350px" | Parameter |
| + | ! scope="col" width="1225px" | Description |
| + | |- |
| + | | <font face=Courier>--'''hap-r2'''</font> |
| + | | This option outputs summary of linkage disequilibrium (reported as r2, D and D' statistics using phased haplotypes) in a file with suffix <font face=Courier>".hap.ld"</font face=Courier>. These statistics are only calculated for phased, bi-allelic sites. |
| + | |- |
| + | | <font face=Courier>--'''ld-window''' <integer> </br> --'''ld-window-min''' <integer></font> |
| + | | This optional parameter defines the '''maximum / minimum''' number of variants between the variants being analyzed. |
| |- | | |- |
− | | <font face=Courier>--'''to-bp''' <integer></font> | + | | <font face=Courier>--'''ld-window-bp''' <integer> </br> --'''ld-window-bp-min''' <integer></font> |
− | | This option specifies an '''upper bound (inclusive)''' for the range of sites to be processed. This option must be used in conjunction with <font face=Courier>--'''chr''' </font>
| + | | This optional parameter defines the '''maximum / minimum''' number of physical bases between the variants being analyzed. |
| |- | | |- |
− | | <font face=Courier>--'''positions''' <filename></font> | + | | <font face=Courier>--'''min-r2''' <float></font> |
− | | This option is used to '''include a set of sites''' on the basis of list of positions in the given file. Each line of the input file should contain a (tab-separated) chromosome and position. The file can have comment lines that start with a "#", they will be ignored. | + | | This optional parameter sets a minimum value of r2, below with LD statistics are NOT reported. It can be used in conjunction with the above parameters. |
− | |-
| |
− | | <font face=Courier>--'''exclude-positions''' <filename></font>
| |
− | | This option is used to exclude a set of sites on the basis of list of positions in the given file. . Each line of the input file should contain a (tab-separated) chromosome and position. The file can have comment lines that start with a "#", they will be ignored.
| |
| |} | | |} |
| | | |