From Genome Analysis Wiki
621 bytes added, 15:38, 2 January 2014
Line 28: |
Line 28: |
| -o defines the output file, with STDOUT as the default. | | -o defines the output file, with STDOUT as the default. |
| You may modify the STDOUT to output the binary version of the format. | | You may modify the STDOUT to output the binary version of the format. |
| + | |
| + | == Uncompressed BCF streams == |
| + | |
| + | htslib is designed with BCF as the underlying data structure, and its I/O API is |
| + | aware of uncompressed BCF streams. Streaming uncompressed BCF records between vt commands |
| + | avoids converting each record to and from text VCF, which saves computation time. |
| + | |
| + | #using textual VCF streams indicated by - |
| + | cat mills.vcf | vt normalize - -r hs37d5.fa | vt mergedups - -o out.bcf |
| + | |
| + | #using uncompressed BCF streams indicated by + |
| + | cat mills.vcf | vt normalize - -r hs37d5.fa -o + | vt mergedups + -o out.bcf |
| + | |
| + | In this example, the former took 0.84s while the latter took 0.64s to process, a run-time reduction of about 24%. |
| + | |
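The 24% figure is the reduction in run time relative to the textual VCF pipeline. As a minimal sketch of that arithmetic, the helper below (`runtime_reduction` is a hypothetical name, not part of vt) takes the two timings quoted above and prints the percentage saved:

```shell
# Hypothetical helper, not part of vt: prints the relative run-time
# reduction in percent, given "before" and "after" timings in seconds.
runtime_reduction() {
    awk -v before="$1" -v after="$2" \
        'BEGIN { printf "%.0f\n", (before - after) / before * 100 }'
}

# Timings quoted in the example: textual VCF stream vs. uncompressed BCF stream.
runtime_reduction 0.84 0.64
```

The same helper can be applied to timings measured on other inputs, for instance by running each pipeline under `time` and passing the two elapsed values.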
| | | |
| == Alternate headers == | | == Alternate headers == |