From Genome Analysis Wiki
Jump to navigationJump to search
6 bytes added
, 16:56, 3 February 2012
Line 114: |
Line 114: |
| The VCF format is a simple text format. It starts with several header lines, which all start with the two '##' characters, and is followed by a single line per marker that provides both summary information about the marker and genotypes for each individual. You can review the contents of the VCF file using the 'more' command: | | The VCF format is a simple text format. It starts with several header lines, which all start with the two '##' characters, and is followed by a single line per marker that provides both summary information about the marker and genotypes for each individual. You can review the contents of the VCF file using the 'more' command: |
| | | |
− | more mpileup/chr20.mpileup.vcf | + | more result/chr20.mpileup.vcf |
| | | |
| Here are some questions for you to investigate: | | Here are some questions for you to investigate: |
Line 120: |
Line 120: |
| * How many variant sites were detected in this dataset? Try a command like this one: | | * How many variant sites were detected in this dataset? Try a command like this one: |
| | | |
− | grep -vE ^# chr20.mpileup.vcf | wc -l | + | grep -vE ^# result/chr20.mpileup.vcf | wc -l |
| | | |
| (The grep command line excludes all lines beginning with # and then the wc command counts the number of lines in the file). | | (The grep command line excludes all lines beginning with # and then the wc command counts the number of lines in the file). |