From Genome Analysis Wiki
Jump to navigationJump to search
125 bytes added
, 16:25, 12 December 2014
Line 454: |
Line 454: |
| For example, | | For example, |
| zcat $OUT/wgs.filtered.rsid.anno.exon.vcf.gz | grep "EXTFILTER=NA,NA" | grep -v -w "0/0" | perl -lane 'print "$1\t$F[6]" if ( /ANNO=([^;:]+)/)' | sort | uniq -c | | zcat $OUT/wgs.filtered.rsid.anno.exon.vcf.gz | grep "EXTFILTER=NA,NA" | grep -v -w "0/0" | perl -lane 'print "$1\t$F[6]" if ( /ANNO=([^;:]+)/)' | sort | uniq -c |
− |
| |
− | will give you the counts of variants, separate by the filtering results
| |
− |
| |
− | Also remove those in dbsnp:
| |
− | zcat $OUT/wgs.filtered.rsid.anno.exon.vcf.gz | grep "EXTFILTER=NA,NA" | grep -v -w "0/0" | grep -v rs| perl -lane 'print "$1\t$F[6]" if ( /ANNO=([^;:]+)/)' | sort | uniq -c
| |
| | | |
| will give you the counts of variants, separate by the filtering results | | will give you the counts of variants, separate by the filtering results |
Line 465: |
Line 460: |
| * Q2. Looking at each functional category, which functional categories has largest fraction of SNPs failed filter? Why do you think it is? | | * Q2. Looking at each functional category, which functional categories has largest fraction of SNPs failed filter? Why do you think it is? |
| * Q3. Can you exclude the sites that are also in dbSNP, and count how many nonsense variants are left? | | * Q3. Can you exclude the sites that are also in dbSNP, and count how many nonsense variants are left? |
| + | |
| + | |
| + | To also exclude those in dbsnp: |
| + | zcat $OUT/wgs.filtered.rsid.anno.exon.vcf.gz | grep "EXTFILTER=NA,NA" | grep -v -w "0/0" | grep -v rs| perl -lane 'print "$1\t$F[6]" if ( /ANNO=([^;:]+)/)' | sort | uniq -c |
| + | |
| + | Look at Stop_Gain variants |
| + | zcat $OUT/wgs.filtered.rsid.anno.exon.vcf.gz | grep "EXTFILTER=NA,NA" | grep -v -w "0/0" |grep -v rs | perl -lane 'print "$_" if ( /ANNO=Stop_Gain/)' |grep -w PASS |
| + | |
| | | |
| If you want to know predicted functional significance of a particular variant, you can search by | | If you want to know predicted functional significance of a particular variant, you can search by |