Changes

From Genome Analysis Wiki
Jump to navigationJump to search
1,076 bytes removed ,  10:22, 26 October 2016
Line 55: Line 55:  
** Note: In the VCF file either PL or GL has to be provided, and only the PL (or GL) field is used in the calling.
 
** Note: In the VCF file either PL or GL has to be provided, and only the PL (or GL) field is used in the calling.
   −
* A map file in the PLINK format. See blow for examples how to generate a high quality map file.
+
* A map file in the PLINK format. See blow for examples how to generate a map file with common and high quality variants
    
== Examples of generating the map file ==
 
== Examples of generating the map file ==
Line 89: Line 89:     
== Filtering ==
 
== Filtering ==
We recommend two filtering strategies. The first is a simple filtering and the second one is more advance. Please see the triodenovo page below for more information:
+
We recommend two filtering strategies. The first is a simple filtering and the second one is more advanced. Please see the triodenovo page below for more information:
    
http://genome.sph.umich.edu/wiki/Triodenovo
 
http://genome.sph.umich.edu/wiki/Triodenovo
  −
3. Further thoughts about filtering for SNVs without bam files (step 2 requires bam files). There is no consensus on filtering so this can be very flexible.
  −
* If you have a multi-sample call VCF it may be helpful to select those mutation candidates that appear only once in your VCF (AC=1 for example). This can be the top tier to consider. Relaxing AC to 2 or 3 can recover more real mutations but also increase false positives.
  −
* If it is too stringent to filter out known sites, it may be helpful to select candidates that have low (e.g. <0.002)1000G or ESP allele frequencies. Some mutations can occur on know variant sites but mutations with high population frequencies may not be of great interest, if indeed they are real.
  −
* Candidates in segmental duplications, low complexity regions or other copy number regions may be flagged for further analysis.
  −
* Candidates for which parents are not hom-ref or offspring is a double mutant are more likely to be due to artifacts so the interpretation of these candidates may require additional QC if they appear to be interesting to the investigators.
      
== Download ==
 
== Download ==
480

edits

Navigation menu