Changes

From Genome Analysis Wiki
Jump to navigationJump to search
1,438 bytes added ,  15:29, 9 September 2017
Line 5: Line 5:  
== Download verifyBamID  ==
 
== Download verifyBamID  ==
   −
To get a copy go to the [http://www.sph.umich.edu/csg/kang/verifyBamID/download VerifyBamID Download] download page.
+
To get a copy of verifyBamId, go to: https://github.com/statgen/verifyBamID/releases
 +
 
 +
Select the latest release and download in one of 3 ways:
 +
# Binary expected to run in Ubuntu x64 platform. In other platforms, please download the source distribution and build it.
 +
#* verifyBamID.#.#.#.gz
 +
#* You will need to run "gunzip" on the .gz file
 +
# Souce Code including libStatGen (uses a fixed version of libStatGen)
 +
#* verifyBamIDLibStatGen.#.#.#.tgz
 +
#* Run "tar xvf" on this file.  Cd into the resulting directory & type make.
 +
# Source Code without libStatGen (allows alternative/newer versions of libStatGen)
 +
#* Source code (tar.gz) or Source code (zip)
 +
#* You will need to download libStatGen separately if you do not already have it.
 +
 
 +
 
 +
To get a copy of older releases go to the [http://csg.sph.umich.edu//kang/verifyBamID/download VerifyBamID Download] download page.
    
== Join in verifyBamID mailing list ==
 
== Join in verifyBamID mailing list ==
Line 12: Line 26:     
== What's new ==
 
== What's new ==
 +
 +
(2014/02/13)
 +
* Put verifyBamID in github.
 +
* Added PhoneHome/Version Checking to VerifyBamID
    
(2012/06/20)  
 
(2012/06/20)  
Line 31: Line 49:  
== Build verifyBamID  ==
 
== Build verifyBamID  ==
   −
The binary download of verifyBamID is available. You may use the version in Ubuntu 64-bit platform. To build verifyBamID, download the statgen library and run the following series of commands
+
The binary download of verifyBamID is available. You may use that version in Ubuntu 64-bit platform.  
  tar xzvf verifyBamID.20120620.tar.gz
+
 
  cd verifyBamID
+
If you download the source that includes libStatGen:
  make cloneLib (in the case ../libStatGen does not exist)
+
tar xvf verifyBamIDLibStatGen.#.#.#.tgz
 +
cd verifyBamID_#.#.#
 +
make
 +
Executable: verifyBamID/bin/verifyBamID
 +
 
 +
If you download the source without libStatGen:
 +
  tar xvf verifyBamID-#.#.#.tar.gz
 +
  cd verifyBamID-1.1.0
 +
  make cloneLib (if ../libStatGen does not exist)
 
  make
 
  make
  ./bin/verifyBamID
+
  Executable: ./bin/verifyBamID
 +
 
 +
Note that '''make cloneLib''' command will create a directory ../libStatGen under your verifyBamID directory, and '''make''' will create binary of verifyBamID under verifyBamID/bin/
 +
 
 +
If you have a different version of libStatGen at that path, then skip the cloneLib step.  If the libStatGen you want to use is at a different location then update verifyBamID's Makefile.inc.  Replace: LIB_PATH_VERIFY_BAM_ID ?= $(LIB_PATH_GENERAL) with
 +
LIB_PATH_VERIFY_BAM_ID = /path/to/libStatGen
   −
Note that '''git clone''' command will create a directory ./libStatGen under your working directory, and '''make''' will create binary of verifyBamID under verifyBamID/bin/
      
verifyBamID is designed to be reasonably portable.  
 
verifyBamID is designed to be reasonably portable.  
   −
However, since development occurs only on Ubuntu 9.10 x86 and x64 platforms, and later, there are likely other portability issues.  
+
However, since development occurs only on Ubuntu (9.10-13.10) x86 and x64 platforms, and later, there are likely other portability issues.  
   −
Currently we support verifyBamID only on Ubuntu 9.10 and later on 64-bit processors.
      
== Basic Usage ==
 
== Basic Usage ==
Line 104: Line 133:     
== Interpreting output files ==
 
== Interpreting output files ==
 +
 +
See also [[Understanding VerifyBamID output]].
    
=== Output files ===
 
=== Output files ===
Line 125: Line 156:  
# FREEMIX : Sequence-only estimate of contamination (0-1 scale)
 
# FREEMIX : Sequence-only estimate of contamination (0-1 scale)
 
# FREELK1 : Maximum log-likelihood of the sequence reads given estimated contamination under sequence-only method
 
# FREELK1 : Maximum log-likelihood of the sequence reads given estimated contamination under sequence-only method
# FREELK0 : Log-ikelihood of the sequence reads given no contamination under sequence-only method
+
# FREELK0 : Log-likelihood of the sequence reads given no contamination under sequence-only method
 
# FREE_RH : Estimated reference bias parameter Pr(refBase|HET) (when --free-refBias or --free-full is used)
 
# FREE_RH : Estimated reference bias parameter Pr(refBase|HET) (when --free-refBias or --free-full is used)
 
# FREE_RA : Estimated reference bias parameter Pr(refBase|HOMALT) (when --free-refBias or --free-full is used)
 
# FREE_RA : Estimated reference bias parameter Pr(refBase|HOMALT) (when --free-refBias or --free-full is used)
Line 134: Line 165:  
# CHIP_RA : Estimated reference bias parameter Pr(refBase|HOMALT) (when --chip-refBias or --chip-full is used)
 
# CHIP_RA : Estimated reference bias parameter Pr(refBase|HOMALT) (when --chip-refBias or --chip-full is used)
 
# DPREF : Depth (Coverage) of HomRef site (based on the genotypes of (SELF_SM/BEST_SM), passing mapQ, baseQual, maxDepth thresholds.
 
# DPREF : Depth (Coverage) of HomRef site (based on the genotypes of (SELF_SM/BEST_SM), passing mapQ, baseQual, maxDepth thresholds.
# RDPHET : DPHET/DPREF, Relative depth at Heterozygous site.
+
# RDPHET : DPHET/DPREF, Relative depth to HomRef site at Heterozygous site.
# RDPALT : DPHET/DPREF, Relative depth at HomAlt site.
+
# RDPALT : DPHET/DPREF, Relative depth to HomRef site at HomAlt site.
    
=== A guideline to interpret output files ===
 
=== A guideline to interpret output files ===
Line 167: Line 198:  
                                           --refAlt [0.00]
 
                                           --refAlt [0.00]
 
                           Output options : --out [], --verbose
 
                           Output options : --out [], --verbose
 
+
                              PhoneHome : --noPhoneHome,
 +
                                          --phoneHomeThinning [50]
    
Each option provides the following features:
 
Each option provides the following features:
Line 182: Line 214:  
* --free-none : Do not perform sequence-only method to estimate parameters
 
* --free-none : Do not perform sequence-only method to estimate parameters
 
* --free-mix : (default) Estimate contamination using sequence-only method with Brent's single dimensional optimization.
 
* --free-mix : (default) Estimate contamination using sequence-only method with Brent's single dimensional optimization.
* --free-refBias : Estimate the refernece bias parameters using sequence-only method with Simplex method
+
* --free-refBias : Estimate the reference bias parameters using sequence-only method with Simplex method
 
* --free-full : Estimate both reference bias parameters and the contamination parameters using sequence-only method
 
* --free-full : Estimate both reference bias parameters and the contamination parameters using sequence-only method
 
* --chip-none : Do not perform sequence+array method to estimate parameters
 
* --chip-none : Do not perform sequence+array method to estimate parameters
* --free-mix : (default) Estimate contamination using sequence+array method with Brent's single dimensional optimization.
+
* --chip-mix : (default) Estimate contamination using sequence+array method with Brent's single dimensional optimization.
* --free-refBias : Estimate the refernece bias parameters using sequence+array method with Simplex method
+
* --chip-refBias : Estimate the refernece bias parameters using sequence+array method with Simplex method
* --free-full : Estimate both reference bias parameters and the contamination parameters using sequence+array method
+
* --chip-full : Estimate both reference bias parameters and the contamination parameters using sequence+array method
 
* --ignoreRG : ignore the read grouup level comparison and compare samples only (recommended for an expedited run)
 
* --ignoreRG : ignore the read grouup level comparison and compare samples only (recommended for an expedited run)
 
* --ignoreOverlapPair : ignore overlapping pair end fragment covering the same base. Disabling this option may decrease the sensitivity of the method when the insert size is short (with slight gain in the computational speed)
 
* --ignoreOverlapPair : ignore overlapping pair end fragment covering the same base. Disabling this option may decrease the sensitivity of the method when the insert size is short (with slight gain in the computational speed)
Line 201: Line 233:  
* --out : output file prefix (required)
 
* --out : output file prefix (required)
 
* --verbose : print the progress of the method on the screeen
 
* --verbose : print the progress of the method on the screeen
 +
{{PhoneHomeParameters|hdr=====|bullet=1}}
    
== Principle of Operation ==
 
== Principle of Operation ==
Line 219: Line 252:       −
== For array intensity data ==
+
== Contamination in Array Data ==
 
  −
[[VerifyIDintensity]] software can estimate sample contamination from Illumina genotype array intensity data.
      +
[[VerifyIDintensity]] or [[BAFRegress]] can estimate sample contamination from Illumina genotype array data.
    
== Acknowledgements ==
 
== Acknowledgements ==
   −
VerifyBamID is a result from collaborative effort by Hyun Min Kang, Goo Jun, Matthew Flickinger, Mary Kate Wing, and Goncalo Abecasis. Please email to Hyun Min Kang [[mailto:hmkang@umich.edu| hmkang@umich.edu ]] for any questions.
+
VerifyBamID is a result from collaborative effort by Hyun Min Kang, Goo Jun, Matthew Flickinger, Mary Kate Wing, Goncalo Abecasis, and Michael Boehnke. Please email to Hyun Min Kang [hmkang@umich.edu ] for any questions.

Navigation menu