Changes

Sequencing Workshop Analysis of Indels (view source)

Revision as of 21:58, 15 June 2014

235 bytes added , 21:58, 15 June 2014

→‎Normalization

Line 189: Line 189:

==Normalization==

+

A slight digression here, when analyzing indels, it is important to normalize it. While it is a simple concept,

+

it is hardly standardized. The call set here had already been normalized but we feel that this is an important

+

concept so we discuss this a bit here.

Indel representation is not unique, you should normalize them and remove duplicates.

Line 240: Line 244:

| 0

| 374

−

|

+

| 0

−

|

+

| 0

|-

| Left aligned

Line 301: Line 305:

Time elapsed: 0.13s

−

~~The following will be slight faster:~~ + ~~denotes using of uncompressed~~ bcf ~~stream.~~

+

vt normalize mills.genotypes.bcf -r ~/ref/vt/grch37/hs37d5.fa -o + | vt mergedups + -o mills.normalized.genotypes.bcf

−

~~vt normalize mills.genotypes.bcf -r ~/ref/vt/grch37/hs37d5.fa -o + | vt mergedups + -o mills.normalized.genotypes.bcf~~

−

~~Also remember to index this file~~ and ~~extract the sites~~.

+

UMICH's algorithm for normalization has been adopted by Petr Danecek in bcftools and is also used in GKNO.

==to document==

Atks

1,102

edits

Changes

Sequencing Workshop Analysis of Indels (view source)

Revision as of 21:58, 15 June 2014

Navigation menu

Page actions

Page actions

Personal tools

quick links

teaching

Navigation

Search

Tools