Difference between revisions of "Relationship between Ploidy, Alleles and Genotypes"

From Genome Analysis Wiki
Jump to navigationJump to search
Line 5: Line 5:
  
 
= Motivation =
 
= Motivation =
 +
 +
Plants species exhibit a diverse number of ploidy, for example, the strawberry is an octoploid and the pear is a triploid.
  
 
While there are explicit functions that could be googled for handling haploid and diploidy cases.  It seems to be difficult to find the closed forms for the general case.
 
While there are explicit functions that could be googled for handling haploid and diploidy cases.  It seems to be difficult to find the closed forms for the general case.
This wiki fills in that need.  The cases where one requires such extensions is when pooled samples are studied or when plant species that exhibit a diverse number of
+
This wiki fills in that need.
ploidy.
 
  
 
= The number of genotypes given a ploidy and alleles =
 
= The number of genotypes given a ploidy and alleles =

Revision as of 11:07, 31 January 2015

Introduction

The VCF format encodes genotypes by the index of the enumeration of genotypes give a ploidy number and alleles. This allows for direct access to a genotype value within an array particularly when one works with genotype likelihoods.

Motivation

Plants species exhibit a diverse number of ploidy, for example, the strawberry is an octoploid and the pear is a triploid.

While there are explicit functions that could be googled for handling haploid and diploidy cases. It seems to be difficult to find the closed forms for the general case. This wiki fills in that need.

The number of genotypes given a ploidy and alleles

where P is the ploidy number and A is the number of alleles.

The indexing of genotypes given a ploidy and alleles


where a_1, a_2 .... are the alleles in numeric encoding (0 to A-1) and are ordered (AB, ABCCCC). For example ACB is not ordered.

Simple cases

Ploidy Alleles Genotypes Index
1 A
2 A