Nucleotide diversity - π


Nucleotide diversity (π) measures the mean nucleotide differences per site between two randomly chosen sequences from a population. In simpler terms, it is the probability of two alleles being different at a given nucleotide.

The formula that represents π is the following:



where:

n is the number of sequences or individuals in the population,

dij is the number of nucleotide differences between sequences i and j,

L is the alignment length,

(n 2) is the number of possible pairs of sequences.


Nucleotide diversity values of 0.01-0.05 are considered high (found in Drosophila, for example), while 0.001 is considered low (found in humans).

Comments

Popular posts from this blog

How to interpret Tajima's D

How to easily obtain a VCF file based on BAM files using BCFtools

How to obtain an Admixture bar plot using ANGSD (ngsTools)