Absolute genetic divergence

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Absolute genetic divergence, often denoted as dXY, is a measure used in population genetics to quantify the average number of nucleotide differences per site between two DNA sequences drawn from different populations. Unlike relative measures of divergence such as the Fixation index (FST), absolute genetic divergence is independent of the genetic diversity within the populations being compared.[1] It is frequently used in evolutionary biology to detect gene flow, estimate species divergence times, and investigate "genomic islands" of speciation.[2]

Definition and Calculation

[edit | edit source]

Absolute genetic divergence was formally defined by Masatoshi Nei in 1987. It represents the average number of pairwise nucleotide differences between all possible pairs of sequences where one is taken from population X and the other from population Y.[3]

Mathematically, it is calculated as:

dXY=i,jxiyjdij

Where:

  • xi is the frequency of the i-th haplotype in population X.
  • yj is the frequency of the j-th haplotype in population Y.
  • dij is the number of nucleotide differences between haplotype i and haplotype j.

Because it measures the average pairwise differences regardless of population subdivision, dXY is mathematically equivalent to the total nucleotide diversity (πTotal) if the two populations were pooled together.[1]

Comparison with Relative Divergence (FST)

[edit | edit source]

A major application of dXY is to distinguish between evolutionary processes that affect genetic variation within populations versus those that affect divergence between populations.

  • Relative Divergence (FST): Measures population differentiation relative to the total genetic variance. It can be inflated solely by a reduction in within-population diversity (π), even if the populations have not actually diverged significantly in terms of sequence mutations.[4]
  • Absolute Divergence (dXY): Measures the accumulation of sequence differences. It is generally not affected by current within-population diversity or selective sweeps that reduce local variation.[1]

In the study of "genomic islands of speciation" (regions of the genome with high differentiation), dXY is often used as a control statistic. If a genomic region has high FST but normal or low dXY, the differentiation is likely driven by reduced diversity (e.g., linked selection or background selection) rather than accelerated divergence or a barrier to gene flow.[1][5]

Relation to Ancestral Diversity

[edit | edit source]

Absolute divergence is influenced by the diversity of the ancestral population. The value of dXY reflects both the mutations accumulated after the populations split and the polymorphism that was present in the common ancestor.

E(dXY)=2μt+θAnc

Where:

  • μ is the mutation rate.
  • t is the time since divergence.
  • θAnc is the ancestral nucleotide diversity.[1]

Because of this relationship, dXY requires a significant amount of time to accumulate after speciation, leading to a "time lag" in its utility for detecting very recent divergence compared to other metrics.[2]

See also

[edit | edit source]

References

[edit | edit source]
  1. ^ a b c d e Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  5. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).