Absolute genetic divergence
Absolute genetic divergence, often denoted as , is a measure used in population genetics to quantify the average number of nucleotide differences per site between two DNA sequences drawn from different populations. Unlike relative measures of divergence such as the Fixation index (), absolute genetic divergence is independent of the genetic diversity within the populations being compared.[1] It is frequently used in evolutionary biology to detect gene flow, estimate species divergence times, and investigate "genomic islands" of speciation.[2]
Definition and Calculation
[edit | edit source]Absolute genetic divergence was formally defined by Masatoshi Nei in 1987. It represents the average number of pairwise nucleotide differences between all possible pairs of sequences where one is taken from population X and the other from population Y.[3]
Mathematically, it is calculated as:
Where:
- is the frequency of the i-th haplotype in population X.
- is the frequency of the j-th haplotype in population Y.
- is the number of nucleotide differences between haplotype i and haplotype j.
Because it measures the average pairwise differences regardless of population subdivision, is mathematically equivalent to the total nucleotide diversity () if the two populations were pooled together.[1]
Comparison with Relative Divergence ()
[edit | edit source]A major application of is to distinguish between evolutionary processes that affect genetic variation within populations versus those that affect divergence between populations.
- Relative Divergence (): Measures population differentiation relative to the total genetic variance. It can be inflated solely by a reduction in within-population diversity (), even if the populations have not actually diverged significantly in terms of sequence mutations.[4]
- Absolute Divergence (): Measures the accumulation of sequence differences. It is generally not affected by current within-population diversity or selective sweeps that reduce local variation.[1]
In the study of "genomic islands of speciation" (regions of the genome with high differentiation), is often used as a control statistic. If a genomic region has high but normal or low , the differentiation is likely driven by reduced diversity (e.g., linked selection or background selection) rather than accelerated divergence or a barrier to gene flow.[1][5]
Relation to Ancestral Diversity
[edit | edit source]Absolute divergence is influenced by the diversity of the ancestral population. The value of reflects both the mutations accumulated after the populations split and the polymorphism that was present in the common ancestor.
Where:
- is the mutation rate.
- is the time since divergence.
- is the ancestral nucleotide diversity.[1]
Because of this relationship, requires a significant amount of time to accumulate after speciation, leading to a "time lag" in its utility for detecting very recent divergence compared to other metrics.[2]
See also
[edit | edit source]References
[edit | edit source]- ^ a b c d e Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).