Mitochondrial diversity across
wolf populations worldwide

Population genomics mtDNA · 16,774 bp 100 sequences · 12 populations NJ phylogeny Status: complete

Using complete mitochondrial genomes from 100 wolves across 12 populations and subspecies, this analysis quantifies nucleotide diversity, reconstructs population relationships, and identifies the severe genetic bottleneck afflicting North American wolves — with direct implications for Utah reintroduction planning.

14×
diversity gap: Iberian vs. Mexican wolf
Iberian wolves (π = 0.537%) have 14 times more mitochondrial diversity than Mexican wolves (π = 0.038%) — a direct signal of the Mexican wolf's severe historical bottleneck.
~4.3%
Eastern Timber Wolf divergence
Eastern Timber Wolves sit as a deep outgroup, separated from all other populations by ~4.3% p-distance — roughly 10× more distant than intra-North American comparisons.
0.08%
North American Wolf mtDNA diversity
Northern Rocky Mountain wolves — the most likely Utah reintroduction source — carry 6× more diversity than Mexican wolves, making them the strongest genomic candidate for restoration.

Why genetic diversity matters for reintroduction

When a wolf population is reintroduced, it starts from a small number of founding individuals. Those founders carry only a fraction of the genetic diversity present in the source population. Over subsequent generations, that diversity can only decrease — never increase — without gene flow from outside. This is why the genetic health of the source population is one of the most consequential decisions in a reintroduction program.

Genetic diversity is the raw material for adaptation. Populations with low diversity are more vulnerable to inbreeding depression (reduced fertility and survival from harmful recessive alleles), less capable of responding to novel diseases and environmental change, and more likely to accumulate deleterious mutations over time. The Mexican gray wolf recovery program offers a cautionary example: descended from just seven founders, the captive and wild populations still show measurable effects of this extreme bottleneck more than four decades later.

What is nucleotide diversity (π)?

Nucleotide diversity measures the average percentage of DNA bases that differ between any two randomly chosen sequences from a population. A value of π = 0.5% means that if you picked two wolves at random and compared their mitochondrial genomes base-by-base, about 1 in 200 positions would differ. Higher π = more diversity = healthier population genetics. For context, the complete mitochondrial genome analyzed here is 16,774 base pairs — so π = 0.1% represents roughly 17 differences per genome.

What is mitochondrial DNA?

mtDNA is a small circular genome (16,774 bp here) inherited entirely from the mother. Because it doesn't recombine, mutations accumulate linearly over time — making it ideal for tracing maternal lineages, measuring bottlenecks, and reconstructing population histories.

What is a genetic bottleneck?

A bottleneck occurs when a population shrinks dramatically, then recovers from a small number of survivors. Even if the population grows back to large numbers, it retains the limited diversity of those few founders — exactly what happened to Mexican gray wolves in the 20th century.

What is a phylogenetic tree?

A tree that shows evolutionary relationships based on genetic similarity. Populations that share more recent common ancestors cluster together with shorter branch lengths. The trees here were built using the Neighbor-Joining method on pairwise p-distances across complete mitochondrial genomes.

What is p-distance?

The simplest genetic distance measure: the proportion of nucleotide positions that differ between two sequences. A p-distance of 0.01 (1%) between two populations means 1 in 100 bases differs on average. It's used here for both pairwise comparisons and tree reconstruction.

Nucleotide diversity by population

The most immediate result is the enormous range in mitochondrial diversity across wolf populations. European wolves — particularly Iberian wolves (π = 0.537%) and French wolves (π = 0.524%) — carry roughly 6–14 times more diversity than their North American counterparts. North American wolves (π = 0.084%) and Mexican wolves (π = 0.038%) sit near the bottom of the distribution, with Arctic and Eastern Timber wolves so low that single-sample populations register near zero.

The large "Wolf (Unknown)" category (n=55, π = 1.40%) is an artifact of pooling geographically diverse sequences under a single label — when combined, their geographic diversity creates apparent high diversity. The populations with known geographic labels are more informative.

Figure 1
Mitochondrial nucleotide diversity bar chart by wolf population
Mitochondrial nucleotide diversity (π) by wolf population
Nucleotide diversity calculated from complete mitochondrial genomes (16,774 bp). Sample sizes shown above each bar. Populations sorted by descending diversity. North American and Mexican wolves show markedly lower diversity than European populations, reflecting demographic bottlenecks associated with 20th-century extirpation and captive breeding.
Utah reintroduction implication

Northern Rocky Mountain wolves (represented here as "North American Wolf," π = 0.084%) carry approximately 2.2× more mitochondrial diversity than Mexican wolves (π = 0.038%). For a Utah reintroduction, sourcing from Northern Rocky Mountain stock would establish a population with a substantially healthier genetic foundation than the Mexican wolf program — which still grapples with the consequences of its seven-founder bottleneck today.

Population diversity statistics

Full diversity data for all 12 populations. Sort by any column or filter by geographic region. The Utah relevance column indicates each population's significance for reintroduction planning.

Filter:
Population ↕ n ↕ Nucleotide diversity (π%) ↕ Diversity status Utah relevance

Population structure and genetic distances

Principal components analysis of the pairwise distance matrix (Figure 2) reveals the primary axes of differentiation across populations. PC1 accounts for 96.7% of total variance — an unusually high proportion driven by the extreme divergence of Eastern Timber Wolves, which sit far to the right of all other populations. This indicates that the dominant signal in wolf mitochondrial variation globally is the ancient split between the C. l. lycaon lineage and all other wolves.

Along PC2 (2.0% variance), further structure is visible among the remaining populations: Iberian wolves separate downward, Tibetan/Asian wolves cluster toward the bottom right, and the majority of Eurasian and North American wolves form a tight central cluster — consistent with relatively recent shared ancestry and the radiation of C. l. lupus across the Holarctic.

Figure 2
PCA of wolf mitochondrial genome distances
PCA of mitochondrial genome distances
PC1 (96.7%) separates Eastern Timber Wolves from all others. PC2 (2.0%) reveals Iberian and Tibetan/Asian wolf divergence. Click to enlarge.
Figure 3
Pairwise genetic distance heatmap between wolf populations
Mean pairwise genetic distance (p-distance %)
Eastern Timber Wolves show ~4.3% divergence from all other populations. North American and Mexican wolves cluster with <0.4% between-group distances. Click to enlarge.

The pairwise distance heatmap (Figure 3) makes the population relationships immediately legible. The deep red row and column for Eastern Timber Wolf (4.26–4.62% from all others) stands in stark contrast to the pale yellow values within the Eurasian–North American cluster. Within that cluster, North American wolves and Mexican wolves are separated by only 0.38% — less than the within-population diversity of European wolves.

Tibetan/Asian wolves form a second distinct group (~2.73–3.13% from other populations), consistent with their designation as a separate subspecies (C. l. chanco) with an ancient Asian lineage. Dingo (0.49–0.92% from wolf populations) sits within the wolf radiation as expected, reflecting their recent descent from domesticated dogs.

Neighbor-joining phylogeny

The individual-level neighbor-joining tree (Figure 4) places all 100 sequences in a single phylogeny, with branch lengths proportional to genetic distance. The topology is consistent across bootstrap replicates: Eastern Timber Wolves and Tibetan/Asian wolves branch away from the main radiation, while North American, Mexican, European, and Arctic wolves form an interleaved cluster with short internal branches — reflecting rapid recent radiation with incomplete lineage sorting.

The population-level condensed tree (Figure 5) is the most directly interpretable result for reintroduction planning. It shows mean genetic distances between populations as branch lengths, revealing the relative distinctiveness of each group. The tight clustering of Mexican Wolf, Iberian Wolf, European Wolf (Spain/Sweden), North American Wolf, and Arctic Wolf in a clade with very short branches (<0.005 p-distance) indicates these populations are all closely related maternally — and that reintroduction mixing among them would be genetically defensible.

Figure 4 — individual-level NJ tree (100 sequences)
Neighbor-joining phylogenetic tree of 100 wolf mitochondrial genomes
NJ phylogeny of 100 individual wolf mitochondrial genomes
Branch lengths proportional to p-distance. Eastern Timber Wolves (orange) and Tibetan/Asian Wolves (teal) branch as deep outgroups. North American, Mexican, European, and Arctic wolves share a recent common ancestor with short internal branches. Color coding matches population legend. Click to enlarge.
Figure 5 — population-level NJ tree (mean distances)
Population-level neighbor-joining phylogeny of wolf subspecies
Population-level NJ phylogeny based on mean pairwise distances
Each tip represents a population's mean position. The short branch lengths among North American Wolf, Mexican Wolf, Arctic Wolf, and European wolves reflect their recent shared ancestry and low between-group differentiation. Eastern Timber Wolf (~0.043 p-distance) and Tibetan/Asian Wolf (~0.011 p-distance) are the most divergent lineages. Click to enlarge.
Key phylogenetic finding

The population-level tree shows that North American wolves and Mexican wolves are phylogenetically indistinguishable at the mitochondrial level at the scale of this analysis — they share the same short branch position with near-zero between-group distance (0.38%). This means a Utah reintroduction using Northern Rocky Mountain stock would be introducing a lineage that is maternally nearly identical to the region's historical wolves. The genetics support the reintroduction as a restoration of the original lineage, not the introduction of a foreign one.

What this means for Utah

Three findings from this analysis converge on a clear recommendation for Utah reintroduction planning:

1. Source population matters enormously. The 14-fold diversity difference between Iberian and Mexican wolves demonstrates how consequentially a founder bottleneck shapes long-term population health. Utah has the opportunity to avoid repeating the Mexican wolf program's bottleneck by using a well-characterized, higher-diversity source population.

2. Northern Rocky Mountain stock is the right choice. At 0.084% nucleotide diversity and only 0.38% genetic distance from historical North American wolf lineages, the Northern Rocky Mountain Distinct Population Segment is both geographically proximate and genetically appropriate. The phylogeny shows no meaningful divergence between these wolves and historical Utah wolf lineages at the mitochondrial level.

3. This analysis is a starting point. Mitochondrial DNA captures only maternal lineage history. A complete picture requires whole-genome nuclear analysis — which is the next phase of this research project (see: Wolf source population analysis, in progress). The mtDNA results presented here are consistent with and provide context for that upcoming work.

Next step in this research

The whole-genome population structure analysis (using the Snakemake pipeline described in the research overview) will extend these findings with nuclear SNP data, ADMIXTURE ancestry modeling, FST estimates, and runs-of-homozygosity inbreeding analysis — providing a complete genomic picture of source population suitability for Utah.

Citation

Hansen, A. (2025). Mitochondrial genome diversity and phylogeography of global wolf populations: implications for Utah reintroduction. Rewild Genomics LLC. Preprint in preparation.