• Don't want to see ads? Install an adblocker like uBlock Origin or use a Europe-based privacy-friendly browser like Vivaldi or Mullvad.

Mitotree: The Universal Human Mitochondrial Reference Phylogeny at 10x the Resolution

Tautalus

Regular Member
Messages
545
Reaction score
1,380
Points
93
Ethnic group
Portuguese
Y-DNA haplogroup
I2-M223 / I-FTB15368
mtDNA haplogroup
H6a1b2y
Mitotree: A New Era for Human Mitochondrial Phylogeny

Human mitochondrial DNA has long been used to reconstruct maternal ancestry, population history, migration patterns, and evolutionary relationships. For decades, PhyloTree served as the main reference framework for mitochondrial classification, but the explosive growth of genomic databases exposed limitations in scalability and update frequency. Mitotree was developed as a next-generation solution, introducing a much larger, automated, and continuously maintainable phylogenetic framework for human mitochondrial variation.

Mitotree represents the largest human mitochondrial phylogeny assembled to date. The study analysed more than 331,000 complete mitochondrial genomes and reconstructed relationships among over 177,000 unique haplotypes. To overcome computational limitations associated with large phylogenetic datasets, the authors developed a divide-and-conquer strategy based primarily on weighted maximum parsimony, recursive partitioning, and specialised treatment of rapidly mutating mitochondrial regions. Ancient DNA sequences were incorporated into the reconstruction process, allowing temporal calibration and historical interpretation.

One of the most important achievements of Mitotree is the dramatic expansion of recognised mitochondrial diversity. The framework defines more than 53,000 haplogroups, representing approximately ten times more branches than previous reference systems. This increased resolution revealed around 180 previously unrecognised haplogroups older than thirty thousand years and refined age estimates for many terminal branches, which were often found to be younger than previously assumed.

Mitotree significantly improves the reconstruction of historical maternal lineages by linking ancient individuals, archaeological samples, and living populations. One of the clearest examples involves the Tyrolean Iceman Ötzi, whose mitochondrial lineage K1f had previously appeared isolated and potentially extinct. The new tree connects K1f with modern individuals of North African maternal ancestry, medieval European samples, and Mesolithic Balkan populations, transforming the interpretation of this lineage from an isolated branch into a long-lasting historical network extending across millennia. The framework also refines the placement of maternal lineages associated with historical figures and documented individuals, including Abraham Lincoln, Dolley Madison, Otto I, Henry II, and Nicholas II. Although many exact subclades remain under refinement, the study demonstrates the increasing ability of large phylogenies to contextualise historical genomes within broader maternal ancestry patterns.

The study identifies extremely ancient mitochondrial branches that reshape understanding of early human population history. A newly recognised lineage, L2e'f^, dates to approximately eighty-three thousand years ago and represents one of the oldest newly resolved mitochondrial divergences identified in the study. Additional restructuring within African haplogroups, particularly within L0-related lineages, reveals previously hidden demographic complexity and contributes to more detailed reconstructions of ancient African population history.

Mitotree substantially improves resolution for populations shaped by founder effects and migration. Major Ashkenazi Jewish founder lineages, including K1a1b1a, N1b1b1, K1a9, K2a2a1, and HV1b2, were divided into numerous new branches, revealing deeper ancestry and older divergence dates than previously recognised. The framework also clarifies maternal histories in geographically isolated and diaspora populations, including Belmonte crypto-Jewish communities, Libyan Jewish populations, Cochin Jewish groups, and lineages connecting Jewish and East Asian ancestry.
Native American maternal history also undergoes major revision. Haplogroups A2, B2, C1b, and D1 expand dramatically in complexity, allowing improved reconstruction of early American dispersals and connections between ancient and present-day populations. Similar increases in resolution are observed in northern Eurasian founder populations such as the Saami and in East Asian populations linked to Jōmon and Ainu ancestry.

Mitotree is more than a larger mitochondrial tree. It is a reconstruction of human maternal history at unprecedented scale, connecting prehistoric divergences, ancient populations, historical individuals, and modern communities into a unified evolutionary framework. Its expanded resolution fundamentally changes how mitochondrial ancestry can be studied and interpreted.
Abstract
Mitochondrial DNA (mtDNA) is the oldest and most prevalent source of human molecular phylogenetic reconstruction. Everyone alive today traces their matrilines to one female African ancestor who lived some 145 thousand years ago. For decades, PhyloTree provided researchers with a moderately sized reference tree (v17: n=24,275 sequences, n=5,438 branches). However, hundreds of thousands of sequences are available today, and since PhyloTree's retirement in 2016, there is a need for an actively maintained phylogenetic reference system. In addition, no currently published method can wield such a large de novo reconstruction while dealing with the rampant homoplasy seen in mtDNA and adhering to the accuracy standards expected in this well-studied field. In this paper we introduce Mitotree, the largest human phylogeny ever described (n~330,000 complete sequences, n~54,000 branches), and a new recursive phylogenetic pipeline for estimating it. We incorporate ancient DNA, relaxed clock age estimates (TMRCAs), and public databases (e.g., GenBank, 1000 Genomes, HGDP, and SGDP). We report approximately 180 previously unknown branches older than 30,000 years. The median TMRCA of terminal haplogroups is 2,000 years more recent than PhyloTree. Our pipeline offers a novel divide-and-conquer approach that tackles huge heuristic searches within tractable runtimes, provides high accuracy, and reasonable confidence estimates. Our validation found a false negative rate of 3.5%, and a false positive rate of 1.0%. Among the most striking findings are an ~83 kya split of African L2e (the oldest in our dataset), the resolution of Ötzi's K1f into a clade with living descendants, new ethnic founder haplogroups (e.g., Jewish diaspora), and placement of historical figures such as Abraham Lincoln. To our knowledge, Mitotree is the largest de novo haplotype phylogeny built for humanity, and is a continually improved resource for academic, genealogical, and medical research.

x16uZAO.png


Mitochondrial Tree Growth 2008–2026
utNnOCc.png

Mitotree Haplogroup Counts and Population Origins
mnQBhj5.png

 
Last edited:
Back
Top