Investigating the source data of the Vahaduo Eurogenes K15 Ancient calculator I was able to verify that there are not many samples from the Iberian Peninsula. I was able to verify the existence of those that are listed below.
Iberia:
Iberia_Pre_BellBeaker_Chalcolithic_ATP16,4.94,38.80,0.01,0.01,56.23,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01
IberiaChalcolithic_I1281,11.39,38.28,0.01,0.01,50.33,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01
IberiaChalcolithic_I1303,2.09,42.04,0.01,0.01,55.87,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01
IberiaMBA_ATP9,17.71,44.44,0.50,0.01,37.36,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01,0.01
Canarias Islands:
Guanche_11,0.01,12.69,0.01,0.01,24.73,0.01,23.27,18.15,0.01,0.01,0.01,0.01,0.19,13.56,7.41
Guanche_5,9.80,16.64,0.09,0.01,29.00,0.01,14.44,14.85,0.01,0.01,0.01,0.01,0.01,1.07,14.12
Out of curiosity I discovered the threads below, respectively created by @Tomenable (Europeans in Zanzibar in 800 CE) and @Ygorcs (Doubts about the "Tanzania_Zanzibar_First Millennium" aDNA sample) where it is discussed how an Iberian European could be found in Zanzibar in 800 CE (I0588_Tanzania_Zanzibar_800AD_outlier_European).
I0588 Tanzania Zanzibar, sample published by Pontus Skoglund and dated to 800 AD:
Eurogenes K15 results of this sample:
27834 SNPs used in this evaluation
Admix Results (sorted):
# Population Percent
1 Atlantic 26.89
2 West_Med 26.26
3 North_Sea 24.7
4 East_Med 6.37
5 Baltic 5.24
6 West_Asian 3
7 Red_Sea 2.41
8 Sub-Saharan 2.4
9 Eastern_Euro 1.48
10 Siberian 1.25
Single Population Sharing:
# Population (source) Distance
1 Spanish_Galicia 6.55
2 Portuguese 7.09
3 Spanish_Cantabria 7.34
4 Spanish_Cataluna 7.72
5 Spanish_Castilla_Y_Leon 7.99
6 Spanish_Extremadura 8.15
7 Spanish_Murcia 8.9
8 Spanish_Castilla_La_Mancha 9.69
9 Southwest_French 10.05
10 Spanish_Aragon 10.18
11 Spanish_Valencia 10.25
12 Spanish_Andalucia 10.47
13 French 11.81
14 North_Italian 12.6
15 South_Dutch 16.64
16 French_Basque 17.96
17 West_German 18.29
18 Southwest_English 18.53
19 Tuscan 18.61
20 Southeast_English 20.41
Mixed Mode Population Sharing:
# Primary Population (source) Secondary Population (source) Distance
1 55.1% Orcadian + 44.9% Sardinian @ 4.56
2 60.8% Southwest_English + 39.2% Sardinian @ 5.11
3 55.2% West_Scottish + 44.8% Sardinian @ 5.35
4 55.9% Irish + 44.1% Sardinian @ 5.41
5 58.3% Southeast_English + 41.7% Sardinian @ 5.48
6 59.6% Spanish_Galicia + 40.4% Spanish_Cantabria @ 5.8
7 85.5% Spanish_Galicia + 14.5% French_Basque @ 5.89
8 92% Spanish_Galicia + 8% Sardinian @ 6.11
9 55% North_Dutch + 45% Sardinian @ 6.12
10 73.1% French + 26.9% Sardinian @ 6.18
11 79.4% Spanish_Galicia + 20.6% Spanish_Aragon @ 6.21
12 79.3% Spanish_Galicia + 20.7% Southwest_French @ 6.22
13 83.2% Spanish_Galicia + 16.8% Spanish_Castilla_La_Mancha @ 6.38
14 80.4% Spanish_Galicia + 19.6% Spanish_Cataluna @ 6.47
15 88.9% Spanish_Galicia + 11.1% Spanish_Andalucia @ 6.47
16 88.9% Spanish_Galicia + 11.1% Spanish_Valencia @ 6.48
17 54.5% Portuguese + 45.5% Spanish_Cantabria @ 6.49
18 99.4% Spanish_Galicia + 0.6% Yoruban @ 6.51
19 99.4% Spanish_Galicia + 0.6% Mandenka @ 6.52
20 88% Spanish_Galicia + 12% Spanish_Castilla_Y_Leon @ 6.52
Similarity map:
Europeans in Zanzibar - eastern coast of Africa - as early as 800 CE ???
This sample is also in Global25.
I decided to post this doubt of mine here so that someone might perhaps help clarify what's happening with the average population DNA sample named TZA_Zanzibar_FirstMillennium in Global25 datasheets. I have googled to try to find the paper that that sample comes from, but couldn't find it. The issue is that, running the sample into nMonte software to model its ancestry, it's clear that it's just too much of an outlier to be believed: the sample is closest to ancient Iberian samples and, among modern populations, also to Iberians. Okay, I wouldn't be totally surprised by a Portuguese trader/sailor being buried in Zanzibar in the 16th or 17th century (though I still think that would be a real find, as they must've been a tiny minority of the total population), but first millennium? Does anyone have any information about that DNA sample, the study that analyzed and released it, the dating and archaeological circumstances? For now I'll just assume that the labeling was totally wrong and it in fact dates to the last centuries.
As an experiment, I added the coordinates obtained for this sample in Gedmatch by @Tomenable in the Vahaduo EU K15 Ancient source data and, to my surprise, it became my best ancestral match in the Vahaduo Eurogenes K15 Ancient. Below the results. In calculating the mix, I searched for distances close to that displayed by Gedmatch EU V2 K15 'Mixed Mode Population Sharing - Primary Population + Secondary Population'.
Curious.
Gedmatch EU V2 K15 coordinats:
I0588_Tanzania_Zanzibar_800AD_outlier(European),24.70,26.89,5.24,1.48,26.26,3.00,6.37,2.41,0.00,0.00,1.25,0.00,0.00,0.00,2.40
Distance to: | Duarte |
---|
6.76977104 | I0588_Tanzania_Zanzibar_800AD_outlier(European) |
8.24704796 | IA_Prenestina_Colombella_R435 |
9.21567686 | IA_Boville_Ernica_R1021 |
9.48283713 | Swiss_CHE_IA_SX18 |
10.20174005 | IA_Civitavecchia_R474 |
10.21657477 | IA_Civitavecchia_R473 |
10.64023026 | ScythianMoldova_SCY197 |
10.64999531 | RomanSoldier_FN2 |
11.06359345 | IA_Ardea_R851 |
11.28990257 | IA_Etruscan_Veio_Grotta_Gramiccia_R1015 |
11.45422193 | Swiss_DEU_Singen_EBA_MX277 |
11.65095704 | France_IA_Jeb8 |
11.68151531 | BronzAgeBalkan |
12.28918223 | BronzeAgeDalmatian_I4332 |
12.30414158 | IA_Castel_di_Decima_R1016 |
12.86881502 | EMBA_Croatia_I4331 |
13.01338157 | France_IA_ERS86 |
13.49124531 | Swiss_DEU_Singen_EBA_MX283_relMX286 |
13.53752562 | Swiss_CHE_EBA_SX20 |
13.91407201 | France_IA_NOR4 |
14.39051771 | ScythianMoldova_SCY300 |
14.79033468 | France_IA_NOR2B6 |
14.81770563 | Swiss_DEU_Singen_EBA_MX254_relMX286 |
15.09151748 | Vucedol_I3499 |
15.38797258 | BronzeAgeEngland_I2462
|