With several other users we collected K13 (which were then converted to G25 sims) and G25 coords by Albanians from different regions in order to create a more representative average for the Albanian population. I would like to thank everybody who helped and make a special mention for Zanatis who found and contacted many individuals for sampling. The full list of individual samples will be published at a later date as full annotation is being worked out and new samples will likely be added.
n= 77 (12 samples from the existing G25 dataset + 65 new G25 sims and G25)
Existing Albanian G25 dataset (n=12) and new dataset (n=77)
The existing Albanian dataset covers ~70% of total Albanian variability as reflected in the new dataset. The inner distribution at the intersection between the two datasets is highly different and much of the average variability and inner clustering of Albanian samples is not represented in the older dataset. This means that the existing dataset is only partly representative of
average Albanian variability both quantitatively and qualitatively.
Comparison of average values:
Code:
Albanian_old,0.1181863,0.1417511,0.0155562,-0.0158808,0.0251586,-0.0074138,0.0033683,0.0018077,0.0006648,0.0161279,0.0011097,0.0015486,-0.0070986,0.0066059,-0.019295,-0.0041544,0.0081273,0.0014992,0.0082962,-0.0059402,-0.0055629,0.0007212,0.0024238,0.0021991,-0.0025248
Albanian_new,0.12098974,0.1446909,0.016269078,-0.016299558,0.025773831,-0.0048755065,0.00095344156,0.00027711688,0.0031148831,0.015759169,0.00013324675,0.0026802597,-0.0071447403,0.0023749221,-0.012327208,-0.00037183117,0.0078564935,0.00061572727,0.0042468831,-0.004238987,-0.0050441039,0.00093803896,0.00063811688,0.000061818182,-0.000051051948
Modern populations:
Distance to: Albanian_old
0.01172523 Greek_Central_Macedonia
0.01398776 Greek_Macedonia
0.01647191 Greek_Thessaly
0.01738110 Rumelia_East
0.01894300 Greek_Peloponnese
0.02764228 Italian_Tuscany
0.02824767 Swiss_Italian
0.02837148 Italian_Piedmont
0.02864146 Gagauz
0.02928925 Italian_Marche
0.03115133 Greek_Izmir
0.03122240 Bulgarian
0.03123173 Italian_Umbria
0.03186874 Greek_Laconia
0.03281939 Italian_Molise
Distance to: Albanian_new
0.01209714 Greek_Central_Macedonia
0.01400206 Greek_Thessaly
0.01672179 Greek_Macedonia
0.01775242 Rumelia_East
0.01968231 Italian_Tuscany
0.02000210 Greek_Peloponnese
0.02007244 Swiss_Italian
0.02023180 Italian_Piedmont
0.02366034 Italian_Marche
0.02453388 Italian_Umbria
0.02717856 Italian_Veneto
0.02876732 Gagauz
0.03021512 Italian_Lazio
0.03054964 Italian_Bergamo
0.03057167 Greek_Laconia
Pre-medieval populations:
Distance to: Albanian_old
0.02599546 GRC_Logkas_MBA
0.03105090 ITA_Rome_Late_Antiquity
0.03277406 Scythian_MDA
0.03455405 SRB_Mokrin_EBA_Maros_oAegean
0.03609949 ITA_Proto-Villanovan
0.03611625 HRV_Pop_CA
0.03664529 HUN_IA_Syrmian_SremGroup
0.03753924 HUN_LBA_EIA
0.04001548 HRV_EBA
0.04127877 HRV_EIA
Distance to: Albanian_new
0.02432793 GRC_Logkas_MBA
0.02581156 ITA_Rome_Late_Antiquity
0.03151149 Scythian_MDA
0.03310358 SRB_Mokrin_EBA_Maros_oAegean
0.03435231 HUN_LBA_EIA
0.03437358 HRV_Pop_CA
0.03441905 HUN_IA_Syrmian_SremGroup
0.03506589 ITA_Proto-Villanovan
0.03517396 HRV_EIA
0.03580568 HRV_EBA
Distance to populations from the Roman Balkans:
Distance to: Albanian_old
0.02937597 Gardun_Tilurium_Croatia_549_CE_600CE:3544:3544
0.03335371 Zadar_Croatia_127_CE_227_CE:3747
0.03422501 Sipar_Umag_Croatia_558_CE_639_CE:3662:3662
0.03542427 Sviloš_Kruševlje_Serbia_236_CE_331_CE:6693:6693
0.03654690 Beli_Manastir_Croatia_255_CE_405_CE:3542:3542
0.03776972 Sirmium_Serbia_266_CE_430_CE:6730:6730
0.03944106 Sviloš_Kruševlje_Serbia_236_CE_331_CE:6701:6701
0.04265914 Sipar_Umag_Croatia_686_CE_876_CE:3663:3663
0.04736707 Zadar_Croatia_22_CE_121_CE:3745:3745
0.04839841 Osijek_Croatia_133CE_306CE:3655:3655
0.05454439 Zadar_Croatia_127_CE_227_CE:3746
0.06185512 Doclea_Bjelovine_Montenegro_709_CE_880_CE:3478:347 8
0.06991570 Trogir_Dragulin_Croatia_124_CE_217_CE:3665:3665
0.07280484 Zadar_Croatia_127_CE_227_CE:3742:3742
0.08226871 Sirmium_Serbia_1481_CE_1635_CE:6737:6737
0.08259413 Sirmium_Serbia_1479_CE_1634_CE:3906:3906
0.09071470 Trogir_Policija_Croatia_120CE_215_CE:3670:3670
0.09710730 Viminacium_Serbia_129_CE_230_CE:3931:3931
Distance to: Albanian_new
0.02666428 Gardun_Tilurium_Croatia_549_CE_600CE:3544:3544
0.02799523 Zadar_Croatia_127_CE_227_CE:3747
0.02951193 Sipar_Umag_Croatia_558_CE_639_CE:3662:3662
0.03226992 Beli_Manastir_Croatia_255_CE_405_CE:3542:3542
0.03471729 Sviloš_Kruševlje_Serbia_236_CE_331_CE:6693:6693
0.03678690 Sirmium_Serbia_266_CE_430_CE:6730:6730
0.03816536 Sviloš_Kruševlje_Serbia_236_CE_331_CE:6701:6701
0.03840354 Sipar_Umag_Croatia_686_CE_876_CE:3663:3663
0.04563692 Zadar_Croatia_22_CE_121_CE:3745:3745
0.04621254 Osijek_Croatia_133CE_306CE:3655:3655
0.05284715 Zadar_Croatia_127_CE_227_CE:3746
0.06097498 Doclea_Bjelovine_Montenegro_709_CE_880_CE:3478:347 8
0.06912722 Trogir_Dragulin_Croatia_124_CE_217_CE:3665:3665
0.07371484 Zadar_Croatia_127_CE_227_CE:3742:3742
0.08221655 Sirmium_Serbia_1479_CE_1634_CE:3906:3906
0.08246558 Sirmium_Serbia_1481_CE_1635_CE:6737:6737
0.09039312 Trogir_Policija_Croatia_120CE_215_CE:3670:3670
0.09498046 Viminacium_Serbia_129_CE_230_CE:3931:3931
Comments:
1)In comparison to the old dataset, the new Albanian cluster shows a higher affinity to pre-medieval populations in the Balkans and modern southern Europe and a lower post-Migration Period admixture.
2) Within Albania, on the north-south axis, there is no distinct geographical south/north sub-clustering which is to say that all possible variations of the same typical profile are found from south to north of Albania.
(veri = north, jug= south, Shqipëri qendrore = central Albania - samples which have origins from both southern and northern regions weren't used in this PCA)
3)Some profiles which were considered to be outliers on the official G25 dataset are in fact common part of Albanian variation. AL12 is one such profile which is almost entirely EEF/Yamnaya. There are several Albanian profiles which have such features. In my opinion, as such features aren't correlated with population contact situations, a likely explanation is endogamy within particular Albanian regions. Endogamy may have enhanced certain components, while it lowered to a great extent others (autosomal drift).