A high-coverage Neandertal genome from Vindija Cave in Croatia

Then, over the past few years, ancient DNA researchers sequenced two more Neandertal genomes, including another high-quality sequence from an individual that lived 122,000 years ago in the Altai Mountains of Siberia.

is this correct?
afaik the oldest Neanderthal in Central Asia is dated 87 ka, and in the Altaï Mts, it is even much later


this is the Neanderthal DNA they are talking about :

https://www.theguardian.com/science...of-neanderthal-human-sex-revealed-by-dna-test

it is from 50 ka, not 122 ka

it was admixed 100 ka with modern human DNA, from a branch of modern humans that has gone extinct, probably related to the humans from Skhul and Qafzeh cave in the Levant
those Skhul and Qafzeh skull shapes suggests they are related to the Irhoud skulls in the Atlas Mountains, which recently were dated +/- 315 ka
 
Looks like they've released the paper to the public now:

http://science.sciencemag.org/content/early/2017/10/04/science.aao1887

Abstract

To date the only Neandertal genome that has been sequenced to high quality is from an individual found in Southern Siberia. We sequenced the genome of a female Neandertal from ~50 thousand years ago from Vindija Cave, Croatia to ~30-fold genomic coverage. She carried 1.6 differences per ten thousand base pairs between the two copies of her genome, fewer than present-day humans, suggesting that Neandertal populations were of small size. Our analyses indicate that she was more closely related to the Neandertals that mixed with the ancestors of present-day humans living outside of sub-Saharan Africa than the previously sequenced Neandertal from Siberia, allowing 10-20% more Neandertal DNA to be identified in present-day humans, including variants involved in LDL cholesterol levels, schizophrenia and other diseases.


Neandertals are the closest evolutionary relatives identified to date of all present-day humans and therefore provide a unique perspective on human biology and history. In particular, comparisons of genome sequences from Neandertals with those of present-day humans have allowed genetic features unique to modern humans to be identified (1, 2) and have shown that Neandertals mixed with the ancestors of present-day people living outside sub-Saharan Africa (3). Many of the DNA sequences acquired by non-Africans from Neandertals were likely detrimental and were purged from the human genome via negative selection (48) but some appear to have been beneficial and were positively selected (9); among people today, alleles derived from Neandertals are associated with both susceptibility and resistance to diseases (7, 1012).
However, our knowledge about the genetic variation among Neandertals is still limited. To date genome-wide DNA sequences of five Neandertals have been determined. One of these, the “Altai Neandertal”, found in Denisova Cave in the Altai Mountains in southern Siberia, the eastern-most known reach of the Neandertal range, yielded a high quality genome sequence (~50-fold genomic coverage) (2). In addition, a composite genome sequence from three Neandertal individuals has been generated from Vindija Cave in Croatia in southern Europe but is of low quality (~1.2-fold total coverage) (3), while a Neandertal genome from Mezmaiskaya Cave in the Caucasus (2) is of even lower quality (~0.5-fold coverage). In addition, chromosome 21 (13) and exome sequences (14) have been generated from a different individual from Vindija Cave and one from Sidron Cave in Spain. The lack of high-quality Neandertal genome sequences, especially from the center of their geographical range and from the time close to when they were estimated to have mixed with modern humans, limits our ability to reconstruct their history and the extent of their genetic contribution to present-day humans.
Neandertals lived in Vindija Cave in Croatia until relatively late in their history (3, 15). The cave has yielded Neandertal and animal bones, many of them too fragmentary to determine from their morphology from what species they derive. Importantly, DNA preservation in Vindija Cave is relatively good and allowed the determination of Pleistocene nuclear DNA from a cave bear (16), a Neandertal genome (3), exome and chromosome 21 sequences (13, 14).
To generate DNA suitable for deep sequencing, we extracted DNA (17) and generated DNA libraries (18) from 12 samples from Vindija 33.19, one of 19 bone fragments from Vindija Cave determined to be of Neandertal origin by mitochondrial (mt) DNA analyses (19). In addition, 567 mg (mg) were removed for radiocarbon dating and yielded a date of greater than 45.5 thousand years before present (OxA 32,278). One of the DNA extracts, generated from 41 mg of bone material, contained more hominin DNA than the other extracts. We created additional libraries from this extract, but to maximize the number of molecules retrieved from the specimen we omitted the uracil-DNA-glycosylase (UDG) treatment (20, 21). A total of 24 billion DNA fragments were sequenced and approximately 10% of these could be mapped to the human genome. Their average length was 53 base pairs (bp) and they yielded 30-fold coverage of the approximately 1.8 billion bases of the genome to which such short fragments can be confidently mapped.
We estimated present-day human DNA contamination among the DNA fragments (20). First, using positions in the mtDNA where present-day humans differ from Neandertals we estimated an mtDNA contamination rate of 1.4-1.7%. Similarly, using positions in the autosomal genome where all present-day humans carry derived variants whereas all archaic genomes studied to date carry ancestral variants we estimated a nuclear contamination rate of 0.17-0.48%. Because the coverage of the X chromosome is similar to that of the autosomes we inferred that the Vindija 33.19 individual is a female, allowing us to use DNA fragments that map to the Y chromosome to estimate a male DNA contamination of 0.74% (between 0.70-0.78% for each of the nine sequencing libraries). Finally, using a likelihood method (2, 3) we estimated the autosomal contamination to 0.18-0.23%. We conclude that the nuclear DNA contamination rate among the DNA fragments sequenced is less than 1%. After genotyping this will result in contamination that is much lower than 1%.
Because ~76% of the DNA fragments were not UDG-treated, they carry C to T substitutions throughout their lengths. This causes standard genotyping software to generate false heterozygous calls. To overcome this we implemented snpAD, a genotyping software that incorporates a position-dependent error-profile to estimate the most likely genotype for each position in the genome. This results in genotypes of comparable quality to UDG-treated ancient DNA given our genomic coverage (20). The high-coverage of the Vindija genome also allowed for characterization of longer structural variants and segmental duplications (20).
To gauge whether the Vindija 33.19 bone might stem from a previously sequenced individual from Vindija Cave we compared heterozygous sites in the Vindija 33.19 genome to DNA fragments sequenced from the other bones. The three bones from which a low-coverage composite genome has been generated (Vindija 33.16, 33.25 and 33.26) do not share variants with Vindija 33.19 at a level compatible with deriving from the same individual. In contrast, over 99% of heterozygous sites in the chromosome 21 sequence from Vindija 33.15 (13) are shared with Vindija 33.19, indicating that they come from the same individual (20). Additionally, two of the other three bones may come from individuals that shared a maternal ancestor to Vindija 33.19 relatively recently in their family history because all carry identical mtDNAs.
In addition to the Altai Neandertal genome, a genome from a Denisovan, an Asian relative of Neandertals, has been sequenced to high coverage (~30-fold) from Denisova Cave. These two genomes are similar in that their heterozygosity is about one fifth of that of present-day Africans and about one third of that of present-day Eurasians. We estimated the heterozygosity of the Vindija 33.19 autosomal genome to 1.6x10−5; similar to the Altai Neandertal genome and slightly lower than the Denisovan genome (1.8 x10−5) (Fig. 1A). Thus, low heterozygosity may be a feature typical of archaic hominins, suggesting that they lived in small and isolated populations with an effective population size of around 3,000 individuals (20). In addition to low over-all heterozygosity, the Altai Neandertal genome carried segments of many megabases (Mb) (>10 centimorgans (cM)) without any differences between its two chromosomes, indicating that the parents of that individual were related at the level of half-sibs (2). Such segments are almost totally absent in the Vindija genome (Fig. 1B), suggesting that the extreme inbreeding between the parents of the Altai Neandertal was not ubiquitous among Neandertals. We note, however, that the Vindija genome carries extended homozygous segments (>2.5cM) comparable to what is seen in some isolated Native American populations today (20).

The high quality of the three archaic genome sequences allows their approximate ages to be estimated from the number of new nucleotide substitutions they carry relative to present-day humans when compared to the inferred ancestor shared with apes (1). Using this approach, we estimate that the Vindija 33.19 individual lived 52 thousand years ago (kya), the Altai Neandertal individual 122kya, and the Denisovan individual 72kya (Fig. 2) (20). Many factors make such absolute age estimates tentative. Among these are uncertainty in generation times and mutation rates. Nevertheless, these results indicate that the Altai Neandertal lived about twice as far back in time as the Vindija 33.19 Neandertal, while the Denisovan individual lived after the Altai but before the Vindija Neandertal.


We next estimated when ancestral populations that gave rise to the three archaic genomes and to modern humans split from each other based on the extent to which they share genetic variants (13, 20). The estimated population split time between the Vindija Neandertal and the Denisovan is 390-440kya and between the Vindija Neandertal and modern humans 520-630kya, in agreement with previous estimates using the Altai Neandertal (2). The split time between the Vindija and the Altai Neandertals is estimated to 130-145kya. To estimate the population split time to the Mezmaiskaya 1 Neandertal previously sequenced to 0.5-fold coverage, we prepared and sequenced libraries yielding an additional 1.4-fold coverage. Because the present-day human DNA contamination of these libraries is in the order of 2-3% (20), we estimated the population split time to the Vindija 33.19 individual with and without restricting the analysis of the Mezmaiskaya 1 individual to fragments that show evidence of deamination. The resulting split time estimates are 100kya for the deaminated fragments and 80kya for all fragments (Fig. 2).
It has been suggested that Denisovans received gene flow from a human lineage that diverged prior to the common ancestor of modern humans, Neandertals and Denisovans (2). In addition, it has been suggested that the ancestors of the Altai Neandertal received gene flow from early modern humans that may not have affected the ancestors of European Neandertals (13). In agreement with these studies, we find that the Denisovan genome carries fewer derived alleles that are fixed in Africans, and thus tend to be older, than the Altai Neandertal genome while the Altai genome carries more derived alleles that are of lower frequency in Africa, and thus younger, than the Denisovan genome (20). However, the Vindija and Altai genomes do not differ significantly in the sharing of derived alleles with Africans indicating that they may not differ with respect to their putative interactions with early modern humans (Fig. 3A & B). Thus, in contrast to earlier analyses of chromosome 21 data for the European Neandertals (13), analyses of the full genomes suggest that the putative early modern human gene flow into Neandertals occurred prior to the divergence of the populations ancestral to the Vindija and Altai Neandertals ~130-145 thousand years ago (Fig. 2). Coalescent simulations show that a model with only gene flow from a deeply diverged hominin into Denisovan ancestors explains the data better than one with only gene flow from early modern humans into Neandertal ancestors, but that a model involving both gene flows explains the data even better. It is likely that gene flow occurred between many or even most hominin groups in the late Pleistocene and that more such events will be detected as more ancient genomes of high quality become available.


A proportion of the genomes of all present-day people whose roots are outside Africa derives from Neandertals (2, 3, 22). We tested if any of the three sequenced Neandertals falls closer to the lineage that contributed DNA to present-day non-Africans by asking if any of them shares more alleles with present-day non-Africans than the others (20, 23). The Vindija 33.19 and Mezmaiskaya 1 genomes share more alleles with non-Africans than the Altai Neandertal, and there is no indication that the former two genomes differ in the extent of their allele-sharing with present-day people (Fig. 3C). Using a likelihood approach we estimate the proportion of Neandertal DNA in present-day populations that is closer to the Vindija than the Altai genomes to be 99%-100% (20). Thus, the majority of Neandertal DNA in present-day populations appears to come from Neandertal populations that diverged from the Vindija and Mezmaiskaya 1 Neandertals prior to their divergence from each other some 80-100kya.
The two high-coverage Neandertal genomes allow us to estimate the proportion of the genomes of present-day people that derive from Neandertals with greater accuracy than was hitherto possible. We asked how many derived alleles non-Africans share with the Altai Neandertal relative to how many derived alleles the Vindija Neandertal shares with the Altai Neandertal - essentially asking how close non-Africans are to being 100% Neandertal (24). We find that non-African populations outside Oceania carry between 1.8-2.6% Neandertal DNA (Fig. 4A), higher than previous estimates of 1.5-2.1% (2). As described (25), East Asians carry somewhat more Neandertal DNA (2.3-2.6%) than people in Western Eurasia (1.8-2.4%).


We also identified (8) regions of Neandertal-ancestry in present-day Europeans and Asians using the Vindija and the Altai Neandertal genomes (20). The Vindija genome allows us to identify ~10% more Neandertal DNA sequences per individual than the Altai Neandertal genome (e.g. 40.4 Mb vs 36.3 Mb in Europeans) due to the closer relationship between the Vindija genome and the introgressing Neandertal populations. In Melanesians, the increased power to distinguish between Denisovan and Neandertal DNA sequences results in the identification of 20% more Neandertal DNA (Fig. 4B).
Many Neandertal variants associated with phenotypes and susceptibility to diseases have been identified in present-day non-Africans (6, 7, 1012). The fact that the Vindija Neandertal genome is more closely related to the introgressing Neandertals allows ~15% more such variants to be identified (20). Among these are variants associated with plasma levels of LDL cholesterol (rs10490626) and vitamin D (rs6730714), eating disorders (rs74566133), visceral fat accumulation (rs2059397), rheumatoid arthritis (45475795), schizophrenia (rs16977195) and the response to antipsychotic drugs (rs1459148). This adds to mounting evidence that Neandertal ancestry influences disease risk in present-day humans, particularly with respect to neurological, psychiatric, immunological, and dermatological phenotypes (7).




KVogluy.jpg


Fig. 1
Heterozygosity and inbreeding in the Vindija Neandertal.
(A) Distribution of heterozygosity over all autosomes in the three archaic hominins, 12 Non-Africans and 3 Africans. Each dot represents the heterozygosity measured for one autosome. The center bar indicates the mean heterozygosity across the autosomal genome. (B) Genome covered by shorter (2.5-10cM, red) and longer (>10cM, yellow) runs of homozygosity in the three archaic hominins.
HeSQZRx.jpg


Fig. 2Approximate ages of specimens and population split times.
Age estimates for the genomes estimated from branch shortening, i.e. the absence of mutations in the archaic genomes, are indicated by dashed lines. Population split time estimates are indicated by dashed lines. The majority of Neandertal DNA in present-day people comes from a population that split from the branch indicated in red. All reported ages assume a human-chimpanzee divergence of 13 million years. Numbers show ranges over point estimates (split times), or ranges over different data filters (branch shortening).

Ha2QY8I.jpg


Fig. 3Allele sharing between archaic and modern humans.

(A) Derived allele-sharing in percent of 19 African populations with the Altai and Denisovan, and Vindija and Denisovan genomes, respectively. (B) Sharing of derived alleles in each of the 19 African populations with the Vindija and Altai genomes. (C) Allele sharing of Neandertals with non-Africans and Africans. Points show derived allele sharing in percent for all pairwise comparisons between non-Africans (OAA: French, Sardinian, Han, Dai, Karitiana, Mixe, Australian, Papuan) and Africans (AFR: San, Mbuti, Yoruba). Mezmaiskaya 1 data were restricted to sequences showing evidence of deamination to reduce the influence of present-day human DNA contamination. Lines show two standard errors from the mean in all plots.

uOJp0KV.jpg


Fig. 4Estimates of fraction of Neandertal DNA for present-day populations.

(A) Colors indicate Neandertal ancestry estimates (20). Oceanian populations show high estimates due to Denisovan ancestry that is difficult to distinguish from Neandertal ancestry. (B) Amount of Neandertal sequence in present-day Europeans, South Asians and East Asians (20).
 
Back
Top