Edited by Liisa Loog, University of Oxford, Cambridge, UK; received November 1, 2022; accepted December 5, 2022, by Editorial Board Member Richard G. Klein
February 23, 2023
120 (9) e2218375120
Significance
Ancient human DNA (aDNA) extracted from archaeological contexts allows reconstructing past population movements. Previous methods work by calculating proportions of shared ancestry among individuals or groups in order to answer specific, regional research questions. Here, we propose a large-scale algorithm to quantify human mobility through time and space using bulk aDNA data. The algorithm has two core components: i) interpolation of the spatiotemporal distribution of genetic ancestry to obtain a continuous ancestry information field and ii) probabilistic estimation of a spatial genetic similarity surface for each input sample by projecting its ancestry profile into this field. We apply this to thousands of published genomic samples in the last 10,000 y to trace diachronic mobility patterns in Western Eurasia.
Abstract
The recent increase in openly available ancient human DNA samples allows for large-scale meta-analysis applications. Trans-generational past human mobility is one of the key aspects that ancient genomics can contribute to since changes in genetic ancestry—unlike cultural changes seen in the archaeological record—necessarily reflect movements of people. Here, we present an algorithm for spatiotemporal mapping of genetic profiles, which allow for direct estimates of past human mobility from large ancient genomic datasets. The key idea of the method is to derive a spatial probability surface of genetic similarity for each individual in its respective past. This is achieved by first creating an interpolated ancestry field through space and time based on multivariate statistics and Gaussian process regression and then using this field to map the ancient individuals into space according to their genetic profile. We apply this algorithm to a dataset of 3138 aDNA samples with genome-wide data from Western Eurasia in the last 10,000 y. Finally, we condense this sample-wise record with a simple summary statistic into a diachronic measure of mobility for subregions in Western, Central, and Southern Europe. For regions and periods with sufficient data coverage, our similarity surfaces and mobility estimates show general concordance with previous results and provide a meta-perspective of genetic changes and human mobility.
Continue Reading
Data, Materials, and Software Availability
The code for this paper is available in a repository here: https://doi.org/10.17605/OSF.IO/6UWM5. From that, we outsourced the main similarity search and mobility estimation workflow into an R package available here: https://github.com/nevrome/mobest. All data analysis and plotting was done in R (87) with the following packages: checkmate (88), cowplot (89), fractional (90), future (91), ggh4x (92), ggnewscale (93), ggpubr (94), ggrepel (95), ggridges (96), igraph (97), khroma (98), latex2exp (99), lemon (100), progress (101), rnaturalearth (102), sf (103), smartsnp (104), viridis (105), and, finally, the tidyverse and the many packages within it ref. 106. Previously published data were used for this work (Allen Ancient DNA Resource https://reich.hms.harvard.edu/allen-ancient-dna-resource-aadr-downloadable-genotypes-present-day-and-ancient-dna-dataversion 50.0).
Acknowledgments
This research was financed by the International Max Planck Research School for the Science of Human History (IMPRS-SHH) and carried out on computational facilities of the Max Planck Institutes for Geoanthropology (formerly for the Science of Human History) and for Evolutionary Anthropology. Data collection was significantly simplified thanks to the Allen Ancient DNA Resource and the Poseidon genotype data initiative. We gratefully acknowledge insightful discussions with Joscha Gretzinger and helpful advice from Thiseas C. Lamnidis, James A. Fellows Yates, He Yu, Ayshin Ghalichi, Ke Wang (all currently or formerly affiliated with MPI-EVA), Martin Hinz (University Bern), Martin J. Kümmel (University Jena), Oliver Nakoinz (University Kiel), and all members of the Population genetics working group at the MPI-EVA. This project has also received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement number 851511).
Author contributions
C.S. and S.S. designed research; C.S. and S.S. performed research; C.S. analyzed data; and C.S. and S.S. wrote the paper.
Competing interests
The authors declare no competing interest.
Supporting Information
References
1
P. Bellwood, The Global Prehistory of Human Migration (John Wiley and Sons, 2014).
2
R. L. Kelly, Mobility/Sedentism: Concepts, archaeological measures, and effects. Annu. Rev. Anthropol. 2143–66 (1992).
3
B. Bender, Landscapes on-the-move. J. Soc. Archaeol. 175–89 (2001).
4
M. A. Lelièvre, M. E. Marshall, “Because life it selfe is but motion”: Toward an anthropology of mobility. Anthropol. Theory 15434–471 (2015).
5
D. W. Anthony, Migration in archeology: The baby and the bathwater. Am. Anthropol. 92895–914 (1990).
6
S. Burmeister, “The archaeology of migration: What can and should it accomplish?” in Migration and integration from prehistory to the Middle Ages, conferences of the State Museum of Prehistory Halle, H. Meller, F. Daim, J. Krause, R. Risch, Eds. (State Office for Monument Preservation and Archeology Saxony-Anhalt, Halle (Saale), German), 1 ed., 2017), pp. 57-68.
7
S. Burmeister, Archaeology and migration: Approaches to an archaeological proof of migration. Curr. Anthropol. 41539–567 (2000).
8
H. Wolinsky, Ancient DNA and contemporary politics. EMBO Rep. 20 (2019).
9
W. Haak et al., Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522207–211 (2015).
10
M. Lipson et al., Ancient genomes document multiple waves of migration in Southeast Asian prehistory. Science 36192–95 (2018).
11
P. Flegontov et al., Palaeo-Eskimo Genetic Ancestry. Nature 570236–240 (2019).
12
M. Furholt, Massive migrations? The impact of recent aDNA studies on our view of third millennium Europe. Eur. J. Archaeol. 21159–191 (2018).
13
O. Gokcumen, M. Frachetti, The impact of ancient genome studies in archaeology. Ann. Rev. Anthropol. 49277–298 (2020).
14
M. Furholt, Mobility and social change: Understanding the European Neolithic Period after the archaeogenetic revolution. J. Archaeol. Res. (2021).
15
G. S. Bradburd, P. L. Ralph, Spatial population genetics: It’s about time. Ann. Rev. Ecol. Evol. Syst. 50427–449 (2019).
16
D. Petkova, J. Novembre, M. Stephens, Visualizing spatial population structure with estimated effective migration surfaces. Nat. Genet. 4894–100 (2015).
17
G. S. Bradburd, P. L. Ralph, G. M. Coop, A spatial framework for understanding population structure and admixture. PLOS Genet. 12e1005703 (2016).
18
H. Al-Asadi, D. Petkova, M. Stephens, J. Novembre, Estimating recent migration and population-size surfaces. PLOS Genet. 15e1007908 (2019).
19
B. M. Peter, D. Petkova, J. Novembre, Genetic landscapes reveal how human genetic diversity aligns with geography. Mol. Biol. Evol. 37943–951 (2019).
20
L. Loog et al., Estimating mobility using sparse data: Application to human genetic variation. Proc. Natl. Acad. Sci. U.S.A. 11412213–12218 (2017).
21
F. Racimo et al., The spatiotemporal spread of human migrations during the European Holocene. Proc. Natl. Acad. Sci. U.S.A. 1178989–9000 (2020).
23
I. Mathieson et al., Genome-wide patterns of selection in 230 ancient Eurasians. Nature 528499–503 (2015).
24
J. Meisner, S. Liu, M. Huang, A. Albrechtsen, Large-scale inference of population structure in presence of missingness using PCA. Bioinformatics 371868–1875 (2021).
25
I. Lazaridis et al., Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513409–413 (2014).
26
W. Haak et al., Ancient DNA from European Early Neolithic farmers reveals their Near Eastern affinities. PLoS Biol. 8e1000536 (2010).
27
P. Skoglund et al., Origins and genetic legacy of Neolithic farmers and hunter-gatherers in Europe. Science 336466–469 (2012).
28
P. Menozzi, A. Piazza, L. Cavalli-Sforza, Synthetic maps of human gene frequencies in Europeans. Science 201786–792 (1978).
29
M. E. Allentoft et al., Population genomics of Bronze Age Eurasia. Nature 522167–172 (2015).
30
M. Porčić, T. Blagojević, J. Pendić, S. Stefanović, The timing and tempo of the Neolithic expansion across the Central Balkans in the light of the new radiocarbon evidence. J. Archaeol. Sci.: Rep. 33102528 (2020).
31
J. P. Bocquet-Appel, S. Naji, M. Vander Linden, J. K. Kozlowski, Detection of diffusion and contact zones of early farming in Europe from the space-time distribution of 14C dates. J. Archaeol. Sci. 36807–820 (2009).
32
J. P. Bocquet-Appel, S. Naji, M. Vander Linden, J. Kozlowski, Understanding the rates of expansion of the farming system in Europe. J. Archaeol. Sci. 39531–546 (2012).
33
R. Martiniano et al., Genomic signals of migration and continuity in Britain before the Anglo-Saxons. Nat. Commmon. 710326 (2016).
34
M. L. Antonio et al., Ancient Rome: A genetic crossroads of Europe and the Mediterranean. Science 366708–714 (2019).
35
M. Haber et al., A transient pulse of genetic admixture from the crusaders in the Near East identified from ancient genome sequences. Am. J. Hum. Genet. 104977–984 (2019).
36
I. Olalde et al., The genomic history of the Iberian Peninsula over the past 8000 years. Science 3631230–1234 (2019).
37
J. A. Sheridan, “The Neolithisation of Britain and Ireland: The big picture” in Landscapes in TransitionB. Finlayson, G. Warren, Eds. (Oxbow Books, Oxford, 2010), pp. 89–105.
38
N. Thorpe, “The Atlantic Mesolithic–Neolithic transition” in The Oxford Handbook of Neolithic EuropeC. Fowler, J. Harding, D. Hofmann, Eds. (Oxford University Press, 2015).
39
S. Brace et al., Anci ent genomes indicate population replacement in Early Neolithic Britain. Nat. Ecol. Evol. 3765–771 (2019).
40
F. Sánchez-Quinto et al., Megalithic tombs in western and northern Neolithic Europe were linked to a kindred society. Proc. Natl. Acad. Sci. U.S.A. 1169469–9474 (2019).
41
L. M. Cassidy et al., A dynastic elite in monumental Neolithic society. Nature 582384–388 (2020).
42
L. M. Cassidy et al., Neolithic and Bronze Age migration to Ireland and establishment of the insular Atlantic genome. Proc. Natl. Acad. Sci. U.S.A. 113368–373 (2016).
43
I. Olalde et al., The Beaker phenomenon and the genomic transformation of Northwest Europe. Nature 555190–196 (2018).
44
A. Margaryan et al., Population genomics of the Viking world. Nature 585390–396 (2020).
45
N. Patterson et al., Large-scale migration into Britain during the Middle to Late Bronze Age. Nature 1–14 (2021).
46
D. Gronenborn, P. Dolukhanov, “Early Neolithic manifestations in Central and Eastern Europe” in The Oxford Handbook of Neolithic EuropeC. Fowler, J. Harding, D. Hofmann, Eds. (Oxford University Press, 2015).
47
M. Lipson et al., Parallel palaeogenomic transects reveal complex genetic history of early European farmers. Nature 551368–372 (2017).
48
A. G. Nikitin et al., Interactions between earliest Linearbandkeramik farmers and Central European hunter gatherers at the dawn of European Neolithization. Sci. Rep. 919544 (2019).
49
S. Brunel et al., Ancient genomes from present-day France unveil 7,000 years of its demographic history. Proc. Natl. Acad. Sci. U.S.A. 11712791–12798 (2020).
50
D. M. Fernandes et al., A genomic Neolithic time transect of hunter-farmer admixture in central Poland. Sci. Rep. 8 (2018).
51
A. Furtwängler et al., Ancient genomes reveal social and genetic structure of Late Neolithic Switzerland. Nat. Commmon. 111915 (2020).
52
A. Linderholm et al., Corded ware cultural complexity uncovered using genomic and isotopic analysis from south-eastern Poland. Sci. Rep. 106885 (2020).
53
K. Kristiansen, TB Larsson, The Rise of Bronze Age Society – Travels, Transmissions and Transformations (Cambridge University Press, Cambridge, 2005).
54
J. Burger et al., Low prevalence of lactase persistence in Bronze Age Europe indicates ongoing strong selection over the last 3,000 years. Curr. Biol. 304307–4315.e13 (2020).
55
K. R. Veeramah et al., Population genomic analysis of elongated skulls reveals extensive female-biased immigration in Early Medieval Bavaria. Proc. Natl. Acad. Sci. 1153494–3499 (2018).
56
G. González-Fortes et al., Paleogenomic evidence for multi-generational mixing between Neolithic farmers and Mesolithic hunter-gatherers in the Lower Danube basin. Curr. Biol. 271801–1810.e10 (2017).
57
I. Olalde et al., A common genetic origin for early farmers from Mediterranean Cardial and Central European LBK cultures. Mol. Biol. Evol. msv181 (2015).
58
C. Valdiosera et al., Four millennia of Iberian biomolecular prehistory illustrate the impact of prehistoric migrations at the far end of Eurasia. Proc. Natl. Acad. Sci. U.S.A. 1153428–3433 (2018).
59
R. Martiniano et al., The population genomics of archaeological transition in west Iberia: Investigation of ancient substructure using imputation and haplotype-based methods. PLOS Genet. 13e1006852 (2017).
60
B. S. Paulsson, Radiocarbon dates and Bayesian modeling support maritime diffusion model for megaliths in Europe. Proc. Natl. Acad. Sci. U.S.A. 1163460–3465 (2019).
61
F. Sánchez-Quinto et al., Megalithic tombs in western and northern Neolithic Europe were linked to a kindred society. Proc. Natl. Acad. Sci. U.S.A. 1169469–9474 (2019).
62
C. E. Fischer et al., The multiple maternal legacy of the Late Iron Age group of Urville-Nacqueville (France, Normandy) documents a long-standing genetic contact zone in northwestern France. PLoS One 13e0207459 (2018).
63
G. González-Fortes et al., A western route of prehistoric human migration from Africa into the Iberian Peninsula. Proc. Biol. Sci. 28620182288 (2019).
64
J. H. Marcus et al., Genetic history from the Middle Neolithic to present on the Mediterranean island of Sardinia. Nat. Commmon. 11 (2020).
65
D. M. Fernandes et al., The spread of steppe and Iranian-related ancestry in the islands of the western Mediterranean. Nat. Ecol. Evol. 4334–345 (2020).
66
R. Fregel et al., Ancient genomes from North Africa evidence prehistoric migrations to the Maghreb from both the Levant and Europe. Proc. Natl. Acad. Sci. U.S.A. 1156774–6779 (2018).
67
M. Rivollat et al., Ancient genome-wide DNA from France highlights the complexity of interactions between Mesolithic hunter-gatherers and Neolithic farmers. Sci. Adv. 6eaaz5344 (2020).
68
A. Mittnik et al., Kinship-based social inequality in Bronze Age Europe. Science 366731–734 (2019).
69
C. J. Battey, P. L. Ralph, A. D. Kern, Predicting geographic location from genetic variation with deep neural networks. eLife 9 (2020).
70
A. E. Close, Reconstructing movement in prehistory. J. Archaeol. Method Theory 749–77 (2000).
71
K. Britton, “Isotope analysis for mobility and climate studies” in Archaeological Science: An Introduction (Cambridge University Press, 2020), pp. 99–124.
72
P. Verhagen, L. Nuninger, M. R. Groenhuijzen, “Modelling of pathways and movement networks in archaeology: An overview of current approaches” in Finding the Limits of the Limes: Modelling Demography, Economy and Transport on the Edge of the Roman Empire, P. Verhagen, J. Joyce, MR Groenhuijzen, Eds. (Springer International Publishing, Cham, 2019), pp. 217–249.
73
C. Perreault, The Quality of the Archaeological Record (University of Chicago Press, Chicago, 2019).
74
F. Riede, C. Hoggard, S. Shennan, Reconciling material cultures in archaeology with genetic data requires robust cultural evolutionary taxonomies. Palgrave Commun. 5 (2019).
75
J. Lindo et al., A time transect of exomes from a Native American population before and after European contact. Nat. Commmon. 713175 (2016).
76
SJ Micheletti et al., Genetic Consequences of the Transatlantic Slave Trade in the Americas. Am. J. Hum. Genet. 107265–277 (2020).
77
N. Patterson, A. L. Price, D. Reich, Population structure and Eigenanalysis. PLoS Genet. 2e190 (2006).
78
TS Korneliussen, A. Albrechtsen, R. Nielsen, ANGSD: Analysis of next generation sequencing data. BMC Bioinf. 15356 (2014).
79
J. Haslett, A. C. Parnell, A simple monotone process with application to radiocarbon-dated depth chronologies. J. R. Stat. Soc.: Ser. C (Appl. Stat.) 57399–418 (2008).
80
S. Purcell et al., PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81559–575 (2007).
81
A. L. Price et al., Long-range LD can confound genome scans in admixed populations. Am. J. Hum. Genet. 83132–135 (2008).
82
C. A. Anderson et al., Data quality control in genetic case-control association studies. Nat. Protoc. 51564–1573 (2010).
83
R. B. Gramacy, Surrogates: Gaussian Process Modeling, Design, and Optimization for the Applied Sciences (Taylor Francis Limited, 2020).
84
G. Matheron, Principles of geostatistics. Econ. Geol. 581246–1266 (1963).
85
R. B. Gramacy, laGP: Large-scale spatial modeling via Local Approximate Gaussian Processes in R. J. Stat. Software 721–46 (2016).
86
M. J. Heaton et al., A case study competition among methods for analyzing large spatial data. J. Agric. Biol. Environ. Stat. 24398–425 (2019).
87
R Core Team, R: A language and environment for statistical computing (R Foundation for Statistical Computing, Vienna, Austria) (2021).
88
M. Lang, checkmate: Fast argument checks for defensive R programming. R. J. 9437–445 (2017).
89
C. O. Wilke, cowplot: Streamlined plot theme and plot annotations for ‘ggplot2’ (2019). R package version 1.0.0.
90
B. Venables, fractional: Vulgar fractions in R (2016). R package version 0.1.3.
92
T. van den Brand, ggh4x: Hacks for ‘ggplot2’ (2022). R package version 0.2.2.
93
E. Campitelli, ggnewscale: Multiple fill and colour scales in ‘ggplot2’ (2022). R package version 0.4.7.
94
A. Kassambara, ggpubr: ‘ggplot2’ based publication ready plots (2020). R package version 0.4.0.
95
K. Slowikowski, ggrepel: Automatically position non-overlapping text labels with ‘ggplot2’ (2021). R package version 0.9.1.
96
C. O. Wilke, ggridges: Ridgeline plots in ‘ggplot2’ (2021). R package version 0.5.3.
97
G. Csardi, T. Nepusz, The igraph software package for complex network research. InterJournal Complex Syst. 1695 (2006).
98
N. Frerebeau, khroma: Color schemes for scientific data visualization (Bordeaux Montaigne University, Pessac, France) (2021). R package version 1.7.0.
99
S. Meschiari, latex2exp: Use LaTeX expressions in plots (2015). R package version 0.4.0.
100
S. M. Edwards, lemon: Freshing up your ‘ggplot2’ plots (2020). R package version 0.4.5.
101
G. Csárdi, R. FitzJohn, progress: Terminal progress bars (2019). R package version 1.2.2.
102
A. South, rnaturalearth: World map data from Natural Earth (2021). R package version 0.2.0.
103
E. Pebesma, Simple Features for R: Standardized support for spatial vector data. R. J. 10439–446 (2018).
104
S. Herrando-Pérez, R. Tobler, C. D. Huber, smartsnp, an R package for fast multivariate analyses of big genomic data. Method s Ecol. Evol. 122084–2093 (2021).
105
Garnier et al., viridis – colorblind-friendly color maps for R (2021). R package version 0.6.1.
106
H. Wickham et al., Welcome to the tidyverse. J. Open Source Softw. 41686 (2019).
Information & Authors
Information
Published in
Proceedings of the National Academy of Sciences
Vol. 120 | No. 9
February 28, 2023
Classifications
Copyright
Data, Materials, and Software Availability
The code for this paper is available in a repository here: https://doi.org/10.17605/OSF.IO/6UWM5. From that, we outsourced the main similarity search and mobility estimation workflow into an R package available here: https://github.com/nevrome/mobest. All data analysis and plotting was done in R (87) with the following packages: checkmate (88), cowplot (89), fractional (90), future (91), ggh4x (92), ggnewscale (93), ggpubr (94), ggrepel (95), ggridges (96), igraph (97), khroma (98), latex2exp (99), lemon (100), progress (101), rnaturalearth (102), sf (103), smartsnp (104), viridis (105), and, finally, the tidyverse and the many packages within it ref. 106. Previously published data were used for this work (Allen Ancient DNA Resource https://reich.hms.harvard.edu/allen-ancient-dna-resource-aadr-downloadable-genotypes-present-day-and-ancient-dna-dataversion 50.0).
Submission history
Received: November 1, 2022
Accepted: December 5, 2022
Published online: February 23, 2023
Published in issue: February 28, 2023
Change history
March 2, 2023: The supporting datasets have been updated.
Keywords
- aDNA
- prehistory
- mobility estimation
- Gaussian process regression
Acknowledgments
This research was financed by the International Max Planck Research School for the Science of Human History (IMPRS-SHH) and carried out on computational facilities of the Max Planck Institutes for Geoanthropology (formerly for the Science of Human History) and for Evolutionary Anthropology. Data collection was significantly simplified thanks to the Allen Ancient DNA Resource and the Poseidon genotype data initiative. We gratefully acknowledge insightful discussions with Joscha Gretzinger and helpful advice from Thiseas C. Lamnidis, James A. Fellows Yates, He Yu, Ayshin Ghalichi, Ke Wang (all currently or formerly affiliated with MPI-EVA), Martin Hinz (University Bern), Martin J. Kümmel (University Jena), Oliver Nakoinz (University Kiel), and all members of the Population genetics working group at the MPI-EVA. This project has also received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement number 851511).
Author Contributions
C.S. and S.S. designed research; C.S. and S.S. performed research; C.S. analyzed data; and C.S. and S.S. wrote the paper.
Competing Interests
The authors declare no competing interest.
Notes
This article is a PNAS Direct Submission. L.L. is a guest editor invited by the Editorial Board.
Authors
Affiliations
Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
International Max Planck Research School for the Science of Human History, Max Planck Institute for Geoanthropology (formerly known as Max Planck Institute for the Science of Human History), Jena 07745, Germany
Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
Notes
Metrics & Citations
Metrics
Citation statements
Altmetrics
Citations
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
View Options
View options
PDF format
Download this article as a PDF file
DOWNLOAD PDF
Get Access
Media
Figures
Tables
Other