Suppr超能文献

利用等位基因计数数据高效推断多个群体的人口历史。

Efficiently inferring the demographic history of many populations with allele count data.

作者信息

Kamm Jack, Terhorst Jonathan, Durbin Richard, Song Yun S

机构信息

Wellcome Sanger Institute, Hinxton, Cambridge, UK.

Department of Genetics, University of Cambridge, Cambridge, UK.

出版信息

J Am Stat Assoc. 2020;115(531):1472-1487. doi: 10.1080/01621459.2019.1635482. Epub 2019 Jul 22.

Abstract

The sample frequency spectrum (SFS), or histogram of allele counts, is an important summary statistic in evolutionary biology, and is often used to infer the history of population size changes, migrations, and other demographic events affecting a set of populations. The expected multipopulation SFS under a given demographic model can be efficiently computed when the populations in the model are related by a tree, scaling to hundreds of populations. Admixture, back-migration, and introgression are common natural processes that violate the assumption of a tree-like population history, however, and until now the expected SFS could be computed for only a handful of populations when the demographic history is not a tree. In this article, we present a new method for efficiently computing the expected SFS and linear functionals of it, for demographies described by general directed acyclic graphs. This method can scale to more populations than p reviously possible for complex demographic histories including admixture. We apply our method to an 8-population SFS to estimate the timing and strength of a proposed "basal Eurasian" admixture event in human history. We implement and release our method in a new open-source software package momi2.

摘要

样本频率谱(SFS),即等位基因计数的直方图,是进化生物学中的一个重要汇总统计量,常用于推断种群大小变化、迁移以及影响一组种群的其他人口统计学事件的历史。当模型中的种群通过一棵树相关联时,在给定的人口统计学模型下,可以高效地计算预期的多群体SFS,扩展到数百个种群。然而,混合、回交和基因渗入是常见的自然过程,它们违反了树状种群历史的假设,并且到目前为止,当人口统计学历史不是树状时,对于少数几个种群才能计算预期的SFS。在本文中,我们提出了一种新方法,用于高效计算由一般有向无环图描述的人口统计学的预期SFS及其线性泛函。该方法可以扩展到比以前更多的种群,适用于包括混合在内的复杂人口统计学历史。我们将我们的方法应用于一个8种群的SFS,以估计人类历史中一个提议的“基础欧亚人”混合事件的时间和强度。我们在一个新的开源软件包momi2中实现并发布了我们的方法。

相似文献

1
Efficiently inferring the demographic history of many populations with allele count data.
J Am Stat Assoc. 2020;115(531):1472-1487. doi: 10.1080/01621459.2019.1635482. Epub 2019 Jul 22.
2
Efficient computation of the joint sample frequency spectra for multiple populations.
J Comput Graph Stat. 2017;26(1):182-194. doi: 10.1080/10618600.2016.1159212. Epub 2017 Feb 16.
3
Geometry of the Sample Frequency Spectrum and the Perils of Demographic Inference.
Genetics. 2018 Oct;210(2):665-682. doi: 10.1534/genetics.118.300733. Epub 2018 Jul 31.
4
Accuracy of Demographic Inferences from the Site Frequency Spectrum: The Case of the Yoruba Population.
Genetics. 2017 May;206(1):439-449. doi: 10.1534/genetics.116.192708. Epub 2017 Mar 24.
5
A likelihood-based framework for demographic inference from genealogical trees.
bioRxiv. 2023 Oct 13:2023.10.10.561787. doi: 10.1101/2023.10.10.561787.
6
The Promise of Inferring the Past Using the Ancestral Recombination Graph.
Genome Biol Evol. 2024 Feb 1;16(2). doi: 10.1093/gbe/evae005.
7
Exact calculation of the expected SFS in structured populations.
Theor Popul Biol. 2025 Jun;163:50-61. doi: 10.1016/j.tpb.2025.03.003. Epub 2025 Mar 21.
8
Inferring admixture histories of human populations using linkage disequilibrium.
Genetics. 2013 Apr;193(4):1233-54. doi: 10.1534/genetics.112.147330. Epub 2013 Feb 14.
9
Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories.
G3 (Bethesda). 2017 Nov 6;7(11):3605-3620. doi: 10.1534/g3.117.300259.
10
Faster inference of complex demographic models from large allele frequency spectra.
bioRxiv. 2024 Mar 29:2024.03.26.586844. doi: 10.1101/2024.03.26.586844.

引用本文的文献

1
GHIST 2024: The 1st Genomic History Inference Strategies Tournament.
bioRxiv. 2025 Aug 11:2025.08.05.668560. doi: 10.1101/2025.08.05.668560.
4
A likelihood-based framework for demographic inference from genealogical trees.
Nat Genet. 2025 Apr;57(4):865-874. doi: 10.1038/s41588-025-02129-x. Epub 2025 Mar 20.
5
A structured coalescent model reveals deep ancestral structure shared by all modern humans.
Nat Genet. 2025 Apr;57(4):856-864. doi: 10.1038/s41588-025-02117-1. Epub 2025 Mar 18.
6
A General Framework for Branch Length Estimation in Ancestral Recombination Graphs.
bioRxiv. 2025 Feb 15:2025.02.14.638385. doi: 10.1101/2025.02.14.638385.
7
Leveraging graphical model techniques to study evolution on phylogenetic networks.
Philos Trans R Soc Lond B Biol Sci. 2025 Feb 13;380(1919):20230310. doi: 10.1098/rstb.2023.0310. Epub 2025 Feb 20.
9
Estimating evolutionary and demographic parameters via ARG-derived IBD.
PLoS Genet. 2025 Jan 8;21(1):e1011537. doi: 10.1371/journal.pgen.1011537. eCollection 2025 Jan.
10
Haplotype-resolved genome of a papeda provides insights into the geographical origin and evolution of Citrus.
J Integr Plant Biol. 2025 Feb;67(2):276-293. doi: 10.1111/jipb.13819. Epub 2025 Jan 6.

本文引用的文献

1
On the decidability of population size histories from finite allele frequency spectra.
Theor Popul Biol. 2018 Mar;120:42-51. doi: 10.1016/j.tpb.2017.12.008. Epub 2018 Jan 3.
2
Inferring the Joint Demographic History of Multiple Populations: Beyond the Diffusion Approximation.
Genetics. 2017 Jul;206(3):1549-1567. doi: 10.1534/genetics.117.200493. Epub 2017 May 11.
3
Efficient computation of the joint sample frequency spectra for multiple populations.
J Comput Graph Stat. 2017;26(1):182-194. doi: 10.1080/10618600.2016.1159212. Epub 2017 Feb 16.
4
Robust and scalable inference of population history from hundreds of unphased whole genomes.
Nat Genet. 2017 Feb;49(2):303-309. doi: 10.1038/ng.3748. Epub 2016 Dec 26.
5
DESCARTES' RULE OF SIGNS AND THE IDENTIFIABILITY OF POPULATION DEMOGRAPHIC MODELS FROM GENOMIC VARIATION DATA.
Ann Stat. 2014;42(6):2469-2493. doi: 10.1214/14-AOS1264. Epub 2014 Oct 20.
6
The Simons Genome Diversity Project: 300 genomes from 142 diverse populations.
Nature. 2016 Oct 13;538(7624):201-206. doi: 10.1038/nature18964. Epub 2016 Sep 21.
7
The mutation rate in human evolution and demographic inference.
Curr Opin Genet Dev. 2016 Dec;41:36-43. doi: 10.1016/j.gde.2016.07.008. Epub 2016 Aug 31.
8
Genomic insights into the origin of farming in the ancient Near East.
Nature. 2016 Aug 25;536(7617):419-24. doi: 10.1038/nature19310. Epub 2016 Jul 25.
9
Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes.
PLoS Comput Biol. 2016 May 4;12(5):e1004842. doi: 10.1371/journal.pcbi.1004842. eCollection 2016 May.
10
Nuclear DNA sequences from the Middle Pleistocene Sima de los Huesos hominins.
Nature. 2016 Mar 24;531(7595):504-7. doi: 10.1038/nature17405. Epub 2016 Mar 14.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验