使用 f 统计量对非洲人群历史进行建模时，应用所有先前提出的 SNP 确定方案会产生偏差。

Modeling of African population history using f-statistics is biased when applying all previously proposed SNP ascertainment schemes.

机构信息

Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America.

Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia.

出版信息

PLoS Genet. 2023 Sep 7;19(9):e1010931. doi: 10.1371/journal.pgen.1010931. eCollection 2023 Sep.

DOI:10.1371/journal.pgen.1010931

PMID:37676865

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10508636/

Abstract

f-statistics have emerged as a first line of analysis for making inferences about demographic history from genome-wide data. Not only are they guaranteed to allow robust tests of the fits of proposed models of population history to data when analyzing full genome sequencing data-that is, all single nucleotide polymorphisms (SNPs) in the individuals being analyzed-but they are also guaranteed to allow robust tests of models for SNPs ascertained as polymorphic in a population that is an outgroup in a phylogenetic sense to all groups being analyzed. True "outgroup ascertainment" is in practice impossible in humans because our species has arisen from a substructured ancestral population that does not descend from a homogeneous ancestral population going back many hundreds of thousands of years into the past. However, initial studies suggested that non-outgroup-ascertainment schemes might produce robust enough results using f-statistics, and that motivated widespread fitting of models to data using non-outgroup-ascertained SNP panels such as the "Affymetrix Human Origins array" which has been genotyped on thousands of modern individuals from hundreds of populations, or the "1240k" in-solution enrichment reagent which has been the source of about 70% of published genome-wide data for ancient humans. In this study, we show that while analyses of population history using such panels work well for studies of relationships among non-African populations and one African outgroup, when co-modeling more than one sub-Saharan African and/or archaic human groups (Neanderthals and Denisovans), fitting of f-statistics to such SNP sets is expected to frequently lead to false rejection of true demographic histories, and failure to reject incorrect models. Analyzing panels of SNPs polymorphic in archaic humans, which has been suggested as a solution for the ascertainment problem, has limited statistical power and retains important biases. However, by carrying out simulations of diverse demographic histories, we show that bias in inferences based on f-statistics can be minimized by ascertaining on variants common in a union of diverse African groups; such ascertainment retains high statistical power while allowing co-analysis of archaic and modern groups.

摘要

f 统计量已成为从全基因组数据推断人口历史的首选分析方法。不仅在分析全基因组测序数据时，它们可以保证对群体历史提出的模型与数据的拟合进行稳健的检验——即分析个体中所有的单核苷酸多态性（SNP）——而且还可以保证对作为所有被分析群体的外群群体中确定为多态性的 SNP 模型进行稳健的检验。在实践中，真正的“外群确定”在人类中是不可能的，因为我们的物种是从一个亚结构的祖先群体中产生的，这个祖先群体不是从一个在过去几十万年中一直存在的同质祖先群体中衍生出来的。然而，最初的研究表明，非外群确定方案可能使用 f 统计量产生足够稳健的结果，这促使人们广泛地使用非外群确定的 SNP 面板拟合模型到数据中，例如“Affymetrix 人类起源阵列”，该阵列已经在来自数百个群体的数千个现代个体中进行了基因分型，或者“1240k”溶液内富集试剂，该试剂是大约 70%的已发表的古代人类全基因组数据的来源。在这项研究中，我们表明，虽然使用这些面板进行人口历史分析对于研究非非洲人群体之间的关系以及一个非洲外群体非常有效，但当同时建模超过一个撒哈拉以南非洲和/或古代人类群体（尼安德特人和丹尼索万人）时，拟合 f 统计量到这样的 SNP 集合预计会经常导致对真实人口历史的错误拒绝，并且不能拒绝不正确的模型。分析古人类多态性的 SNP 面板已被提议作为确定问题的解决方案，但这种方法统计能力有限，并且保留了重要的偏差。然而，通过进行各种人口历史的模拟，我们表明，基于 f 统计量的推断偏差可以通过在多样化的非洲群体联盟中常见的变体进行确定来最小化；这种确定方法保持了高统计能力，同时允许对古代和现代群体进行共同分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3232/10508636/3ee5b6189070/pgen.1010931.g001.jpg

相似文献

Modeling of African population history using f-statistics is biased when applying all previously proposed SNP ascertainment schemes.使用 f 统计量对非洲人群历史进行建模时，应用所有先前提出的 SNP 确定方案会产生偏差。

PLoS Genet. 2023 Sep 7;19(9):e1010931. doi: 10.1371/journal.pgen.1010931. eCollection 2023 Sep.

Modeling of African population history using -statistics can be highly biased and is not addressed by previously suggested SNP ascertainment schemes.使用F统计量对非洲人群历史进行建模可能存在高度偏差，并且先前提出的单核苷酸多态性（SNP）确定方案并未解决这一问题。

bioRxiv. 2023 Jan 22:2023.01.22.525077. doi: 10.1101/2023.01.22.525077.

How do SNP ascertainment schemes and population demographics affect inferences about population history?单核苷酸多态性（SNP）确定方案和人口统计学如何影响对人口历史的推断？

BMC Genomics. 2015 Apr 3;16(1):266. doi: 10.1186/s12864-015-1469-5.

Neanderthal and Denisova genetic affinities with contemporary humans: introgression versus common ancestral polymorphisms.尼安德特人和丹尼索瓦人与当代人类的遗传亲和力：基因渗入与共同祖先多态性。

Gene. 2013 Nov 1;530(1):83-94. doi: 10.1016/j.gene.2013.06.005. Epub 2013 Jul 19.

Ascertainment biases in SNP chips affect measures of population divergence.SNP 芯片中的确定偏差会影响种群分歧的度量。

Mol Biol Evol. 2010 Nov;27(11):2534-47. doi: 10.1093/molbev/msq148. Epub 2010 Jun 17.

Ancestry informative marker panels for African Americans based on subsets of commercially available SNP arrays.基于商业 SNP 芯片子集的非裔美国人溯源信息标记面板。

Genet Epidemiol. 2011 Jan;35(1):80-3. doi: 10.1002/gepi.20550.

Model-based analyses of whole-genome data reveal a complex evolutionary history involving archaic introgression in Central African Pygmies.基于模型的全基因组数据分析揭示了中非俾格米人复杂的进化史，其中涉及古老基因渗入。

Genome Res. 2016 Mar;26(3):291-300. doi: 10.1101/gr.196634.115. Epub 2016 Feb 17.

Detecting archaic introgression using an unadmixed outgroup.利用未混合的外群检测古老的基因渗入。

PLoS Genet. 2018 Sep 18;14(9):e1007641. doi: 10.1371/journal.pgen.1007641. eCollection 2018 Sep.

The discovery of single-nucleotide polymorphisms--and inferences about human demographic history.单核苷酸多态性的发现及对人类人口历史的推断。

Am J Hum Genet. 2001 Dec;69(6):1332-47. doi: 10.1086/324521. Epub 2001 Nov 6.

Vitis phylogenomics: hybridization intensities from a SNP array outperform genotype calls.葡萄系统基因组学：SNP 芯片的杂交强度优于基因型分析。

PLoS One. 2013 Nov 13;8(11):e78680. doi: 10.1371/journal.pone.0078680. eCollection 2013.

引用本文的文献

The genomic footprints of migration: how ancient DNA reveals our history of mobility.迁徙的基因组印记：古代DNA如何揭示我们的迁徙历史。

Genome Biol. 2025 Jul 16;26(1):206. doi: 10.1186/s13059-025-03664-w.

Performance of qpAdm-based screens for genetic admixture on graph-shaped histories and stepping stone landscapes.基于qpAdm的基因混合筛选在图状历史和踏脚石景观上的表现。

Genetics. 2025 May 8;230(1). doi: 10.1093/genetics/iyaf047.

Genomic exploration of the journey of Plasmodium vivax in Latin America.拉丁美洲间日疟原虫传播历程的基因组探索。

PLoS Pathog. 2025 Jan 13;21(1):e1012811. doi: 10.1371/journal.ppat.1012811. eCollection 2025 Jan.

9,000 years of genetic continuity in southernmost Africa demonstrated at Oakhurst rockshelter.奥克赫斯特岩洞遗址揭示了最南端非洲地区 9000 年的遗传连续性。

Nat Ecol Evol. 2024 Nov;8(11):2121-2134. doi: 10.1038/s41559-024-02532-3. Epub 2024 Sep 19.

An explanation for the sister repulsion phenomenon in Patterson's f-statistics.帕特森 F 统计量中姐妹排斥现象的解释。

Genetics. 2024 Nov 6;228(3). doi: 10.1093/genetics/iyae144.

Testing times: disentangling admixture histories in recent and complex demographies using ancient DNA.测试时代：利用古代 DNA 解开近代和复杂人口中的混合历史。

Genetics. 2024 Sep 4;228(1). doi: 10.1093/genetics/iyae110.

Hunter-gatherer genetics research: Importance and avenues.狩猎采集者遗传学研究：重要性与途径

Evol Hum Sci. 2024 Feb 15;6:e15. doi: 10.1017/ehs.2024.7. eCollection 2024.

Testing Times: Challenges in Disentangling Admixture Histories in Recent and Complex Demographies.测试时代：解析近期复杂人口结构中的混合历史所面临的挑战

bioRxiv. 2023 Nov 15:2023.11.13.566841. doi: 10.1101/2023.11.13.566841.

Performance of -based screens for genetic admixture on admixture-graph-shaped histories and stepping-stone landscapes.基于 - 的遗传混合筛选在混合图形状的历史和踏脚石景观上的性能。（注：原文中“-based”前缺少具体内容，翻译可能不太准确，需根据完整准确的原文进一步完善）

bioRxiv. 2025 Feb 3:2023.04.25.538339. doi: 10.1101/2023.04.25.538339.

本文引用的文献

A weakly structured stem for human origins in Africa.人类起源于非洲的弱结构主干。

Nature. 2023 May;617(7962):755-763. doi: 10.1038/s41586-023-06055-y. Epub 2023 May 17.

On the limits of fitting complex models of population history to -statistics.关于将复杂的群体历史模型拟合到 -statistics 的限制。

Elife. 2023 Jun 29;12:e85492. doi: 10.7554/eLife.85492.

Population Genomic Evidence of Adaptive Response during the Invasion History of Plasmodium falciparum in the Americas.人口基因组证据表明，在美洲恶性疟原虫的入侵历史中存在适应性反应。

Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad082.

Entwined African and Asian genetic roots of medieval peoples of the Swahili coast.中世纪斯瓦希里沿海居民的非洲和亚洲遗传根源交织在一起。

Nature. 2023 Mar;615(7954):866-873. doi: 10.1038/s41586-023-05754-w. Epub 2023 Mar 29.

Palaeogenomics of Upper Palaeolithic to Neolithic European hunter-gatherers.旧石器时代晚期至新石器时代欧洲狩猎采集者的古基因组学

Nature. 2023 Mar;615(7950):117-126. doi: 10.1038/s41586-023-05726-0. Epub 2023 Mar 1.

Bayesian inference of admixture graphs on Native American and Arctic populations.贝叶斯推断美洲原住民和北极人群的混合图。

PLoS Genet. 2023 Feb 13;19(2):e1010410. doi: 10.1371/journal.pgen.1010410. eCollection 2023 Feb.

Genomic perspectives on human dispersals during the Holocene.人类在全新世迁徙过程中的基因组视角。

Proc Natl Acad Sci U S A. 2023 Jan 24;120(4):e2209475119. doi: 10.1073/pnas.2209475119. Epub 2023 Jan 17.

Three assays for in-solution enrichment of ancient human DNA at more than a million SNPs.三种在溶液中富集古人类 DNA 超过一百万 SNPs 的方法。

Genome Res. 2022 Nov-Dec;32(11-12):2068-2078. doi: 10.1101/gr.276728.122. Epub 2022 Dec 14.

Grey wolf genomic history reveals a dual ancestry of dogs.灰狼基因组历史揭示了狗的双重起源。

Nature. 2022 Jul;607(7918):313-320. doi: 10.1038/s41586-022-04824-9. Epub 2022 Jun 29.

Ancient genomes from the last three millennia support multiple human dispersals into Wallacea.过去三千年的古基因组支持人类多次向华莱士地区扩散。

Nat Ecol Evol. 2022 Jul;6(7):1024-1034. doi: 10.1038/s41559-022-01775-2. Epub 2022 Jun 9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用 f 统计量对非洲人群历史进行建模时，应用所有先前提出的 SNP 确定方案会产生偏差。

Modeling of African population history using f-statistics is biased when applying all previously proposed SNP ascertainment schemes.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献