• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超快全基因组对合并时间的推断。

Ultrafast genome-wide inference of pairwise coalescence times.

机构信息

Department of Genetics, University of Cambridge, Cambridge CB2 1TN, United Kingdom

Department of Genetics, University of Cambridge, Cambridge CB2 1TN, United Kingdom.

出版信息

Genome Res. 2023 Jul;33(7):1023-1031. doi: 10.1101/gr.277665.123. Epub 2023 Aug 10.

DOI:10.1101/gr.277665.123
PMID:37562965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10538485/
Abstract

The pairwise sequentially Markovian coalescent (PSMC) algorithm and its extensions infer the coalescence time of two homologous chromosomes at each genomic position. This inference is used in reconstructing demographic histories, detecting selection signatures, studying genome-wide associations, constructing ancestral recombination graphs, and more. Inference of coalescence times between each pair of haplotypes in a large data set is of great interest, as they may provide rich information about the population structure and history of the sample. Here, we introduce a new method, , which is more than 10 times faster than current methods. To obtain this speed-up, we represent the posterior coalescence time distributions succinctly as a gamma distribution with just two parameters; in contrast, PSMC and its extensions hold these in a vector over discrete intervals of time. Thus, Gamma-SMC has constant time-complexity per site, without dependence on the number of discrete time states. Additionally, because of this continuous representation, our method is able to infer times spanning many orders of magnitude and, as such, is robust to parameter misspecification. We describe how this approach works, show its performance on simulated and real data, and illustrate its use in studying recent positive selection in the 1000 Genomes Project data set.

摘要

成对序贯马尔可夫链 coalescent(PSMC)算法及其扩展推断了每个基因组位置两条同源染色体的合并时间。这种推断用于重建人口历史、检测选择特征、研究全基因组关联、构建祖先重组图谱等。在大型数据集的每个 haplotypes 对之间推断合并时间是非常有趣的,因为它们可能提供有关样本种群结构和历史的丰富信息。在这里,我们引入了一种新方法,,比当前方法快 10 多倍。为了获得这种加速,我们将后验合并时间分布简洁地表示为具有两个参数的伽马分布;相比之下,PSMC 及其扩展在时间离散区间的向量中保持这些参数。因此,Gamma-SMC 每个位置的时间复杂度是常数,不依赖于离散时间状态的数量。此外,由于这种连续表示,我们的方法能够推断跨越多个数量级的时间,并且因此对参数指定不敏感。我们描述了这种方法的工作原理,展示了它在模拟和真实数据上的性能,并说明了它在研究 1000 基因组计划数据集最近正选择中的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/afd3244f2197/1023f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/7d91b089f326/1023f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/aaa0b7998335/1023f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/26914a0ca687/1023f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/545fa93de204/1023f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/afd3244f2197/1023f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/7d91b089f326/1023f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/aaa0b7998335/1023f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/26914a0ca687/1023f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/545fa93de204/1023f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d34e/10538485/afd3244f2197/1023f05.jpg

相似文献

1
Ultrafast genome-wide inference of pairwise coalescence times.超快全基因组对合并时间的推断。
Genome Res. 2023 Jul;33(7):1023-1031. doi: 10.1101/gr.277665.123. Epub 2023 Aug 10.
2
Estimating variable effective population sizes from multiple genomes: a sequentially markov conditional sampling distribution approach.从多个基因组估计可变有效种群大小:一种顺序马尔可夫条件抽样分布方法。
Genetics. 2013 Jul;194(3):647-62. doi: 10.1534/genetics.112.149096. Epub 2013 Apr 22.
3
Evaluation of methods for estimating coalescence times using ancestral recombination graphs.使用祖先重组图估计合并时间的方法评估
Genetics. 2022 May 5;221(1). doi: 10.1093/genetics/iyac044.
4
The Promise of Inferring the Past Using the Ancestral Recombination Graph.利用祖先重组图谱推断过去的可能性。
Genome Biol Evol. 2024 Feb 1;16(2). doi: 10.1093/gbe/evae005.
5
Minimal-assumption inference from population-genomic data.基于群体基因组数据的最小假设推断
Elife. 2017 Jul 3;6:e24836. doi: 10.7554/eLife.24836.
6
Ancestral population genomics using coalescence hidden Markov models and heuristic optimisation algorithms.使用合并隐马尔可夫模型和启发式优化算法的祖先群体基因组学。
Comput Biol Chem. 2015 Aug;57:80-92. doi: 10.1016/j.compbiolchem.2015.02.001. Epub 2015 Mar 5.
7
Modeling Human Population Separation History Using Physically Phased Genomes.利用物理定相基因组对人类群体分离历史进行建模。
Genetics. 2017 Jan;205(1):385-395. doi: 10.1534/genetics.116.192963. Epub 2016 Nov 9.
8
Genome-wide inference of ancestral recombination graphs.全基因组祖先重组图推断
PLoS Genet. 2014 May 15;10(5):e1004342. doi: 10.1371/journal.pgen.1004342. eCollection 2014.
9
The IICR (inverse instantaneous coalescence rate) as a summary of genomic diversity: insights into demographic inference and model choice.反向瞬时聚合率作为基因组多样性的总结:对人口推断和模型选择的深入了解。
Heredity (Edinb). 2018 Jan;120(1):13-24. doi: 10.1038/s41437-017-0005-6. Epub 2017 Nov 8.
10
A new method for modeling coalescent processes with recombination.一种新的带有重组的合并过程建模方法。
BMC Bioinformatics. 2014 Aug 11;15(1):273. doi: 10.1186/1471-2105-15-273.

引用本文的文献

1
Robust and accurate Bayesian inference of genome-wide genealogies for hundreds of genomes.针对数百个基因组的全基因组谱系进行稳健且准确的贝叶斯推断。
Nat Genet. 2025 Sep 8. doi: 10.1038/s41588-025-02317-9.
2
Coalescence and Translation: A Language Model for Population Genetics.合并与翻译:一种用于群体遗传学的语言模型
bioRxiv. 2025 Jun 27:2025.06.24.661337. doi: 10.1101/2025.06.24.661337.
3
Accessible, realistic genome simulation with selection using stdpopsim.使用stdpopsim进行具有选择的可访问、现实的基因组模拟。

本文引用的文献

1
The selection landscape and genetic legacy of ancient Eurasians.古欧亚人的选择景观和遗传遗产。
Nature. 2024 Jan;625(7994):312-320. doi: 10.1038/s41586-023-06705-1. Epub 2024 Jan 10.
2
selscan 2.0: scanning for sweeps in unphased data.selscan 2.0:在非相位数据中扫描扫描。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae006.
3
SLiM 4: Multispecies Eco-Evolutionary Modeling.SLiM 4:多物种生态进化建模。
bioRxiv. 2025 Mar 23:2025.03.23.644823. doi: 10.1101/2025.03.23.644823.
4
Improved inference of population histories by integrating genomic and epigenomic data.通过整合基因组和表观基因组数据来改进群体历史推断。
Elife. 2024 Sep 12;12:RP89470. doi: 10.7554/eLife.89470.
5
Rapid Adaptation and Interspecific Introgression in the North American Crop Pest Helicoverpa zea.北美作物害虫棉铃虫的快速适应和种间基因渐渗。
Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae129.
6
Accurate inference of population history in the presence of background selection.在存在背景选择的情况下对种群历史进行准确推断。
bioRxiv. 2024 Jan 20:2024.01.18.576291. doi: 10.1101/2024.01.18.576291.
Am Nat. 2023 May;201(5):E127-E139. doi: 10.1086/723601. Epub 2023 Mar 21.
4
Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits.生物银行规模下的祖先重组图推断实现了复杂性状的系谱分析。
Nat Genet. 2023 May;55(5):768-776. doi: 10.1038/s41588-023-01379-x. Epub 2023 May 1.
5
Efficient ancestry and mutation simulation with msprime 1.0.利用 msprime 1.0 进行高效的祖先和突变模拟。
Genetics. 2022 Mar 3;220(3). doi: 10.1093/genetics/iyab229.
6
A Deep-Learning Approach for Inference of Selective Sweeps from the Ancestral Recombination Graph.基于祖先重组图谱的选择性清除推断的深度学习方法
Mol Biol Evol. 2022 Jan 7;39(1). doi: 10.1093/molbev/msab332.
7
Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations.对 487409 个英国样本的亲缘关系检测揭示了精细的人口结构和超罕见变异关联。
Nat Commun. 2020 Nov 30;11(1):6130. doi: 10.1038/s41467-020-19588-x.
8
A community-maintained standard library of population genetic models.一个社区维护的种群遗传模型标准库。
Elife. 2020 Jun 23;9:e54967. doi: 10.7554/eLife.54967.
9
Tracking human population structure through time from whole genome sequences.从全基因组序列追踪随时间变化的人类人口结构。
PLoS Genet. 2020 Mar 9;16(3):e1008552. doi: 10.1371/journal.pgen.1008552. eCollection 2020 Mar.
10
A practical introduction to sequentially Markovian coalescent methods for estimating demographic history from genomic data.从基因组数据估计种群历史的序列马尔可夫合并方法实用介绍。
Ecol Evol. 2019 Dec 7;10(1):579-589. doi: 10.1002/ece3.5888. eCollection 2020 Jan.