• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

快速“合并”模拟。

Fast "coalescent" simulation.

作者信息

Marjoram Paul, Wall Jeff D

机构信息

Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90089-9011, USA.

出版信息

BMC Genet. 2006 Mar 15;7:16. doi: 10.1186/1471-2156-7-16.

DOI:10.1186/1471-2156-7-16
PMID:16539698
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1458357/
Abstract

BACKGROUND

The amount of genome-wide molecular data is increasing rapidly, as is interest in developing methods appropriate for such data. There is a consequent increasing need for methods that are able to efficiently simulate such data. In this paper we implement the sequentially Markovian coalescent algorithm described by McVean and Cardin and present a further modification to that algorithm which slightly improves the closeness of the approximation to the full coalescent model. The algorithm ignores a class of recombination events known to affect the behavior of the genealogy of the sample, but which do not appear to affect the behavior of generated samples to any substantial degree.

RESULTS

We show that our software is able to simulate large chromosomal regions, such as those appropriate in a consideration of genome-wide data, in a way that is several orders of magnitude faster than existing coalescent algorithms.

CONCLUSION

This algorithm provides a useful resource for those needing to simulate large quantities of data for chromosomal-length regions using an approach that is much more efficient than traditional coalescent models.

摘要

背景

全基因组分子数据量正在迅速增加,人们对开发适用于此类数据的方法的兴趣也在增加。因此,对能够有效模拟此类数据的方法的需求也日益增长。在本文中,我们实现了由麦克维恩(McVean)和卡丹(Cardin)描述的顺序马尔可夫合并算法,并对该算法进行了进一步修改,这略微提高了其对完整合并模型的近似程度。该算法忽略了一类已知会影响样本系谱行为的重组事件,但这些事件似乎在很大程度上并不影响生成样本的行为。

结果

我们表明,我们的软件能够以比现有合并算法快几个数量级的方式模拟大型染色体区域,例如在考虑全基因组数据时适用的区域。

结论

该算法为那些需要使用比传统合并模型高效得多的方法来模拟染色体长度区域的大量数据的人提供了有用的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e15c/1458357/5d405200f21f/1471-2156-7-16-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e15c/1458357/f8e4062cf1fd/1471-2156-7-16-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e15c/1458357/6040e4237fc1/1471-2156-7-16-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e15c/1458357/5d405200f21f/1471-2156-7-16-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e15c/1458357/f8e4062cf1fd/1471-2156-7-16-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e15c/1458357/6040e4237fc1/1471-2156-7-16-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e15c/1458357/5d405200f21f/1471-2156-7-16-3.jpg

相似文献

1
Fast "coalescent" simulation.快速“合并”模拟。
BMC Genet. 2006 Mar 15;7:16. doi: 10.1186/1471-2156-7-16.
2
A sequential coalescent algorithm for chromosomal inversions.一种用于染色体倒位的序列凝聚算法。
Heredity (Edinb). 2013 Sep;111(3):200-9. doi: 10.1038/hdy.2013.38. Epub 2013 May 1.
3
Critical assessment of coalescent simulators in modeling recombination hotspots in genomic sequences.对基因组序列中重组热点建模的合并模拟器的批判性评估。
BMC Bioinformatics. 2014 Jan 3;15:3. doi: 10.1186/1471-2105-15-3.
4
The Bacterial Sequential Markov Coalescent.细菌序列马尔可夫合并过程
Genetics. 2017 May;206(1):333-343. doi: 10.1534/genetics.116.198796. Epub 2017 Mar 3.
5
Fast and flexible simulation of DNA sequence data.DNA序列数据的快速灵活模拟。
Genome Res. 2009 Jan;19(1):136-42. doi: 10.1101/gr.083634.108. Epub 2008 Nov 24.
6
Approximating the coalescent with recombination.用重组近似溯祖过程。
Philos Trans R Soc Lond B Biol Sci. 2005 Jul 29;360(1459):1387-93. doi: 10.1098/rstb.2005.1673.
7
A new method for modeling coalescent processes with recombination.一种新的带有重组的合并过程建模方法。
BMC Bioinformatics. 2014 Aug 11;15(1):273. doi: 10.1186/1471-2105-15-273.
8
Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes.大样本量的高效合并模拟和谱系分析
PLoS Comput Biol. 2016 May 4;12(5):e1004842. doi: 10.1371/journal.pcbi.1004842. eCollection 2016 May.
9
scrm: efficiently simulating long sequences using the approximated coalescent with recombination.scrm:使用带重组的近似合并过程高效模拟长序列。
Bioinformatics. 2015 May 15;31(10):1680-2. doi: 10.1093/bioinformatics/btu861. Epub 2015 Jan 8.
10
Sequential Markov coalescent algorithms for population models with demographic structure.用于具有人口结构的种群模型的序贯马尔可夫合并算法。
Theor Popul Biol. 2009 Sep;76(2):84-91. doi: 10.1016/j.tpb.2009.05.002. Epub 2009 May 9.

引用本文的文献

1
Robust and accurate Bayesian inference of genome-wide genealogies for hundreds of genomes.针对数百个基因组的全基因组谱系进行稳健且准确的贝叶斯推断。
Nat Genet. 2025 Sep 8. doi: 10.1038/s41588-025-02317-9.
2
The Length of Haplotype Blocks and Signals of Structural Variation in Reconstructed Genealogies.重构谱系中单体型块的长度及结构变异信号
Mol Biol Evol. 2025 Sep 1;42(9). doi: 10.1093/molbev/msaf190.
3
Recent Statistical Innovations in Human Genetics.人类遗传学领域的最新统计创新

本文引用的文献

1
Calibrating a coalescent simulation of human genome sequence variation.校准人类基因组序列变异的合并模拟。
Genome Res. 2005 Nov;15(11):1576-83. doi: 10.1101/gr.3709305.
2
Approximating the coalescent with recombination.用重组近似溯祖过程。
Philos Trans R Soc Lond B Biol Sci. 2005 Jul 29;360(1459):1387-93. doi: 10.1098/rstb.2005.1673.
3
Linkage disequilibrium mapping via cladistic analysis of single-nucleotide polymorphism haplotypes.通过单核苷酸多态性单倍型的分支分析进行连锁不平衡作图。
Ann Hum Genet. 2025 Sep;89(5):241-254. doi: 10.1111/ahg.12606. Epub 2025 Jun 27.
4
Assessing simulation-based supervised machine learning for demographic parameter inference from genomic data.评估基于模拟的监督式机器学习用于从基因组数据推断人口统计学参数。
Heredity (Edinb). 2025 Jun 6. doi: 10.1038/s41437-025-00773-x.
5
A structured coalescent model reveals deep ancestral structure shared by all modern humans.一个结构化的溯祖模型揭示了所有现代人类共有的深层祖先结构。
Nat Genet. 2025 Apr;57(4):856-864. doi: 10.1038/s41588-025-02117-1. Epub 2025 Mar 18.
6
Likelihoods for a general class of ARGs under the SMC.在顺序蒙特卡罗方法下一类一般的祖先重组图的似然性。
bioRxiv. 2025 Feb 27:2025.02.24.639977. doi: 10.1101/2025.02.24.639977.
7
Inference and applications of ancestral recombination graphs.祖先重组图的推断与应用
Nat Rev Genet. 2025 Jan;26(1):47-58. doi: 10.1038/s41576-024-00772-4. Epub 2024 Sep 30.
8
Exact Decoding of a Sequentially Markov Coalescent Model in Genetics.遗传学中顺序马尔可夫合并模型的精确解码
J Am Stat Assoc. 2024;119(547):2242-2255. doi: 10.1080/01621459.2023.2252570. Epub 2023 Oct 3.
9
Improved inference of population histories by integrating genomic and epigenomic data.通过整合基因组和表观基因组数据来改进群体历史推断。
Elife. 2024 Sep 12;12:RP89470. doi: 10.7554/eLife.89470.
10
Global and Local Ancestry and its Importance: A Review.全球和本地血统及其重要性:综述
Curr Genomics. 2024;25(4):237-260. doi: 10.2174/0113892029298909240426094055. Epub 2024 May 9.
Am J Hum Genet. 2004 Jul;75(1):35-43. doi: 10.1086/422174. Epub 2004 May 13.
4
Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data.利用单核苷酸多态性数据对连锁不平衡进行建模并识别重组热点。
Genetics. 2003 Dec;165(4):2213-33. doi: 10.1093/genetics/165.4.2213.
5
The structure of haplotype blocks in the human genome.人类基因组中单倍型块的结构。
Science. 2002 Jun 21;296(5576):2225-9. doi: 10.1126/science.1069424. Epub 2002 May 23.
6
Generating samples under a Wright-Fisher neutral model of genetic variation.在遗传变异的赖特-费希尔中性模型下生成样本。
Bioinformatics. 2002 Feb;18(2):337-8. doi: 10.1093/bioinformatics/18.2.337.
7
Two-locus sampling distributions and their application.两位点抽样分布及其应用。
Genetics. 2001 Dec;159(4):1805-17. doi: 10.1093/genetics/159.4.1805.
8
A comparison of estimators of the population recombination rate.群体重组率估计值的比较。
Mol Biol Evol. 2000 Jan;17(1):156-63. doi: 10.1093/oxfordjournals.molbev.a026228.
9
Recombination as a point process along sequences.作为沿序列的点过程的重组。
Theor Popul Biol. 1999 Jun;55(3):248-59. doi: 10.1006/tpbi.1998.1403.
10
The ancestry of a sample of sequences subject to recombination.一个经历重组的序列样本的祖先。
Genetics. 1999 Mar;151(3):1217-28. doi: 10.1093/genetics/151.3.1217.