• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

同源片段的快速模拟。

Fast simulation of identity-by-descent segments.

作者信息

Temple Seth D, Browning Sharon R, Thompson Elizabeth A

机构信息

Department of Statistics, University of Washington, Seattle, WA, USA.

Department of Statistics, University of Michigan, Ann Arbor, MI, USA.

出版信息

Bull Math Biol. 2025 May 23;87(7):84. doi: 10.1007/s11538-025-01464-8.

DOI:10.1007/s11538-025-01464-8
PMID:40410602
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12102126/
Abstract

The worst-case runtime complexity to simulate haplotype segments identical by descent (IBD) is quadratic in sample size. We propose two main techniques to reduce the compute time, both of which are motivated by coalescent and recombination processes. We provide mathematical results that explain why our algorithm should outperform a naive implementation with high probability. In our experiments, we observe average compute times to simulate detectable IBD segments around a locus that scale approximately linearly in sample size and take a couple of seconds for sample sizes that are less than 10,000 diploid individuals. In contrast, we find that existing methods to simulate IBD segments take minutes to hours for sample sizes exceeding a few thousand diploid individuals. When using IBD segments to study recent positive selection around a locus, our efficient simulation algorithm makes feasible statistical inferences, e.g., parametric bootstrapping in analyses of large biobanks, that would be otherwise intractable.

摘要

模拟同源相同的单倍型片段(IBD)的最坏情况运行时复杂度与样本量呈二次方关系。我们提出了两种主要技术来减少计算时间,这两种技术均受溯祖和重组过程的启发。我们提供了数学结果,解释了为什么我们的算法很可能优于简单的实现方式。在我们的实验中,我们观察到在一个位点周围模拟可检测的IBD片段的平均计算时间与样本量大致呈线性关系,对于样本量小于10000个二倍体个体的情况,计算时间只需几秒钟。相比之下,我们发现现有模拟IBD片段的方法对于超过几千个二倍体个体的样本量需要数分钟到数小时。当使用IBD片段来研究一个位点周围的近期正选择时,我们高效的模拟算法使得可行的统计推断成为可能,例如在大型生物样本库分析中的参数自举法,否则这将是难以处理的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/b9aafa28bd77/11538_2025_1464_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/6e506e9de41e/11538_2025_1464_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/7c045caba8a7/11538_2025_1464_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/9984412fdd53/11538_2025_1464_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/c528f7184e9d/11538_2025_1464_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/b9aafa28bd77/11538_2025_1464_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/6e506e9de41e/11538_2025_1464_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/7c045caba8a7/11538_2025_1464_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/9984412fdd53/11538_2025_1464_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/c528f7184e9d/11538_2025_1464_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a0d/12102126/b9aafa28bd77/11538_2025_1464_Fig4_HTML.jpg

相似文献

1
Fast simulation of identity-by-descent segments.同源片段的快速模拟。
Bull Math Biol. 2025 May 23;87(7):84. doi: 10.1007/s11538-025-01464-8.
2
Fast simulation of identity-by-descent segments.同源片段的快速模拟。
bioRxiv. 2025 Jan 7:2024.12.13.628449. doi: 10.1101/2024.12.13.628449.
3
Thromboprophylaxis during pregnancy and the puerperium: a systematic review and economic evaluation to estimate the value of future research.妊娠期和产褥期的血栓预防:一项系统评价和经济评估,以估算未来研究的价值。
Health Technol Assess. 2024 Mar;28(9):1-176. doi: 10.3310/DFWT3873.
4
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
5
Identity-by-descent segments in large samples.大样本中的同源片段。
Theor Popul Biol. 2025 Jul 5;165:10-21. doi: 10.1016/j.tpb.2025.06.003.
6
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Individualised gonadotropin dose selection using markers of ovarian reserve for women undergoing in vitro fertilisation plus intracytoplasmic sperm injection (IVF/ICSI).针对接受体外受精加卵胞浆内单精子注射(IVF/ICSI)的女性,使用卵巢储备标志物进行个性化促性腺激素剂量选择。
Cochrane Database Syst Rev. 2018 Feb 1;2(2):CD012693. doi: 10.1002/14651858.CD012693.pub2.
9
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
10
hmmibd-rs: An enhanced hmmIBD implementation for parallelizable identity-by-descent detection from large-scale Plasmodium genomic data.hmmibd-rs:一种用于从大规模疟原虫基因组数据中进行可并行化的同源片段检测的增强型hmmIBD实现。
Res Sq. 2025 Jul 2:rs.3.rs-7004070. doi: 10.21203/rs.3.rs-7004070/v1.

引用本文的文献

1
Multiple-testing corrections in case-control studies using identity-by-descent segments.在使用同源片段的病例对照研究中进行多重检验校正。
bioRxiv. 2025 Jul 7:2025.07.03.663057. doi: 10.1101/2025.07.03.663057.
2
Multiple-testing corrections in selection scans using identity-by-descent segments.使用同源片段在选择扫描中进行多重检验校正。
bioRxiv. 2025 Jan 29:2025.01.29.635528. doi: 10.1101/2025.01.29.635528.
3
Identity-by-descent segments in large samples.大样本中的同源片段

本文引用的文献

1
Identity-by-descent segments in large samples.大样本中的同源片段。
Theor Popul Biol. 2025 Jul 5;165:10-21. doi: 10.1016/j.tpb.2025.06.003.
2
Estimating effective population size trajectories from time-series identity-by-descent segments.通过按血统相同片段的时间序列估计有效种群大小轨迹。
Genetics. 2025 Mar 17;229(3). doi: 10.1093/genetics/iyae212.
3
Modeling recent positive selection using identity-by-descent segments.利用亲缘关系片段进行近期正选择建模。
bioRxiv. 2025 Jan 7:2024.06.05.597656. doi: 10.1101/2024.06.05.597656.
Am J Hum Genet. 2024 Nov 7;111(11):2510-2529. doi: 10.1016/j.ajhg.2024.08.023. Epub 2024 Oct 2.
4
Biobank-scale inference of multi-individual identity by descent and gene conversion.基于个体血缘关系和基因转换的生物银行规模个体推断。
Am J Hum Genet. 2024 Apr 4;111(4):691-700. doi: 10.1016/j.ajhg.2024.02.015. Epub 2024 Mar 20.
5
Strong positive selection biases identity-by-descent-based inferences of recent demography and population structure in Plasmodium falciparum.强烈的正选择偏倚了基于亲缘关系的对疟原虫 falciparum 近期人口动态和种群结构的推断。
Nat Commun. 2024 Mar 20;15(1):2499. doi: 10.1038/s41467-024-46659-0.
6
FiMAP: A fast identity-by-descent mapping test for biobank-scale cohorts.FiMAP:一种用于生物库规模队列的快速基于关系的映射测试。
PLoS Genet. 2023 Dec 1;19(12):e1011057. doi: 10.1371/journal.pgen.1011057. eCollection 2023 Dec.
7
Identity-by-descent-based estimation of the X chromosome effective population size with application to sex-specific demographic history.基于亲缘关系的 X 染色体有效种群大小估计及其在性别特异性人口历史中的应用。
G3 (Bethesda). 2023 Sep 30;13(10). doi: 10.1093/g3journal/jkad165.
8
Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations.扩展 stdpopsim 物种目录,以及从真实基因组模拟中获得的经验教训。
Elife. 2023 Jun 21;12:RP84874. doi: 10.7554/eLife.84874.
9
Deep Learning in Population Genetics.群体遗传学中的深度学习。
Genome Biol Evol. 2023 Feb 3;15(2). doi: 10.1093/gbe/evad008.
10
Selecting Clustering Algorithms for Identity-By-Descent Mapping.选择用于同源定位映射的聚类算法。
Pac Symp Biocomput. 2023;28:121-132.