Comparing local ancestry inference models in populations of two- and three-way admixture.

Authors

Schubert Ryan, Andaleon Angela, Wheeler Heather E

Affiliations

Department of Mathematics and Statistics, Loyola University Chicago, Chicago, IL, United States of America.

Department of Biology, Loyola University Chicago, Chicago, IL, United States of America.

Publication information

PeerJ. 2020 Oct 2;8:e10090. doi: 10.7717/peerj.10090. eCollection 2020.

DOI: 10.7717/peerj.10090
PMID: 33072440
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7537619/

Abstract

Local ancestry estimation infers the regional ancestral origin of chromosomal segments in admixed populations using reference populations and a variety of statistical models. Integrating local ancestry into complex trait genetics has the potential to increase detection of genetic associations and improve genetic prediction models in understudied admixed populations, including African Americans and Hispanics. Five methods for local ancestry estimation that have been used in human complex trait genetics are LAMP-LD (2012), RFMix (2013), ELAI (2014), Loter (2018), and MOSAIC (2019). As users rather than developers, we sought to perform direct comparisons of the accuracy, runtime, memory usage, and usability of these software tools to determine which is best for incorporation into association study pipelines. We find that in the majority of cases RFMix has the highest median accuracy, with the ranking of the remaining software dependent on the ancestral architecture of the population tested. Additionally, we estimate the computational complexity (big-O) of both memory and runtime for each software tool and find that both scale linearly with sample size for most tools. The only exception is RFMix, whose runtime increases quadratically and whose memory increases linearly with sample size. Effective local ancestry estimation tools are necessary to increase diversity and prevent population disparities in human genetics studies. RFMix performs best across methods; however, depending on the application, other methods perform just as well with the benefit of shorter runtimes. Scripts used to format data, run software, and estimate accuracy can be found at https://github.com/WheelerLab/LAI_benchmarking.
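
The abstract's distinction between linear and quadratic growth can be illustrated with a small sketch. This is not the authors' benchmarking code (their scripts are in the linked repository). It is one common way to estimate a scaling exponent, assuming runtime or memory follows a power law in sample size. The function name `scaling_exponent` and all numbers below are hypothetical and synthetic, chosen only for illustration.

```python
import math

def scaling_exponent(sample_sizes, measurements):
    """Least-squares slope of log(measurement) against log(sample size).

    Under the assumed power law y = c * n**k, the slope estimates k:
    a value near 1 suggests linear growth, near 2 quadratic growth.
    """
    xs = [math.log(n) for n in sample_sizes]
    ys = [math.log(y) for y in measurements]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    return covariance / variance

# Synthetic measurements (NOT results from the paper).
sample_sizes = [10, 20, 40, 80, 160]
linear_runtime = [3.0 * n for n in sample_sizes]          # grows like n
quadratic_runtime = [0.5 * n ** 2 for n in sample_sizes]  # grows like n^2

linear_slope = scaling_exponent(sample_sizes, linear_runtime)
quadratic_slope = scaling_exponent(sample_sizes, quadratic_runtime)
print(f"linear case exponent: {linear_slope:.2f}")
print(f"quadratic case exponent: {quadratic_slope:.2f}")
```

With real benchmark timings the fitted exponent will be noisy, so it is best read as a rough indicator rather than a proof of complexity class.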

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/7c915dc2cc6f/peerj-08-10090-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/4f11439016e4/peerj-08-10090-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/3e1fefc183f9/peerj-08-10090-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/fe6d4feced07/peerj-08-10090-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/a23c239d64e8/peerj-08-10090-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/be5c38f1d7e8/peerj-08-10090-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/3cb2979fd6c7/peerj-08-10090-g007.jpg
