Suppr
超能文献

比较两向和三向混合群体中的本地祖先推断模型。

Comparing local ancestry inference models in populations of two- and three-way admixture.

作者信息

Schubert Ryan, Andaleon Angela, Wheeler Heather E

机构信息

Department of Mathematics and Statistics, Loyola University Chicago, Chicago, IL, United States of America.

Department of Biology, Loyola University Chicago, Chicago, IL, United States of America.

出版信息

PeerJ. 2020 Oct 2;8:e10090. doi: 10.7717/peerj.10090. eCollection 2020.

DOI:10.7717/peerj.10090

PMID:33072440

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7537619/

Abstract

Local ancestry estimation infers the regional ancestral origin of chromosomal segments in admixed populations using reference populations and a variety of statistical models. Integrating local ancestry into complex trait genetics has the potential to increase detection of genetic associations and improve genetic prediction models in understudied admixed populations, including African Americans and Hispanics. Five methods for local ancestry estimation that have been used in human complex trait genetics are LAMP-LD (2012), RFMix (2013), ELAI (2014), Loter (2018), and MOSAIC (2019). As users rather than developers, we sought to perform direct comparisons of accuracy, runtime, memory usage, and usability of these software tools to determine which is best for incorporation into association study pipelines. We find that in the majority of cases RFMix has the highest median accuracy with the ranking of the remaining software dependent on the ancestral architecture of the population tested. Additionally, we estimate the O(n) of both memory and runtime for each software and find that for both time and memory most software increase linearly with respect to sample size. The only exception is RFMix, which increases quadratically with respect to runtime and linearly with respect to memory. Effective local ancestry estimation tools are necessary to increase diversity and prevent population disparities in human genetics studies. RFMix performs the best across methods, however, depending on application, other methods perform just as well with the benefit of shorter runtimes. Scripts used to format data, run software, and estimate accuracy can be found at https://github.com/WheelerLab/LAI_benchmarking.

摘要

本地血统估计利用参考群体和各种统计模型推断混合群体中染色体片段的区域祖先起源。将本地血统整合到复杂性状遗传学中，有可能在包括非裔美国人和西班牙裔在内的研究较少的混合群体中增加遗传关联的检测，并改进遗传预测模型。人类复杂性状遗传学中使用的五种本地血统估计方法是LAMP-LD（2012年）、RFMix（2013年）、ELAI（2014年）、Loter（2018年）和MOSAIC（2019年）。作为用户而非开发者，我们试图对这些软件工具的准确性、运行时间、内存使用和可用性进行直接比较，以确定哪种工具最适合纳入关联研究流程。我们发现，在大多数情况下，RFMix的中位数准确性最高，其余软件的排名取决于所测试群体的祖先结构。此外，我们估计了每个软件的内存和运行时间的O(n)，发现对于时间和内存，大多数软件都随样本量线性增加。唯一的例外是RFMix，其运行时间呈二次方增加，内存呈线性增加。有效的本地血统估计工具对于增加人类遗传学研究中的多样性和防止群体差异是必要的。RFMix在所有方法中表现最佳，然而根据应用情况，其他方法在运行时间较短的情况下表现同样出色。用于格式化数据、运行软件和估计准确性的脚本可在https://github.com/WheelerLab/LAI_benchmarking上找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e5b/7537619/7c915dc2cc6f/peerj-08-10090-g001.jpg

相似文献

Comparing local ancestry inference models in populations of two- and three-way admixture.

PeerJ. 2020 Oct 2;8:e10090. doi: 10.7717/peerj.10090. eCollection 2020.

Loter: A Software Package to Infer Local Ancestry for a Wide Range of Species.

Mol Biol Evol. 2018 Sep 1;35(9):2318-2326. doi: 10.1093/molbev/msy126.

Putting RFMix and ADMIXTURE to the test in a complex admixed population.

BMC Genet. 2020 Apr 7;21(1):40. doi: 10.1186/s12863-020-00845-3.

Local Ancestry Inference Based on Population-Specific Single-Nucleotide Polymorphisms-A Study of Admixed Populations in the 1000 Genomes Project.

Genes (Basel). 2024 Aug 21;15(8):1099. doi: 10.3390/genes15081099.

An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.

BMC Genomics. 2019 Dec 30;20(Suppl 12):1007. doi: 10.1186/s12864-019-6333-6.

RFMix-reader: Accelerated reading and processing for local ancestry studies.

bioRxiv. 2024 Jul 18:2024.07.13.603370. doi: 10.1101/2024.07.13.603370.

Rye: genetic ancestry inference at biobank scale.

Nucleic Acids Res. 2023 May 8;51(8):e44. doi: 10.1093/nar/gkad149.

Assessing the limits of local ancestry inference from small reference panels.

Mol Ecol Resour. 2024 Aug;24(6):e13981. doi: 10.1111/1755-0998.13981. Epub 2024 May 22.

LAIT: a local ancestry inference toolkit.

BMC Genet. 2017 Sep 6;18(1):83. doi: 10.1186/s12863-017-0546-y.

AncestryGrapher toolkit: Python command-line pipelines to visualize global- and local- ancestry inferences from the RFMIX version 2 software.

Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae616.

引用本文的文献

Genome-wide association analyses reveal susceptibility variants linked to Parkinson's disease in the South African population using inferred global and local ancestry.

medRxiv. 2025 Aug 2:2025.08.01.25331910. doi: 10.1101/2025.08.01.25331910.

Phase-free local ancestry inference mitigates the impact of switch errors on phase-based methods.

G3 (Bethesda). 2025 Aug 6;15(8). doi: 10.1093/g3journal/jkaf122.

Opportunities and challenges of local ancestry in genetic association analyses.

Am J Hum Genet. 2025 Apr 3;112(4):727-740. doi: 10.1016/j.ajhg.2025.03.004.

Computational Genomics and Its Applications to Anthropological Questions.

Am J Biol Anthropol. 2024 Dec;186 Suppl 78(Suppl 78):e70010. doi: 10.1002/ajpa.70010.

Old vs. New Local Ancestry Inference in HCHS/SOL: A Comparative Study.

bioRxiv. 2025 Feb 8:2025.02.04.636481. doi: 10.1101/2025.02.04.636481.

Potential Adaptive Introgression From Dogs in Iberian Grey Wolves (Canis lupus).

Mol Ecol. 2025 Jun;34(12):e17639. doi: 10.1111/mec.17639. Epub 2025 Jan 10.

The expected polygenic risk score (ePRS) framework: an equitable metric for quantifying polygenetic risk via modeling of ancestral makeup.

medRxiv. 2024 Dec 20:2024.03.05.24303738. doi: 10.1101/2024.03.05.24303738.

Characterizing features affecting local ancestry inference performance in admixed populations.

Am J Hum Genet. 2025 Feb 6;112(2):224-234. doi: 10.1016/j.ajhg.2024.12.005. Epub 2025 Jan 2.

Characterizing features affecting local ancestry inference performance in admixed populations.

bioRxiv. 2024 Aug 27:2024.08.26.609770. doi: 10.1101/2024.08.26.609770.

Local Ancestry Inference Based on Population-Specific Single-Nucleotide Polymorphisms-A Study of Admixed Populations in the 1000 Genomes Project.

Genes (Basel). 2024 Aug 21;15(8):1099. doi: 10.3390/genes15081099.

本文引用的文献

Fine-Scale Inference of Ancestry Segments Without Prior Knowledge of Admixing Groups.

Genetics. 2019 Jul;212(3):869-889. doi: 10.1534/genetics.119.302139. Epub 2019 May 23.

On Using Local Ancestry to Characterize the Genetic Architecture of Human Traits: Genetic Regulation of Gene Expression in Multiethnic or Admixed Populations.

Am J Hum Genet. 2019 Jun 6;104(6):1097-1115. doi: 10.1016/j.ajhg.2019.04.009. Epub 2019 May 16.

The Missing Diversity in Human Genetic Studies.

Cell. 2019 Mar 21;177(1):26-31. doi: 10.1016/j.cell.2019.02.048.

Genetic architecture of gene expression traits across diverse populations.

PLoS Genet. 2018 Aug 10;14(8):e1007586. doi: 10.1371/journal.pgen.1007586. eCollection 2018 Aug.

A comprehensive survey of models for dissecting local ancestry deconvolution in human genome.

Brief Bioinform. 2019 Sep 27;20(5):1709-1724. doi: 10.1093/bib/bby044.

Loter: A Software Package to Infer Local Ancestry for a Wide Range of Species.

Mol Biol Evol. 2018 Sep 1;35(9):2318-2326. doi: 10.1093/molbev/msy126.

Properties of global- and local-ancestry adjustments in genetic association tests in admixed populations.

Genet Epidemiol. 2018 Mar;42(2):214-229. doi: 10.1002/gepi.22103. Epub 2017 Dec 30.

A robust and powerful two-step testing procedure for local ancestry adjusted allelic association analysis in admixed populations.

Genet Epidemiol. 2018 Apr;42(3):288-302. doi: 10.1002/gepi.22104. Epub 2017 Dec 10.

LAIT: a local ancestry inference toolkit.

BMC Genet. 2017 Sep 6;18(1):83. doi: 10.1186/s12863-017-0546-y.

Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations.

Am J Hum Genet. 2017 Apr 6;100(4):635-649. doi: 10.1016/j.ajhg.2017.03.004. Epub 2017 Mar 30.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

比较两向和三向混合群体中的本地祖先推断模型。

Comparing local ancestry inference models in populations of two- and three-way admixture.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译