Suppr超能文献

个体祖先系数的快速高效估计。

Fast and efficient estimation of individual ancestry coefficients.

作者信息

Frichot Eric, Mathieu François, Trouillon Théo, Bouchard Guillaume, François Olivier

机构信息

Université Joseph Fourier Grenoble 1, Centre National de la Recherche Scientifique, Techniques de l'Ingénierie Médicale et de la Complexité - Informatique, Mathématiques et Applications, Grenoble Unité Mixte de Recherche 5525, 38042 Grenoble, France.

出版信息

Genetics. 2014 Apr;196(4):973-83. doi: 10.1534/genetics.113.160572. Epub 2014 Feb 4.

Abstract

Inference of individual ancestry coefficients, which is important for population genetic and association studies, is commonly performed using computer-intensive likelihood algorithms. With the availability of large population genomic data sets, fast versions of likelihood algorithms have attracted considerable attention. Reducing the computational burden of estimation algorithms remains, however, a major challenge. Here, we present a fast and efficient method for estimating individual ancestry coefficients based on sparse nonnegative matrix factorization algorithms. We implemented our method in the computer program sNMF and applied it to human and plant data sets. The performances of sNMF were then compared to the likelihood algorithm implemented in the computer program ADMIXTURE. Without loss of accuracy, sNMF computed estimates of ancestry coefficients with runtimes ∼10-30 times shorter than those of ADMIXTURE.

摘要

个体祖先系数的推断对群体遗传学和关联研究很重要,通常使用计算量较大的似然算法来进行。随着大量群体基因组数据集的出现,似然算法的快速版本已引起了相当大的关注。然而,降低估计算法的计算负担仍然是一个重大挑战。在此,我们提出了一种基于稀疏非负矩阵分解算法来估计个体祖先系数的快速有效方法。我们在计算机程序sNMF中实现了我们的方法,并将其应用于人类和植物数据集。然后将sNMF的性能与计算机程序ADMIXTURE中实现的似然算法进行比较。在不损失准确性的情况下,sNMF计算祖先系数估计值的运行时间比ADMIXTURE短约10 - 30倍。

相似文献

1
Fast and efficient estimation of individual ancestry coefficients.
Genetics. 2014 Apr;196(4):973-83. doi: 10.1534/genetics.113.160572. Epub 2014 Feb 4.
2
Fast model-based estimation of ancestry in unrelated individuals.
Genome Res. 2009 Sep;19(9):1655-64. doi: 10.1101/gr.094052.109. Epub 2009 Jul 31.
4
Enhancements to the ADMIXTURE algorithm for individual ancestry estimation.
BMC Bioinformatics. 2011 Jun 18;12:246. doi: 10.1186/1471-2105-12-246.
5
TESS3: fast inference of spatial population structure and genome scans for selection.
Mol Ecol Resour. 2016 Mar;16(2):540-8. doi: 10.1111/1755-0998.12471. Epub 2015 Oct 17.
6
A novel and fast approach for population structure inference using kernel-PCA and optimization.
Genetics. 2014 Dec;198(4):1421-31. doi: 10.1534/genetics.114.171314. Epub 2014 Oct 16.
7
A fast least-squares algorithm for population inference.
BMC Bioinformatics. 2013 Jan 23;14:28. doi: 10.1186/1471-2105-14-28.
8
Estimating individual admixture proportions from next generation sequencing data.
Genetics. 2013 Nov;195(3):693-702. doi: 10.1534/genetics.113.154138. Epub 2013 Sep 11.
9
A fast and reliable computational method for estimating population genetic parameters.
Genetics. 2008 Jun;179(2):951-63. doi: 10.1534/genetics.108.087049. Epub 2008 May 27.
10
Unsupervised discovery of ancestry-informative markers and genetic admixture proportions in biobank-scale datasets.
Am J Hum Genet. 2023 Feb 2;110(2):314-325. doi: 10.1016/j.ajhg.2022.12.008. Epub 2023 Jan 6.

引用本文的文献

1
Genomic and Morphological Evidence Support Contemporary Three-Way Interspecific Hybridization in Ranid Frogs.
Ecol Evol. 2025 Aug 31;15(9):e72035. doi: 10.1002/ece3.72035. eCollection 2025 Sep.
2
Exploring the diversity of three Northern Atlantic sea beet populations.
Front Plant Sci. 2025 Aug 14;16:1635602. doi: 10.3389/fpls.2025.1635602. eCollection 2025.
3
Hybridization Resulted in Shifts from Dioecy to Monoecy in Weeping Willows ( L.).
Genes (Basel). 2025 Aug 13;16(8):958. doi: 10.3390/genes16080958.
5
Population structure in a fungal human pathogen is potentially linked to pathogenicity.
Nat Commun. 2025 Aug 15;16(1):7594. doi: 10.1038/s41467-025-62777-9.
7
Genome-wide association analysis of grain iron and zinc in rice grown under agroclimatic sites with contrasting soil iron status.
Front Plant Sci. 2025 Jul 7;16:1501878. doi: 10.3389/fpls.2025.1501878. eCollection 2025.
8
Interplay between large low-recombining regions and pseudo-overdominance in a plant genome.
Nat Commun. 2025 Jul 12;16(1):6458. doi: 10.1038/s41467-025-61529-z.
9
Population Phylogenomics and Genetic Structure of the Polyphagous Leafminer, (Burgess) (Diptera: Agromyzidae).
Evol Appl. 2025 Jul 9;18(7):e70132. doi: 10.1111/eva.70132. eCollection 2025 Jul.
10

本文引用的文献

1
Testing for associations between loci and environmental gradients using latent factor mixed models.
Mol Biol Evol. 2013 Jul;30(7):1687-99. doi: 10.1093/molbev/mst063. Epub 2013 Mar 29.
2
A fast least-squares algorithm for population inference.
BMC Bioinformatics. 2013 Jan 23;14:28. doi: 10.1186/1471-2105-14-28.
3
Correcting principal component maps for effects of spatial autocorrelation in population genetic data.
Front Genet. 2012 Nov 20;3:254. doi: 10.3389/fgene.2012.00254. eCollection 2012.
4
An integrated map of genetic variation from 1,092 human genomes.
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
5
Ancient admixture in human history.
Genetics. 2012 Nov;192(3):1065-93. doi: 10.1534/genetics.112.145037. Epub 2012 Sep 7.
6
Population identification using genetic data.
Annu Rev Genomics Hum Genet. 2012;13:337-61. doi: 10.1146/annurev-genom-082410-101510. Epub 2012 Jun 11.
7
Inference of population structure using dense haplotype data.
PLoS Genet. 2012 Jan;8(1):e1002453. doi: 10.1371/journal.pgen.1002453. Epub 2012 Jan 26.
8
Enhancements to the ADMIXTURE algorithm for individual ancestry estimation.
BMC Bioinformatics. 2011 Jun 18;12:246. doi: 10.1186/1471-2105-12-246.
9
Analysis of population structure: a unifying framework and novel methods based on sparse factor analysis.
PLoS Genet. 2010 Sep 16;6(9):e1001117. doi: 10.1371/journal.pgen.1001117.
10
Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.
Nature. 2010 Jun 3;465(7298):627-31. doi: 10.1038/nature08800. Epub 2010 Mar 24.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验