FSTruct: An FST-based tool for measuring ancestry variation in inference of population structure.

Affiliations

Department of Biology, Stanford University, Stanford, California, USA.

Rare Cancers Genomics Team (RCG), Genomic Epidemiology Branch (GEM), International Agency for Research on Cancer/World Health Organisation (IARC/WHO), Lyon, France.

Publication information

Mol Ecol Resour. 2022 Oct;22(7):2614-2626. doi: 10.1111/1755-0998.13647. Epub 2022 Jul 20.

DOI:10.1111/1755-0998.13647

PMID:35596736

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9544611/

Abstract

In model-based inference of population structure from individual-level genetic data, individuals are assigned membership coefficients in a series of statistical clusters generated by clustering algorithms. Distinct patterns of variability in membership coefficients can be produced for different groups of individuals, for example, representing different predefined populations, sampling sites or time periods. Such variability can be difficult to capture in a single numerical value; membership coefficient vectors are multivariate and potentially incommensurable across predefined groups, as the number of clusters over which individuals are distributed can vary among groups of interest. Further, two groups might share few clusters in common, so that membership coefficient vectors are concentrated on different clusters. We introduce a method for measuring the variability of membership coefficients of individuals in a predefined group, making use of an analogy between variability across individuals in membership coefficient vectors and variation across populations in allele frequency vectors. We show that in a model in which membership coefficient vectors in a population follow a Dirichlet distribution, the measure increases linearly with a parameter describing the variance of a specified component of the membership vector and does not depend on its mean. We apply the approach, which makes use of a normalized FST statistic, to data on inferred population structure in three example scenarios. We also introduce a bootstrap test for equivalence of two or more predefined groups in their level of membership coefficient variability. Our methods are implemented in the R package FSTruct.
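The analogy described in the abstract can be illustrated with a minimal Python sketch. It is not the FSTruct R package and does not reproduce the paper's exact statistic. It assumes the standard ratio FST = (H_T - H_S) / H_T, with each individual playing the role of a population and each cluster playing the role of an allele. The function names `fst_from_memberships` and `bootstrap_fst` are hypothetical, the normalization by the maximum attainable FST is omitted, and the resampling step only shows the generic idea of bootstrapping individuals within a group, not the paper's test.

```python
import random


def fst_from_memberships(q_rows):
    """Unnormalized FST-style variability of membership vectors.

    q_rows: list of per-individual membership vectors; each vector is
    assumed to be non-negative and to sum to 1.
    """
    n = len(q_rows)
    k = len(q_rows[0])
    # Mean membership in each cluster across individuals.
    mean_q = [sum(row[j] for row in q_rows) / n for j in range(k)]
    # Average within-individual sum of squared memberships (1 - H_S).
    mean_sq = sum(sum(x * x for x in row) for row in q_rows) / n
    # Sum of squared mean memberships (1 - H_T).
    pooled_sq = sum(m * m for m in mean_q)
    if pooled_sq >= 1.0:
        # Everyone sits entirely in one cluster: no variability to measure.
        return 0.0
    return (mean_sq - pooled_sq) / (1.0 - pooled_sq)


def bootstrap_fst(q_rows, n_reps=200, seed=0):
    """Resample individuals with replacement and recompute the statistic."""
    rng = random.Random(seed)
    n = len(q_rows)
    replicates = []
    for _ in range(n_reps):
        sample = [q_rows[rng.randrange(n)] for _ in range(n)]
        replicates.append(fst_from_memberships(sample))
    return replicates


# Illustrative inputs (made up for this sketch, not data from the paper).
identical = [[0.5, 0.5]] * 4          # every individual has the same vector
extreme = [[1.0, 0.0], [0.0, 1.0]]    # each individual sits in one cluster

print(fst_from_memberships(identical))
print(fst_from_memberships(extreme))
```

With identical membership vectors the ratio is at its minimum, and with individuals concentrated on different clusters it is at its maximum. Comparing bootstrap replicate distributions from two groups is one plausible way to ask whether their variability differs, under the assumptions stated above.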

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b23/9544611/6715fa63c5a2/MEN-22-2614-g005.jpg

Similar articles

A Dirichlet model of alignment cost in mixed-membership unsupervised clustering.

J Comput Graph Stat. 2023;32(3):1145-1159. doi: 10.1080/10618600.2022.2127739. Epub 2022 Nov 14.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

pong: fast analysis and visualization of latent clusters in population genetic data.

Bioinformatics. 2016 Sep 15;32(18):2817-23. doi: 10.1093/bioinformatics/btw327. Epub 2016 Jun 9.

A unifying study of phenotypic and molecular genetic variability in natural populations of Anadenanthera colubrina var. cebil from Yungas and Paranaense biogeographic provinces in Argentina.

J Genet. 2014 Apr;93(1):123-32. doi: 10.1007/s12041-014-0347-2.

Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.

J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.

Fast and accurate joint inference of coancestry parameters for populations and/or individuals.

PLoS Genet. 2023 Jan 19;19(1):e1010054. doi: 10.1371/journal.pgen.1010054. eCollection 2023 Jan.

That's Not a Hybrid: How to Distinguish Patterns of Admixture and Isolation By Distance.

Mol Ecol Resour. 2025 Apr;25(3):e14039. doi: 10.1111/1755-0998.14039. Epub 2024 Oct 28.

Inference of ancestry: constructing hierarchical reference populations and assigning unknown individuals.

Hum Genomics. 2006 Jan;2(4):212-35. doi: 10.1186/1479-7364-2-4-212.

Accurate inference of subtle population structure (and other genetic discontinuities) using principal coordinates.

PLoS One. 2009;4(1):e4269. doi: 10.1371/journal.pone.0004269. Epub 2009 Jan 27.

Cited by

Ancient DNA connects large-scale migration with the spread of Slavs.

Nature. 2025 Sep 3. doi: 10.1038/s41586-025-09437-6.

Quantifying compositional variability in microbial communities with FAVA.

Proc Natl Acad Sci U S A. 2025 Mar 18;122(11):e2413211122. doi: 10.1073/pnas.2413211122. Epub 2025 Mar 10.

Cultural transmission, networks, and clusters among Austronesian-speaking peoples.

Evol Hum Sci. 2024 Dec 6;6:e51. doi: 10.1017/ehs.2024.45. eCollection 2024.

Genetic Variation and Population Structure of : An In Silico Analysis.

Pathogens. 2024 Nov 13;13(11):991. doi: 10.3390/pathogens13110991.

Understanding admixture fractions: theory and estimation of gene-flow.

J Math Biol. 2024 Oct 4;89(5):47. doi: 10.1007/s00285-024-02146-0.

9,000 years of genetic continuity in southernmost Africa demonstrated at Oakhurst rockshelter.

Nat Ecol Evol. 2024 Nov;8(11):2121-2134. doi: 10.1038/s41559-024-02532-3. Epub 2024 Sep 19.

Quantifying compositional variability in microbial communities with FAVA.

bioRxiv. 2024 Jul 5:2024.07.03.601929. doi: 10.1101/2024.07.03.601929.

An X-STRs analysis of the Iraqi Sorani Kurds.

PLoS One. 2023 Nov 27;18(11):e0294973. doi: 10.1371/journal.pone.0294973. eCollection 2023.

Prolonged delays in human microbiota transmission after a controlled antibiotic perturbation.

bioRxiv. 2023 Oct 30:2023.09.26.559480. doi: 10.1101/2023.09.26.559480.

A genetic and linguistic analysis of the admixture histories of the islands of Cabo Verde.

Elife. 2023 Apr 25;12:e79827. doi: 10.7554/eLife.79827.
