• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从统计遗传学和系统发生学的角度出发,为结构群体中的表型映射建立统一方法。

Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations.

机构信息

Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America.

Department of Biological Sciences, University of Southern California, Los Angeles, California, United States of America.

出版信息

PLoS Biol. 2024 Oct 9;22(10):e3002847. doi: 10.1371/journal.pbio.3002847. eCollection 2024 Oct.

DOI:10.1371/journal.pbio.3002847
PMID:39383205
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11493298/
Abstract

In both statistical genetics and phylogenetics, a major goal is to identify correlations between genetic loci or other aspects of the phenotype or environment and a focal trait. In these 2 fields, there are sophisticated but disparate statistical traditions aimed at these tasks. The disconnect between their respective approaches is becoming untenable as questions in medicine, conservation biology, and evolutionary biology increasingly rely on integrating data from within and among species, and once-clear conceptual divisions are becoming increasingly blurred. To help bridge this divide, we lay out a general model describing the covariance between the genetic contributions to the quantitative phenotypes of different individuals. Taking this approach shows that standard models in both statistical genetics (e.g., genome-wide association studies; GWAS) and phylogenetic comparative biology (e.g., phylogenetic regression) can be interpreted as special cases of this more general quantitative-genetic model. The fact that these models share the same core architecture means that we can build a unified understanding of the strengths and limitations of different methods for controlling for genetic structure when testing for associations. We develop intuition for why and when spurious correlations may occur analytically and conduct population-genetic and phylogenetic simulations of quantitative traits. The structural similarity of problems in statistical genetics and phylogenetics enables us to take methodological advances from one field and apply them in the other. We demonstrate by showing how a standard GWAS technique-including both the genetic relatedness matrix (GRM) as well as its leading eigenvectors, corresponding to the principal components of the genotype matrix, in a regression model-can mitigate spurious correlations in phylogenetic analyses. As a case study, we re-examine an analysis testing for coevolution of expression levels between genes across a fungal phylogeny and show that including eigenvectors of the covariance matrix as covariates decreases the false positive rate while simultaneously increasing the true positive rate. More generally, this work provides a foundation for more integrative approaches for understanding the genetic architecture of phenotypes and how evolutionary processes shape it.

摘要

在统计遗传学和系统发育学中,一个主要目标是确定遗传基因座或表型或环境的其他方面与焦点性状之间的相关性。在这两个领域中,存在着针对这些任务的复杂但不同的统计传统。随着医学、保护生物学和进化生物学中的问题越来越依赖于整合来自物种内部和物种之间的数据,以及曾经清晰的概念性划分变得越来越模糊,它们各自方法之间的脱节变得不可持续。为了帮助弥合这一鸿沟,我们提出了一个通用模型,描述了不同个体的数量性状遗传贡献之间的协方差。采用这种方法表明,统计遗传学中的标准模型(例如全基因组关联研究;GWAS)和系统发育比较生物学中的标准模型(例如系统发育回归)可以被解释为这个更通用的数量遗传模型的特例。这些模型具有相同的核心架构这一事实意味着,当我们检验关联时,我们可以对不同方法控制遗传结构的优缺点建立统一的理解。我们从理论上分析了为什么以及何时会出现虚假相关,并对数量性状进行了群体遗传学和系统发育学模拟。统计遗传学和系统发育学中问题的结构相似性使我们能够从一个领域采用方法进步并将其应用于另一个领域。我们通过展示如何在系统发育分析中减轻虚假相关性来证明这一点,包括标准的 GWAS 技术——包括遗传相关矩阵(GRM)以及回归模型中基因型矩阵的主要特征向量,对应于主成分——都可以减轻系统发育分析中的虚假相关性。作为一个案例研究,我们重新检验了一项测试跨真菌系统发育中基因表达水平共进化的分析,并表明包括协方差矩阵的特征向量作为协变量可以降低假阳性率,同时提高真阳性率。更一般地说,这项工作为理解表型的遗传结构以及进化过程如何塑造它提供了一个更具综合性的方法基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/8dcb6d8e56b2/pbio.3002847.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/3e48b9b6a049/pbio.3002847.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/f69ccb56f9fa/pbio.3002847.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/fe2dc91da77f/pbio.3002847.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/8dcb6d8e56b2/pbio.3002847.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/3e48b9b6a049/pbio.3002847.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/f69ccb56f9fa/pbio.3002847.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/fe2dc91da77f/pbio.3002847.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/991e/11493298/8dcb6d8e56b2/pbio.3002847.g004.jpg

相似文献

1
Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations.从统计遗传学和系统发生学的角度出发,为结构群体中的表型映射建立统一方法。
PLoS Biol. 2024 Oct 9;22(10):e3002847. doi: 10.1371/journal.pbio.3002847. eCollection 2024 Oct.
2
Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations.整合统计遗传学和系统发育学方法以定位结构化群体中的表型。
bioRxiv. 2024 Mar 7:2024.02.10.579721. doi: 10.1101/2024.02.10.579721.
3
An alternative covariance estimator to investigate genetic heterogeneity in populations.一种用于研究群体遗传异质性的替代协方差估计器。
Genet Sel Evol. 2015 Nov 26;47:93. doi: 10.1186/s12711-015-0171-z.
4
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
5
Linkage Disequilibrium and Evaluation of Genome-Wide Association Mapping Models in Tetraploid Potato.四倍体马铃薯的连锁不平衡及全基因组关联作图模型评估
G3 (Bethesda). 2018 Oct 3;8(10):3185-3202. doi: 10.1534/g3.118.200377.
6
[Foundations of the new phylogenetics].[新系统发育学的基础]
Zh Obshch Biol. 2004 Jul-Aug;65(4):334-66.
7
Inferring trait-specific similarity among individuals from molecular markers and phenotypes with Bayesian regression.基于贝叶斯回归推断个体的分子标记和表型特征之间的特异性相似性。
Theor Popul Biol. 2020 Apr;132:47-59. doi: 10.1016/j.tpb.2019.11.008. Epub 2019 Dec 9.
8
A robust statistical method for association-based eQTL analysis.一种稳健的基于关联的 eQTL 分析统计方法。
PLoS One. 2011;6(8):e23192. doi: 10.1371/journal.pone.0023192. Epub 2011 Aug 9.
9
Effect of genetic architecture on the prediction accuracy of quantitative traits in samples of unrelated individuals.遗传结构对非相关个体样本中数量性状预测准确性的影响。
Heredity (Edinb). 2018 Jun;120(6):500-514. doi: 10.1038/s41437-017-0043-0. Epub 2018 Feb 10.
10
On Using Local Ancestry to Characterize the Genetic Architecture of Human Traits: Genetic Regulation of Gene Expression in Multiethnic or Admixed Populations.利用局部亲缘关系刻画人类性状的遗传结构:多民族或混合人群中基因表达的遗传调控。
Am J Hum Genet. 2019 Jun 6;104(6):1097-1115. doi: 10.1016/j.ajhg.2019.04.009. Epub 2019 May 16.

引用本文的文献

1
From Trees to Traits: A Review of Advances in PhyloG2P Methods and Future Directions.从树到性状:系统发育基因到性状(PhyloG2P)方法的进展回顾与未来方向
Genome Biol Evol. 2025 Sep 2;17(9). doi: 10.1093/gbe/evaf150.
2
A genealogy-based approach for revealing ancestry-specific structures in admixed populations.一种基于系谱学的方法,用于揭示混合群体中特定祖先的结构。
Am J Hum Genet. 2025 Jul 17. doi: 10.1016/j.ajhg.2025.06.016.
3
Convergent expansions of keystone gene families drive metabolic innovation in Saccharomycotina yeasts.关键基因家族的趋同扩张驱动了酵母亚门酵母的代谢创新。

本文引用的文献

1
Interpreting population- and family-based genome-wide association studies in the presence of confounding.在存在混杂的情况下解释基于人群和家庭的全基因组关联研究。
PLoS Biol. 2024 Apr 11;22(4):e3002511. doi: 10.1371/journal.pbio.3002511. eCollection 2024 Apr.
2
MESuSiE enables scalable and powerful multi-ancestry fine-mapping of causal variants in genome-wide association studies.MESuSiE 可实现全基因组关联研究中因果变异的可扩展和强大的多祖系精细映射。
Nat Genet. 2024 Jan;56(1):170-179. doi: 10.1038/s41588-023-01604-7. Epub 2024 Jan 2.
3
Tree-based QTL mapping with expected local genetic relatedness matrices.
Proc Natl Acad Sci U S A. 2025 Jun 10;122(23):e2500165122. doi: 10.1073/pnas.2500165122. Epub 2025 Jun 3.
4
Evolutionary accumulation modeling in AMR: machine learning to infer and predict evolutionary dynamics of multi-drug resistance.抗菌药物耐药性中的进化积累建模:用于推断和预测多重耐药性进化动态的机器学习
mBio. 2025 Jun 11;16(6):e0048825. doi: 10.1128/mbio.00488-25. Epub 2025 May 21.
5
Testing for differences in polygenic scores in the presence of confounding.在存在混杂因素的情况下对多基因分数差异进行检测。
Genetics. 2025 Jun 4;230(2). doi: 10.1093/genetics/iyaf071.
6
On ARGs, pedigrees, and genetic relatedness matrices.关于抗菌药物耐药基因、谱系和遗传相关性矩阵。
bioRxiv. 2025 Mar 5:2025.03.03.641310. doi: 10.1101/2025.03.03.641310.
7
Error rates in QST-FST comparisons depend on genetic architecture and estimation procedures.QST与FST比较中的错误率取决于遗传结构和估计程序。
Genetics. 2025 Apr 17;229(4). doi: 10.1093/genetics/iyaf034.
8
A Litmus Test for Confounding in Polygenic Scores.多基因评分中混杂因素的石蕊试验
bioRxiv. 2025 Feb 4:2025.02.01.635985. doi: 10.1101/2025.02.01.635985.
9
A Tale of Too Many Trees: A Conundrum for Phylogenetic Regression.树木过多的故事:系统发育回归的难题
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf032.
10
Error rates in - comparisons depend on genetic architecture and estimation procedures.在……比较中的错误率取决于遗传结构和估计程序。 (原文中“in - comparisons”部分有缺失内容)
bioRxiv. 2024 Nov 1:2024.10.28.620737. doi: 10.1101/2024.10.28.620737.
基于树的 QTL 作图与预期局部遗传相关性矩阵。
Am J Hum Genet. 2023 Dec 7;110(12):2077-2091. doi: 10.1016/j.ajhg.2023.10.017.
4
Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution.评估用于基因表达进化的常用系统发育模型的性能。
Genome Biol Evol. 2023 Dec 1;15(12). doi: 10.1093/gbe/evad211.
5
DNA language models are powerful predictors of genome-wide variant effects.DNA 语言模型是全基因组变异效应的有力预测因子。
Proc Natl Acad Sci U S A. 2023 Oct 31;120(44):e2311219120. doi: 10.1073/pnas.2311219120. Epub 2023 Oct 26.
6
Fluctuating selection maintains distinct species phenotypes in an ecological community in the wild.波动选择在野外生态群落中维持独特的物种表型。
Proc Natl Acad Sci U S A. 2023 Oct 17;120(42):e2222071120. doi: 10.1073/pnas.2222071120. Epub 2023 Oct 9.
7
PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance.PhyloAcc-GT:一种贝叶斯方法,用于推断靶向谱系中替代率转移模式,同时考虑基因树分歧。
Mol Biol Evol. 2023 Sep 1;40(9). doi: 10.1093/molbev/msad195.
8
The Cauchy Process on Phylogenies: A Tractable Model for Pulsed Evolution.系统发育树上的柯西过程:脉冲进化的一个可处理模型。
Syst Biol. 2023 Dec 30;72(6):1296-1315. doi: 10.1093/sysbio/syad053.
9
On the Decoupling of Evolutionary Changes in mRNA and Protein Levels.mRNA 和蛋白质水平进化变化的解耦。
Mol Biol Evol. 2023 Aug 3;40(8). doi: 10.1093/molbev/msad169.
10
The landscape of tolerated genetic variation in humans and primates.人类和灵长类动物中可耐受遗传变异的景观。
Science. 2023 Jun 2;380(6648):eabn8153. doi: 10.1126/science.abn8197.