• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

R$^{2}$s for Correlated Data: 进化模型、LMMs 和 GLMMs。

R$^{2}$s for Correlated Data: Phylogenetic Models, LMMs, and GLMMs.

机构信息

Department of Integrative Biology, UW-Madison, Madison, WI 53706, USA.

出版信息

Syst Biol. 2019 Mar 1;68(2):234-251. doi: 10.1093/sysbio/syy060.

DOI:10.1093/sysbio/syy060
PMID:30239975
Abstract

Many researchers want to report an $R^{2}$ to measure the variance explained by a model. When the model includes correlation among data, such as phylogenetic models and mixed models, defining an $R^{2}$ faces two conceptual problems. (i) It is unclear how to measure the variance explained by predictor (independent) variables when the model contains covariances. (ii) Researchers may want the $R^{2}$ to include the variance explained by the covariances by asking questions such as "How much of the data is explained by phylogeny?" Here, I investigated three $R^{2}$s for phylogenetic and mixed models. $R^{2}{resid}$ is an extension of the ordinary least-squares $R^{2}$ that weights residuals by variances and covariances estimated by the model; it is closely related to $R^{2}{glmm}$ presented by Nakagawa and Schielzeth (2013. A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol. Evol. 4:133-142). $R^{2}{pred}$ is based on predicting each residual from the fitted model and computing the variance between observed and predicted values. $R^{2}{lik}$ is based on the likelihood of fitted models, and therefore, reflects the amount of information that the models contain. These three $R^{2}$s are formulated as partial $R^{2}$s, making it possible to compare the contributions of predictor variables and variance components (phylogenetic signal and random effects) to the fit of models. Because partial $R^{2}$s compare a full model with a reduced model without components of the full model, they are distinct from marginal $R^{2}$s that partition additive components of the variance. I assessed the properties of the $R^{2}$s for phylogenetic models using simulations for continuous and binary response data (phylogenetic generalized least squares and phylogenetic logistic regression). Because the $R^{2}$s are designed broadly for any model for correlated data, I also compared $R^{2}$s for linear mixed models and generalized linear mixed models. $R^{2}{resid}$, $R^{2}{pred}$, and $R^{2}{lik}$ all have similar performance in describing the variance explained by different components of models. However, $R^{2}{pred}$ gives the most direct answer to the question of how much variance in the data is explained by a model. $R^{2}{resid}$ is most appropriate for comparing models fit to different data sets, because it does not depend on sample sizes. And $R^{2}{lik}$ is most appropriate to assess the importance of different components within the same model applied to the same data, because it is most closely associated with statistical significance tests.

摘要

许多研究人员希望报告一个$R^{2}$值来衡量模型解释的方差。当模型包括数据之间的相关性时,例如系统发育模型和混合模型,定义$R^{2}$值就面临两个概念问题。(i)当模型包含协方差时,如何衡量预测(独立)变量解释的方差是不清楚的。(ii)研究人员可能希望通过询问“系统发育解释了数据的多少?”等问题来让$R^{2}$值包含协方差解释的方差。在这里,我研究了三种用于系统发育和混合模型的$R^{2}$值。$R^{2}{resid}$是普通最小二乘$R^{2}$的扩展,它通过模型估计的方差和协方差对残差进行加权;它与 Nakagawa 和 Schielzeth(2013. 一种从广义线性混合效应模型中获得 R2 的通用且简单的方法。方法生态。进化。4:133-142)提出的$R^{2}{glmm}$密切相关。$R^{2}{pred}$是基于从拟合模型中预测每个残差,并计算观测值和预测值之间的方差。$R^{2}{lik}$基于拟合模型的似然,因此反映了模型包含的信息量。这三个$R^{2}$值被定义为偏$R^{2}$值,这使得比较预测变量和方差分量(系统发育信号和随机效应)对模型拟合的贡献成为可能。由于偏$R^{2}$值将完整模型与不包含完整模型组件的简化模型进行比较,因此它们与将加性方差分量划分的边际$R^{2}$值不同。我使用连续和二进制响应数据(系统发育广义最小二乘法和系统发育逻辑回归)的模拟来评估$R^{2}$值在系统发育模型中的性质。由于$R^{2}$值广泛设计用于任何相关数据模型,因此我还比较了线性混合模型和广义线性混合模型的$R^{2}$值。$R^{2}{resid}$、$R^{2}{pred}$和$R^{2}{lik}$在描述模型不同组成部分解释的方差方面都具有相似的性能。然而,$R^{2}{pred}$最直接地回答了模型解释数据中多少方差的问题。$R^{2}{resid}$最适合于比较拟合不同数据集的模型,因为它不依赖于样本量。而$R^{2}{lik}$最适合于评估同一模型应用于同一数据时不同组件的重要性,因为它与统计显着性检验最密切相关。

相似文献

1
R$^{2}$s for Correlated Data: Phylogenetic Models, LMMs, and GLMMs.R$^{2}$s for Correlated Data: 进化模型、LMMs 和 GLMMs。
Syst Biol. 2019 Mar 1;68(2):234-251. doi: 10.1093/sysbio/syy060.
2
partR2: partitioning R in generalized linear mixed models.第R2部分:广义线性混合模型中对R进行划分
PeerJ. 2021 May 25;9:e11414. doi: 10.7717/peerj.11414. eCollection 2021.
3
Extension of Nakagawa & Schielzeth's to random slopes models.中川和席尔泽特方法向随机斜率模型的扩展。
Methods Ecol Evol. 2014 Sep;5(9):944-946. doi: 10.1111/2041-210X.12225. Epub 2014 Jul 23.
4
Phylogenetic logistic regression for binary dependent variables.用于二项因变量的系统发育逻辑回归。
Syst Biol. 2010 Jan;59(1):9-26. doi: 10.1093/sysbio/syp074. Epub 2009 Nov 4.
5
Phylogeny, regression, and the allometry of physiological traits.系统发育、回归与生理特征的异速生长
Am Nat. 2007 Sep;170(3):431-42. doi: 10.1086/519459. Epub 2007 Jul 24.
6
Meta-analysis of binary outcomes via generalized linear mixed models: a simulation study.基于广义线性混合模型的二分类结局的Meta 分析:一项模拟研究。
BMC Med Res Methodol. 2018 Jul 4;18(1):70. doi: 10.1186/s12874-018-0531-9.
7
Statistical inference in generalized linear mixed models: a review.广义线性混合模型中的统计推断:综述
Br J Math Stat Psychol. 2006 Nov;59(Pt 2):225-55. doi: 10.1348/000711005X79857.
8
A generalized K statistic for estimating phylogenetic signal from shape and other high-dimensional multivariate data.一种用于从形状和其他高维多元数据中估计系统发育信号的广义 K 统计量。
Syst Biol. 2014 Sep;63(5):685-97. doi: 10.1093/sysbio/syu030. Epub 2014 Apr 30.
9
Genetic variance components analysis for binary phenotypes using generalized linear mixed models (GLMMs) and Gibbs sampling.使用广义线性混合模型(GLMMs)和吉布斯采样对二元表型进行遗传方差成分分析。
Genet Epidemiol. 1999;17(2):118-40. doi: 10.1002/(SICI)1098-2272(1999)17:2<118::AID-GEPI3>3.0.CO;2-V.
10
A method for assessing phylogenetic least squares models for shape and other high-dimensional multivariate data.一种用于评估形状及其他高维多元数据的系统发育最小二乘模型的方法。
Evolution. 2014 Sep;68(9):2675-88. doi: 10.1111/evo.12463. Epub 2014 Jul 9.

引用本文的文献

1
Incompatibility between two major innovations shaped the diversification of fish feeding mechanisms.两项重大创新之间的不相容性塑造了鱼类摄食机制的多样化。
PLoS Biol. 2025 Jun 24;23(6):e3003225. doi: 10.1371/journal.pbio.3003225. eCollection 2025 Jun.
2
Birds, Bats or Climate? Eucalypt Floral Traits Reflect Pollination Over Abiotic Environment.鸟类、蝙蝠还是气候?桉树的花部特征反映传粉作用而非非生物环境。
Ecol Evol. 2025 Jun 11;15(6):e71449. doi: 10.1002/ece3.71449. eCollection 2025 Jun.
3
The global spectrum of tree crown architecture.
树冠结构的全球范围。
Nat Commun. 2025 May 26;16(1):4876. doi: 10.1038/s41467-025-60262-x.
4
Strong bat predation and weak environmental constraints predict longer moth tails.强烈的蝙蝠捕食和微弱的环境限制预示着蛾类的尾巴更长。
Proc Biol Sci. 2025 May;292(2046):20242824. doi: 10.1098/rspb.2024.2824. Epub 2025 May 7.
5
Health Stress in Birds Increase with Urbanization in a Large Tropical City.在一个大型热带城市,鸟类的健康压力随着城市化进程而增加。
Ecohealth. 2025 Mar 27. doi: 10.1007/s10393-025-01708-y.
6
Negative global-scale association between genetic diversity and speciation rates in mammals.哺乳动物遗传多样性与物种形成速率之间的全球尺度负相关。
Nat Commun. 2025 Feb 20;16(1):1796. doi: 10.1038/s41467-025-56820-y.
7
Metabolic rate of angiosperm seeds: effects of allometry, phylogeny and bioclimate.被子植物种子的代谢率:异速生长、系统发育和生物气候的影响。
Proc Biol Sci. 2025 Feb;292(2041):20242683. doi: 10.1098/rspb.2024.2683. Epub 2025 Feb 19.
8
Does metabolic rate influence genome-wide amino acid composition in the course of animal evolution?代谢率在动物进化过程中是否会影响全基因组的氨基酸组成?
Evol Lett. 2024 Nov 8;9(1):137-149. doi: 10.1093/evlett/qrae061. eCollection 2025 Feb.
9
A phylogenetic approach to comparative genomics.一种用于比较基因组学的系统发育方法。
Nat Rev Genet. 2025 Jun;26(6):395-405. doi: 10.1038/s41576-024-00803-0. Epub 2025 Jan 8.
10
Size, not phylogeny, explains the morphology of the endosseous labyrinth in the crown clade Crocodylia.体型而非系统发育,解释了冠群鳄目动物骨内迷路的形态。
J Anat. 2025 Apr;246(4):558-574. doi: 10.1111/joa.14170. Epub 2024 Dec 20.