Department of Psychology.
Department of Psychiatry.
Psychol Addict Behav. 2022 May;36(3):284-295. doi: 10.1037/adb0000669. Epub 2021 Apr 29.
Generalized linear models (GLMs) such as logistic and Poisson regression are among the most common statistical methods for modeling binary and count outcomes. Though single-coefficient tests (odds ratios, incidence rate ratios) are the most common way to test predictor-outcome relations in these models, they provide limited information on the magnitude and nature of relations with outcomes. We assert that this is largely because they do not describe direct relations with quantities of interest (QoIs) such as probabilities and counts. Shifting focus to QoIs makes several critical nuances of GLMs more apparent.
To bolster interpretability of these models, we provide a tutorial on logistic and Poisson regression and suggestions for enhancements to current reporting practices for predictor-outcome relations in GLMs.
We first highlight differences in interpretation between traditional linear models and GLMs, and describe common misconceptions about GLMs. In particular, we highlight that link functions (a) introduce nonconstant relations between predictors and outcomes and (b) make predictor-QoI relations dependent on levels of other covariates. Each of these properties causes interpretation of GLM coefficients to diverge from interpretations of linear models. Next, we argue for a more central focus on QoIs (probabilities and counts). Finally, we propose and provide graphics and tables, with sample R code, for enhancing presentation and interpretation of QoIs.
By improving present practices in the reporting of predictor-outcome relations in GLMs, we hope to maximize the amount of actionable information generated by statistical analyses and provide a tool for building a cumulative science of substance use disorders. (PsycInfo Database Record (c) 2022 APA, all rights reserved).
广义线性模型(GLMs),如逻辑回归和泊松回归,是用于构建二项和计数结果模型的最常见的统计方法之一。尽管单系数检验(优势比、发病率比)是这些模型中检验预测变量与结果关系最常用的方法,但它们提供的关于与结果关系的大小和性质的信息有限。我们断言,这主要是因为它们不能描述与感兴趣的数量(QoIs),如概率和计数的直接关系。将重点转移到 QoIs 上,可以使 GLMs 的几个关键细微差别更加明显。
为了增强这些模型的可解释性,我们提供了关于逻辑回归和泊松回归的教程,并为 GLMs 中预测变量与结果关系的当前报告实践提出了改进建议。
我们首先强调了传统线性模型和 GLMs 之间在解释上的差异,并描述了关于 GLMs 的常见误解。特别是,我们强调了链接函数(a)在预测器和结果之间引入了非恒定的关系,(b)使预测器-QoI 关系依赖于其他协变量的水平。这两个特性都导致了对 GLM 系数的解释与对线性模型的解释不同。接下来,我们主张更关注 QoIs(概率和计数)。最后,我们提出并提供了图形和表格,以及示例 R 代码,以增强 QoIs 的呈现和解释。
通过改进 GLMs 中预测变量与结果关系报告的现有实践,我们希望最大限度地增加统计分析产生的可操作信息,并为物质使用障碍的累积科学提供一个工具。(PsycInfo 数据库记录(c)2022 APA,保留所有权利)。