Liao Lauren D, Zhu Yeyi, Ngo Amanda L, Chehab Rana F, Pimentel Samuel D
Division of Biostatistics, Berkeley, CA 94720.
Kaiser Permanente Northern California Division of Research, Oakland, CA 94612.
Am Stat. 2024;78(3):318-326. doi: 10.1080/00031305.2024.2303419. Epub 2024 Feb 8.
Observational studies of treatment effects require adjustment for confounding variables. However, causal inference methods typically cannot deliver perfect adjustment on all measured baseline variables, and there is often ambiguity about which variables should be prioritized. Standard prioritization methods based on treatment imbalance alone neglect variables' relationships with the outcome. We propose the joint variable importance plot to guide variable prioritization for observational studies. Since not all variables are equally relevant to the outcome, the plot adds outcome associations to quantify the potential confounding jointly with the standardized mean difference. To enhance comparisons on the plot between variables with different confounding relationships, we also derive and plot bias curves. Variable prioritization using the plot can produce recommended values for tuning parameters in many existing matching and weighting methods. We showcase the use of the joint variable importance plots in the design of a balance-constrained matched study to evaluate whether taking an antidiabetic medication, glyburide, increases the incidence of C-section delivery among pregnant individuals with gestational diabetes.
对治疗效果的观察性研究需要对混杂变量进行调整。然而,因果推断方法通常无法对所有测量的基线变量进行完美调整,而且对于哪些变量应被优先考虑往往存在模糊性。仅基于治疗不平衡的标准优先排序方法忽略了变量与结果之间的关系。我们提出联合变量重要性图来指导观察性研究的变量优先排序。由于并非所有变量与结果的相关性都相同,该图增加了结果关联,以便与标准化平均差异一起量化潜在的混杂因素。为了增强具有不同混杂关系的变量在图上的比较,我们还推导并绘制了偏差曲线。使用该图进行变量优先排序可以为许多现有的匹配和加权方法中的调整参数生成推荐值。我们展示了联合变量重要性图在平衡约束匹配研究设计中的应用,以评估服用抗糖尿病药物格列本脲是否会增加妊娠期糖尿病孕妇剖宫产的发生率。