Speth Kelly A, Elliott Michael R, Marquez Juan L, Wang Lu
Department of Biostatistics, School of Public Health, 1259University of Michigan, Ann Arbor, MI, USA.
Department of Epidemiology, School of Public Health, 1259University of Michigan, Ann Arbor, MI, USA.
Stat Methods Med Res. 2022 Dec;31(12):2338-2351. doi: 10.1177/09622802221122397. Epub 2022 Oct 3.
Dynamic treatment regimes are a set of time-adaptive decision rules that can be used to personalize treatment across multiple stages of care. Grounded in causal inference methods, dynamic treatment regimes identify variables that differentiate the treatment effect and may be used to tailor treatments across individuals based on the patient's own characteristics - thereby representing an important step toward personalized medicine. In this manuscript we introduce Penalized Spline-Involved Tree-based Learning, which seeks to improve upon existing tree-based approaches to estimating an optimal dynamic treatment regime. Instead of using weights determined from the estimated propensity scores, which may result in unstable estimates when weights are highly variable, we predict missing counterfactual outcomes using regression models that incorporate a penalized spline of the propensity score and other covariates predictive of the outcome. We further develop a novel purity measure applied within a decision tree framework to produce a flexible yet interpretable method for estimating an optimal multi-stage multi-treatment dynamic treatment regime. In simulation experiments we demonstrate good performance of Penalized Spline-Involved Tree-based Learning relative to competing methods and, in particular, we show that Penalized Spline-Involved Tree-based Learning may be advantageous when the sample size is small and/or when the level of confounding of the outcome is high. We apply Penalized Spline-Involved Tree-based Learning to the retrospectively-collected Medical Information Mart for Intensive Care dataset to identify variables that may be used to tailor early fluid resuscitation strategies in septic patients.
动态治疗方案是一组时间自适应决策规则,可用于在多个护理阶段实现个性化治疗。基于因果推断方法,动态治疗方案识别出能够区分治疗效果的变量,并可根据患者自身特征为个体量身定制治疗方案,从而代表了迈向个性化医疗的重要一步。在本论文中,我们介绍了基于惩罚样条的树状学习方法,该方法旨在改进现有的基于树状的方法来估计最优动态治疗方案。我们不是使用根据估计的倾向得分确定的权重,因为当权重高度可变时可能会导致不稳定的估计,而是使用回归模型预测缺失的反事实结果,该回归模型纳入了倾向得分的惩罚样条以及其他预测结果的协变量。我们进一步开发了一种在决策树框架内应用的新型纯度度量,以产生一种灵活且可解释的方法来估计最优的多阶段多治疗动态治疗方案。在模拟实验中,我们证明了基于惩罚样条的树状学习相对于竞争方法具有良好的性能,特别是当样本量较小和/或结果的混杂程度较高时,基于惩罚样条的树状学习可能具有优势。我们将基于惩罚样条的树状学习应用于回顾性收集的重症监护医学信息库数据集,以识别可用于为脓毒症患者量身定制早期液体复苏策略的变量。