Pollack Jackie, Yang Wei, Schnellinger Erin M, Arnaoutakis George J, Kallan Michael J, Kimmel Stephen E
Department of Epidemiology, College of Public Health and Health Professions and College of Medicine, University of Florida, Gainesville, Fla.
Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pa.
JTCVS Open. 2023 Jul 24;15:94-112. doi: 10.1016/j.xjon.2023.07.011. eCollection 2023 Sep.
Clinical prediction models for surgical aortic valve replacement mortality, are valuable decision tools but are often limited in their ability to account for changes in medical practice, patient selection, and the risk of outcomes over time. Recent research has identified methods to update models as new data accrue, but their effect on model performance has not been rigorously tested.
The study population included 44,546 adults who underwent an isolated surgical aortic valve replacement from January 1, 1999, to December 31, 2018, statewide in Pennsylvania. After chronologically splitting the data into training and validation sets, we compared calibration, discrimination, and accuracy measures amongst a nonupdating model to 2 methods of model updating: calibration regression and the novel dynamic logistic state space model.
The risk of mortality decreased significantly during the validation period ( < .01) and the nonupdating model demonstrated poor calibration and reduced accuracy over time. Both updating models maintained better calibration (Hosmer-Lemeshow χ statistic) than the nonupdating model: nonupdating (156.5), calibration regression (4.9), and dynamic logistic state space model (8.0). Overall accuracy (Brier score) was consistently better across both updating models: dynamic logistic state space model (0.0252), calibration regression (0.0253), and nonupdating (0.0256). Discrimination improved with the dynamic logistic state space model (area under the curve, 0.696) compared with the nonupdating model (area under the curve, 0.685) and calibration regression method (area under the curve, 0.687).
Dynamic model updating can improve model accuracy, discrimination, and calibration. The decision as to which method to use may depend on which measure is most important in each clinical context. Because competing therapies have emerged for valve replacement models, updating may guide clinical decision making.
用于外科主动脉瓣置换术死亡率的临床预测模型是有价值的决策工具,但在考虑医疗实践变化、患者选择以及随时间推移的结果风险方面,其能力往往有限。最近的研究已确定随着新数据积累更新模型的方法,但其对模型性能的影响尚未经过严格测试。
研究人群包括1999年1月1日至2018年12月31日在宾夕法尼亚州全州接受单纯外科主动脉瓣置换术的44,546名成年人。按时间顺序将数据分为训练集和验证集后,我们比较了非更新模型与两种模型更新方法(校准回归和新型动态逻辑状态空间模型)之间的校准、区分度和准确性指标。
在验证期内死亡率风险显著降低(P<0.01),且非更新模型随着时间推移校准不佳且准确性降低。两种更新模型均比非更新模型保持了更好的校准(Hosmer-Lemeshow卡方统计量):非更新模型(156.5)、校准回归模型(4.9)和动态逻辑状态空间模型(8.0)。两种更新模型的总体准确性(Brier评分)始终更好:动态逻辑状态空间模型(0.0252)、校准回归模型(0.0253)和非更新模型(0.0256)。与非更新模型(曲线下面积,0.685)和校准回归方法(曲线下面积,0.687)相比,动态逻辑状态空间模型的区分度有所提高(曲线下面积,0.696)。
动态模型更新可提高模型准确性、区分度和校准度。选择使用哪种方法的决策可能取决于每种临床情况下哪种指标最重要。由于瓣膜置换模型出现了相互竞争的疗法,更新可能会指导临床决策。