Goldstein Mark, Han Xintian, Puli Aahlad, Perotte Adler J, Ranganath Rajesh
New York University.
Columbia University.
Adv Neural Inf Process Syst. 2020 Dec;33:18296-18307.
Survival analysis models the distribution of time until an event of interest, such as discharge from the hospital or admission to the ICU. When a model's predicted number of events within any time interval is similar to the observed number, it is called . A survival model's calibration can be measured using, for instance, distributional calibration (D-CALIBRATION) [Haider et al., 2020] which computes the squared difference between the observed and predicted number of events within different time intervals. Classically, calibration is addressed in post-training analysis. We develop explicit calibration (X-CAL), which turns D-CALIBRATION into a differentiable objective that can be used in survival modeling alongside maximum likelihood estimation and other objectives. X-CAL allows practitioners to directly optimize calibration and strike a desired balance between predictive power and calibration. In our experiments, we fit a variety of shallow and deep models on simulated data, a survival dataset based on MNIST, on length-of-stay prediction using MIMIC-III data, and on brain cancer data from The Cancer Genome Atlas. We show that the models we study can be miscalibrated. We give experimental evidence on these datasets that X-CAL improves D-CALIBRATION without a large decrease in concordance or likelihood.
生存分析对直至感兴趣事件(如出院或入住重症监护病房)发生的时间分布进行建模。当模型在任何时间间隔内预测的事件数量与观察到的数量相似时,它被称为 。生存模型的校准可以例如使用分布校准(D-CALIBRATION)[海德尔等人,2020年]来衡量,该方法计算不同时间间隔内观察到的和预测的事件数量之间的平方差。传统上,校准是在训练后分析中解决的。我们开发了显式校准(X-CAL),它将D-CALIBRATION转化为一个可微目标,可与最大似然估计和其他目标一起用于生存建模。X-CAL允许从业者直接优化校准,并在预测能力和校准之间达成理想的平衡。在我们的实验中,我们在模拟数据、基于MNIST的生存数据集、使用MIMIC-III数据进行的住院时间预测以及来自癌症基因组图谱的脑癌数据上拟合了各种浅层和深层模型。我们表明我们研究的模型可能存在校准错误。我们在这些数据集上给出实验证据,表明X-CAL在不大幅降低一致性或似然性的情况下改进了D-CALIBRATION。