Gensheimer Michael F, Narasimhan Balasubramanian
Department of Radiation Oncology, Stanford University, Stanford, CA, United States of America.
Department of Statistics, Stanford University, Stanford, CA, United States of America.
PeerJ. 2019 Jan 25;7:e6257. doi: 10.7717/peerj.6257. eCollection 2019.
There is currently great interest in applying neural networks to prediction tasks in medicine. It is important for predictive models to be able to use survival data, where each patient has a known follow-up time and event/censoring indicator. This avoids information loss when training the model and enables generation of predicted survival curves. In this paper, we describe a discrete-time survival model that is designed to be used with neural networks, which we refer to as Nnet-survival. The model is trained with the maximum likelihood method using mini-batch stochastic gradient descent (SGD). The use of SGD enables rapid convergence and application to large datasets that do not fit in memory. The model is flexible, so that the baseline hazard rate and the effect of the input data on hazard probability can vary with follow-up time. It has been implemented in the Keras deep learning framework, and source code for the model and several examples is available online. We demonstrate the performance of the model on both simulated and real data and compare it to existing models Cox-nnet and Deepsurv.
当前,将神经网络应用于医学预测任务备受关注。对于预测模型而言,能够使用生存数据至关重要,其中每位患者都有已知的随访时间以及事件/删失指标。这可避免在训练模型时出现信息丢失,并能生成预测生存曲线。在本文中,我们描述了一种设计用于与神经网络配合使用的离散时间生存模型,我们将其称为Nnet - survival。该模型使用小批量随机梯度下降(SGD)通过最大似然法进行训练。SGD的使用能够实现快速收敛,并可应用于无法完全载入内存的大型数据集。该模型具有灵活性,使得基线风险率以及输入数据对风险概率的影响能够随随访时间而变化。它已在Keras深度学习框架中实现,并且该模型的源代码以及多个示例均可在线获取。我们在模拟数据和真实数据上展示了该模型的性能,并将其与现有模型Cox - nnet和Deepsurv进行了比较。