Dept. of Biostatistics and Data Science, University of Kansas Medical Center, Kansas City, KS 66160, USA.
Dept. of Statistics and Actuarial-Financial Mathematics, University of the Aegean, Samos 83200, Greece.
Comput Methods Programs Biomed. 2020 Jul;190:105357. doi: 10.1016/j.cmpb.2020.105357. Epub 2020 Jan 29.
In survival analysis both the Kaplan-Meier estimate and the Cox model enjoy a broad acceptance. We present an improved spline-based survival estimate and offer a fully automated software for its implementation. We explore the use of natural cubic splines that are constrained to be monotone. Apart from its superiority over the Kaplan Meier estimator our approach overcomes limitations of other known smoothing approaches and can accommodate covariates. Unlike other spline methods, concerns of computational problems and issues of overfitting are resolved since no attempt is made to maximize a likelihood once the Kaplan-Meier estimator is obtained. An application to laryngeal cancer data, a simulation study and illustrations of the broad application of the method and its software are provided. In addition to presenting our approaches, this work contributes to bridging a communication gap between clinicians and statisticians that is often apparent in the medical literature.
We employ a two-stage approach: first obtain the stepwise cumulative hazard and then consider a natural cubic spline to smooth its steps under restrictions of monotonicity between any consecutive knots. The underlying region of monotonicity corresponds to a non-linear region that encompasses the full family of monotone third-degree polynomials. We approximate it linearly and reduce the problem to a restricted least squares one under linear restrictions. This ensures convexity. We evaluate our method through simulations against competitive traditional approaches.
Our method is compared to the popular Kaplan Meier estimate both in terms of mean squared error and in terms of coverage. Over-fitting is avoided by construction, as our spline attempts to approximate the empirical estimate of the cumulative hazard itself, and is not fitted directly on the data.
The proposed approach will enable clinical researchers to obtain improved survival estimates and valid confidence intervals over the full spectrum of the range of the survival data. Our methods outperform conventional approaches and can be readily utilized in settings beyond survival analysis such as diagnostic testing.
在生存分析中,Kaplan-Meier 估计和 Cox 模型都得到了广泛的认可。我们提出了一种改进的基于样条的生存估计,并提供了一个完全自动化的软件来实现它。我们探索使用受约束为单调的自然三次样条。除了优于 Kaplan-Meier 估计器之外,我们的方法还克服了其他已知平滑方法的局限性,并可以容纳协变量。与其他样条方法不同,由于一旦获得 Kaplan-Meier 估计器,就不会尝试最大化似然函数,因此不会出现计算问题和过度拟合的问题。我们将其应用于喉癌数据,并进行了模拟研究和方法及其软件的广泛应用的说明。除了介绍我们的方法外,这项工作还有助于弥合临床医生和统计学家之间在医学文献中经常出现的沟通鸿沟。
我们采用两阶段方法:首先获得逐步累积风险,然后在单调约束下考虑自然三次样条来平滑其步骤。单调的基本区域对应于包含全单调三次多项式族的非线性区域。我们将其线性逼近,并在线性约束下将问题简化为受限最小二乘问题。这确保了凸性。我们通过模拟与竞争传统方法进行评估。
我们的方法在均方误差和覆盖范围方面与流行的 Kaplan-Meier 估计进行了比较。通过构造避免了过度拟合,因为我们的样条试图逼近累积风险的经验估计本身,而不是直接拟合数据。
该方法将使临床研究人员能够在整个生存数据范围上获得改进的生存估计和有效置信区间。我们的方法优于传统方法,可以在生存分析以外的环境中轻松利用,例如诊断测试。