Wu Wenbo, Kalbfleisch John D, Taylor Jeremy M G, Kang Jian, He Kevin
Division of Biostatistics, Department of Population Health, Division of Nephrology, Department of Medicine, Center for Data Science, New York University.
Department of Biostatistics, University of Michigan.
J Comput Graph Stat. 2024;33(4):1252-1263. doi: 10.1080/10618600.2024.2304089. Epub 2024 Feb 13.
The coronavirus disease 2019 (COVID-19) pandemic has exerted a profound impact on patients with end-stage renal disease relying on kidney dialysis to sustain their lives. A preliminary analysis of dialysis patient postdischarge hospital readmissions and deaths in 2020 revealed that the COVID-19 effect has varied significantly with postdischarge time and time since the pandemic onset. However, the complex dynamics cannot be characterized by existing varying coefficient models. To address this issue, we propose a bivariate varying coefficient model for competing risks, where tensor-product B-splines are used to estimate the surface of the COVID-19 effect. An efficient proximal Newton algorithm is developed to facilitate the fitting of the new model to the massive data for Medicare beneficiaries on dialysis. Difference-based anisotropic penalization is introduced to mitigate model overfitting and effect wiggliness; a cross-validation method is derived to determine optimal tuning parameters. Hypothesis testing procedures are designed to examine whether the COVID-19 effect varies significantly with postdischarge time and the time since the pandemic onset, either jointly or separately. Applications to Medicare dialysis patients demonstrate the real-world performance of the proposed methods. Simulation experiments are conducted to evaluate the estimation accuracy, type I error rate, statistical power, and model selection procedures. Supplementary materials for this article are available online.
2019年冠状病毒病(COVID-19)大流行对依赖肾脏透析维持生命的终末期肾病患者产生了深远影响。对2020年透析患者出院后再次入院和死亡情况的初步分析显示,COVID-19的影响随出院时间和大流行开始后的时间有显著差异。然而,现有的变系数模型无法刻画这种复杂的动态变化。为解决这一问题,我们提出了一种用于竞争风险的双变量变系数模型,其中使用张量积B样条来估计COVID-19影响的曲面。开发了一种有效的近端牛顿算法,以促进新模型对医疗保险受益透析患者海量数据的拟合。引入基于差异的各向异性惩罚以减轻模型过拟合和效应波动;推导了一种交叉验证方法来确定最优调谐参数。设计了假设检验程序,以检验COVID-19影响是否随出院时间和大流行开始后的时间显著变化,无论是联合变化还是单独变化。对医疗保险透析患者的应用证明了所提方法的实际性能。进行了模拟实验,以评估估计准确性、I型错误率、统计功效和模型选择程序。本文的补充材料可在线获取。