Lee Kyu Ha, Haneuse Sebastien, Schrag Deborah, Dominici Francesca
Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA.
Dana-Farber Cancer Institute, Boston, Massachusetts, USA.
J R Stat Soc Ser C Appl Stat. 2015 Feb 1;64(2):253-273. doi: 10.1111/rssc.12078.
In the U.S., the Centers for Medicare and Medicaid Services uses 30-day readmission, following hospitalization, as a proxy outcome to monitor quality of care. These efforts generally focus on treatable health conditions, such as pneumonia and heart failure. Expanding quality of care systems to monitor conditions for which treatment options are limited or non-existent, such as pancreatic cancer, is challenging because of the non-trivial force of mortality; 30-day mortality for pancreatic cancer is approximately 30%. In the statistical literature, data that arise when the observation of the time to some non-terminal event is subject to some terminal event are referred to as 'semi-competing risks data'. Given such data, scientific interest may lie in at least one of three areas: (i) estimation/inference for regression parameters, (ii) characterization of dependence between the two events, and (iii) prediction given a covariate profile. Existing statistical methods focus almost exclusively on the first of these; methods are sparse or non-existent, however, when interest lies with understanding dependence and performing prediction. In this paper we propose a Bayesian semi-parametric regression framework for analyzing semi-competing risks data that permits the simultaneous investigation of all three of the aforementioned scientific goals. Characterization of the induced posterior and posterior predictive distributions is achieved via an efficient Metropolis-Hastings-Green algorithm, which has been implemented in an R package. The proposed framework is applied to data on 16,051 individuals diagnosed with pancreatic cancer between 2005-2008, obtained from Medicare Part A. We found that increased risk for readmission is associated with a high comorbidity index, a long hospital stay at initial hospitalization, non-white race, male, and discharge to home care.
在美国,医疗保险和医疗补助服务中心将住院后30天再入院情况作为监测医疗质量的替代指标。这些工作通常聚焦于可治疗的健康状况,如肺炎和心力衰竭。由于死亡率较高,将医疗质量监测系统扩展到治疗选择有限或不存在的疾病(如胰腺癌)具有挑战性;胰腺癌的30天死亡率约为30%。在统计文献中,当观察到某个非终末事件的时间受到某个终末事件影响时产生的数据被称为“半竞争风险数据”。对于此类数据,科学兴趣可能至少集中在三个领域之一:(i)回归参数的估计/推断,(ii)两个事件之间相关性的特征描述,以及(iii)给定协变量概况时的预测。现有的统计方法几乎完全集中在第一个领域;然而,当兴趣在于理解相关性和进行预测时,方法却很稀少或不存在。在本文中,我们提出了一个贝叶斯半参数回归框架来分析半竞争风险数据,该框架允许同时研究上述所有三个科学目标。通过一种高效的Metropolis-Hastings-Green算法实现了诱导后验分布和后验预测分布的特征描述,该算法已在一个R包中实现。所提出的框架应用于2005年至2008年间从医疗保险A部分获得的16051例被诊断为胰腺癌的个体的数据。我们发现,再入院风险增加与高合并症指数、首次住院时间长、非白人种族、男性以及出院后接受家庭护理有关。