Petrik Amanda F, Johnson Eric S, Mummadi Rajasekhara, Slaughter Matthew, Coronado Gloria D, Lin Sunny C, Savitz Lucy, Wallace Neal
Kaiser Permanente Center for Health Research, Portland, OR, USA.
Northwest Permanente, Portland, OR, USA.
Prev Med Rep. 2023 Aug 11;36:102366. doi: 10.1016/j.pmedr.2023.102366. eCollection 2023 Dec.
Promotion of colorectal cancer (CRC) screening can be expensive and unnecessary for many patients. The use of predictive analytics promises to help health systems target the right services to the right patients at the right time while improving population health. Multilevel data at the interpersonal, organizational, community, and policy levels, is rarely considered in clinical decision making but may be used to improve CRC screening risk prediction. We compared the effectiveness of a CRC screening risk prediction model that uses multilevel data with a more conventional model that uses only individual patient data. We used a retrospective cohort to ascertain the one-year occurrence of CRC screening. The cohort was determined from a Health Maintenance Organization, in Oregon. Eligible patients were 50-75 years old, health plan members for at least one year before their birthday in 2018 and were due for screening. We created a risk model using logistic regression first with data available in the electronic health record (EHR), and then added multilevel data. In a cohort of 59,249 patients, 36.1% completed CRC screening. The individual level model included 14 demographic, clinical and encounter based characteristics, had a bootstrap-corrected C-statistic of 0.722 and sufficient calibration. The multilevel model added 9 variables from clinical setting and community characteristics, and the bootstrap-corrected C-statistic remained the same with continued sufficient calibration. The predictive power of the CRC screening model did not improve after adding multilevel data. Our findings suggest that multilevel data added no improvement to the prediction of the likelihood of CRC screening.
推广结直肠癌(CRC)筛查对许多患者来说可能成本高昂且没有必要。使用预测分析有望帮助医疗系统在正确的时间为正确的患者提供正确的服务,同时改善人群健康状况。人际、组织、社区和政策层面的多层次数据在临床决策中很少被考虑,但可用于改善CRC筛查风险预测。我们比较了使用多层次数据的CRC筛查风险预测模型与仅使用个体患者数据的传统模型的有效性。我们采用回顾性队列研究来确定CRC筛查的一年发生率。该队列来自俄勒冈州的一个健康维护组织。符合条件的患者年龄在50至75岁之间,在2018年生日前至少有一年的健康计划成员资格且应进行筛查。我们首先使用电子健康记录(EHR)中的可用数据通过逻辑回归创建了一个风险模型,然后添加了多层次数据。在一个包含59,249名患者的队列中,36.1%的患者完成了CRC筛查。个体水平模型包括14个人口统计学、临床和基于就诊的特征,经自抽样校正后的C统计量为0.722且校准充分。多层次模型添加了9个来自临床环境和社区特征的变量,经自抽样校正后的C统计量保持不变且校准仍然充分。添加多层次数据后,CRC筛查模型的预测能力并未提高。我们的研究结果表明,多层次数据并未改善对CRC筛查可能性的预测。