Blumenthal Daniel M, Singal Gaurav, Mangla Shikha S, Macklin Eric A, Chung Daniel C
Department of Internal Medicine, Massachusetts General Hospital, Boston, MA, USA.
J Gen Intern Med. 2015 Jun;30(6):724-31. doi: 10.1007/s11606-014-3165-6. Epub 2015 Jan 14.
Accurately predicting the risk of no-show for a scheduled colonoscopy can help target interventions to improve compliance with colonoscopy, and thereby reduce the disease burden of colorectal cancer and enhance the utilization of resources within endoscopy units.
We aimed to utilize information available in an electronic medical record (EMR) and endoscopy scheduling system to create a predictive model for no-show risk, and to simultaneously evaluate the role for natural language processing (NLP) in developing such a model.
This was a retrospective observational study using discovery and validation phases to design a colonoscopy non-adherence prediction model. An NLP-derived variable called the Non-Adherence Ratio ("NAR") was developed, validated, and included in the model.
Patients scheduled for outpatient colonoscopy at an Academic Medical Center (AMC) that is part of a multi-hospital health system, 2009 to 2011, were included in the study.
Odds ratios for non-adherence were calculated for all variables in the discovery cohort, and an Area Under the Receiver Operating Curve (AUC) was calculated for the final non-adherence prediction model.
The non-adherence model included six variables: 1) gender; 2) history of psychiatric illness, 3) NAR; 4) wait time in months; 5) number of prior missed endoscopies; and 6) education level. The model achieved discrimination in the validation cohort (AUC= =70.2 %). At a threshold non-adherence score of 0.46, the model's sensitivity and specificity were 33 % and 92 %, respectively. Removing the NAR from the model significantly reduced its predictive power (AUC = 64.3 %, difference = 5.9 %, p < 0.001).
A six-variable model using readily available clinical and demographic information demonstrated accuracy for predicting colonoscopy non-adherence. The NAR, a novel variable developed using NLP technology, significantly strengthened this model's predictive power.
准确预测结肠镜检查预约失约风险有助于确定干预措施,以提高结肠镜检查的依从性,从而减轻结直肠癌的疾病负担,并提高内镜科室资源的利用率。
我们旨在利用电子病历(EMR)和内镜检查预约系统中的可用信息,创建一个失约风险预测模型,并同时评估自然语言处理(NLP)在开发此类模型中的作用。
这是一项回顾性观察研究,使用发现和验证阶段来设计结肠镜检查不依从预测模型。一个名为不依从率(“NAR”)的NLP衍生变量被开发、验证并纳入模型。
2009年至2011年在一家多医院卫生系统所属的学术医疗中心(AMC)预约门诊结肠镜检查的患者被纳入研究。
计算发现队列中所有变量的不依从比值比,并计算最终不依从预测模型的受试者操作特征曲线下面积(AUC)。
不依从模型包括六个变量:1)性别;2)精神疾病史;3)NAR;4)等待月数;5)既往错过内镜检查的次数;6)教育水平。该模型在验证队列中具有区分能力(AUC = 70.2%)。在不依从评分阈值为0.46时,该模型的敏感性和特异性分别为33%和92%。从模型中去除NAR会显著降低其预测能力(AUC = 64.3%,差异 = 5.9%,p < 0.001)。
一个使用易于获得的临床和人口统计学信息的六变量模型在预测结肠镜检查不依从方面显示出准确性。NAR是使用NLP技术开发的一个新变量,显著增强了该模型的预测能力。