Abbott Kenneth, Ho Yen-Yi, Erickson Jennifer
Minnesota Disability Determination Services, Saint Paul, Minnesota, USA.
Department of Statistics, College of Arts and Sciences, University of South Carolina, Columbia, SC, USA.
J Am Med Inform Assoc. 2017 Jul 1;24(4):709-716. doi: 10.1093/jamia/ocw159.
Every year, thousands of patients die waiting for disability benefits from the Social Security Administration. Some qualify for expedited service under the Compassionate Allowance (CAL) initiative, but CAL software focuses exclusively on information from a single form field. This paper describes the development of a supplemental process for identifying some overlooked but gravely ill applicants, through automatic annotation of health records accompanying new claims. We explore improved prioritization instead of fully autonomous claims approval.
We developed a sample of claims containing medical records at the moment of arrival in a single office. A series of tools annotated both patient records and public Web page descriptions of CAL medical conditions. We trained random forests to identify CAL patients and validated each model with 10-fold cross validation.
Our main model, a general CAL classifier, had an area under the receiver operating characteristic curve of 0.915. Combining this classifier with existing software improved sensitivity from 0.960 to 0.994, detecting every deceased patient, but reducing positive predictive value to 0.216.
True positive CAL identification is a priority, given CAL patient mortality. Mere prioritization of the false positives would not create a meaningful burden in terms of manual review. Death certificate data suggest the presence of truly ill patients among putative false positives.
To a limited extent, it is possible to identify gravely ill Social Security disability applicants by analyzing annotations of unstructured electronic health records, and the level of identification is sufficient to be useful in prioritizing case reviews.
每年都有成千上万的患者在等待社会保障管理局的残疾福利金时死亡。一些患者符合“同情性津贴(CAL)”计划下的加急服务条件,但CAL软件仅专注于单个表单字段中的信息。本文描述了一种补充流程的开发,该流程通过对新申请附带的健康记录进行自动标注,来识别一些被忽视但病情严重的申请人。我们探索改进优先级排序,而非完全自主进行理赔审批。
我们创建了一个在单个办公室收到申请时包含医疗记录的样本。一系列工具对患者记录以及CAL医疗状况的公共网页描述进行了标注。我们训练随机森林来识别CAL患者,并通过10折交叉验证对每个模型进行验证。
我们的主要模型,即通用CAL分类器,在接收者操作特征曲线下的面积为0.915。将此分类器与现有软件相结合,灵敏度从0.960提高到了0.994,能检测到每一位死亡患者,但阳性预测值降至0.216。
鉴于CAL患者的死亡率,准确识别真阳性患者是当务之急。仅对假阳性进行优先级排序在人工审核方面不会造成有意义的负担。死亡证明数据表明在推定的假阳性患者中存在真正患病的患者。
在一定程度上,通过分析非结构化电子健康记录的标注来识别病情严重的社会保障残疾申请人是可行的,且识别水平足以在确定案件审核优先级方面发挥作用。