Zhang Yiye, Padman Rema
The H. John Heinz III College, Carnegie Mellon University, 4800 Forbes Ave, Pittsburgh, PA 15213. E-mail:
Am J Manag Care. 2015 Dec 1;21(12):e661-8.
Chronic diseases are common, complex, and expensive health conditions that can benefit from innovations in healthcare service delivery enabled by information technology and advanced analytic methods. This paper proposes a data-driven approach, illustrated in the context of chronic kidney disease (CKD), to develop clinical pathways of care delivery from electronic health record (EHR) data.
We analyzed structured and de-identified EHR data from 2009 to 2013 of 664 CKD patients with multiple chronic conditions.
Machine learning algorithms were used to learn data-driven and practice-based clinical pathways that cluster patients into subgroups and model the co-progression of their encounter types, diagnoses, medications, and biochemical measurements. Given a pattern of biochemical measurements, our algorithm identifies the most probable clinical pathways, and makes predictions regarding future states, with and without temporal information. CKD stages, their complications, and common medications are included in the clinical pathways.
Using the EHR data of 664 patients who were initially in CKD stage 3 and hypertensive, we identified 7 patient subgroups-each distinguished primarily by the type of complications suffered by the patients. Our algorithm demonstrates fair accuracy (up to 44% and 75%, respectively) in learning the most probable clinical pathways and predicting future states associated with temporal patterns of biochemical measurements and patient subgroups.
Data-driven clinical pathway learning summarizes multidimensional and longitudinal information from EHRs into clusters of common sequences of patient visits that may assist in the efficient review of current practices and identifying potential innovations in the care delivery process.
慢性病是常见、复杂且费用高昂的健康状况,可受益于信息技术和先进分析方法带来的医疗服务提供方面的创新。本文提出一种数据驱动方法,以慢性肾脏病(CKD)为例,从电子健康记录(EHR)数据中开发护理提供的临床路径。
我们分析了2009年至2013年664例患有多种慢性病的CKD患者的结构化且经过去识别处理的EHR数据。
使用机器学习算法来学习数据驱动且基于实践的临床路径,将患者聚类为亚组,并对其就诊类型、诊断、药物治疗和生化测量的共同进展进行建模。给定生化测量模式,我们的算法识别最可能的临床路径,并对有无时间信息的未来状态进行预测。临床路径包括CKD分期、其并发症和常用药物。
利用664例最初处于CKD 3期且患有高血压的患者的EHR数据,我们识别出7个患者亚组——每个亚组主要由患者所患并发症类型区分。我们的算法在学习最可能的临床路径以及预测与生化测量时间模式和患者亚组相关的未来状态方面表现出相当的准确性(分别高达44%和75%)。
数据驱动的临床路径学习将EHR中的多维和纵向信息总结为患者就诊常见序列的聚类,这可能有助于有效审查当前实践并识别护理提供过程中的潜在创新。