Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Italy.
University of Manchester, UK.
Artif Intell Med. 2020 May;105:101855. doi: 10.1016/j.artmed.2020.101855. Epub 2020 Apr 15.
In this work we describe the application of a careflow mining algorithm to detect the most frequent patterns of care in a cohort of 3000 breast cancer patients. The applied method relies on longitudinal data extracted from electronic health records, recorded from the first surgical procedure after a breast cancer diagnosis. Careflows are mined from events data recorded for administrative purposes, including procedures from ICD9 - CM billing codes and chemotherapy treatments. Events data have been pre-processed with Topic Modelling to create composite events based on concurrent procedures. The results of the careflow mining algorithm allow the discovery of electronic temporal phenotypes across the studied population. These phenotypes are further characterized on the basis of clinical traits and tumour histopathology, as well as in terms of relapses, metastasis occurrence and 5-year survival rates. Results are highly significant from a clinical perspective, since phenotypes describe well characterized pathology classes, and the careflows are well matched with existing clinical guidelines. The analysis thus facilitates deriving real-world evidence that can inform clinicians as well as hospital decision makers.
在这项工作中,我们描述了一种护理流程挖掘算法在一个 3000 名乳腺癌患者队列中的应用,以检测最常见的护理模式。所应用的方法依赖于从电子健康记录中提取的纵向数据,这些数据记录自乳腺癌诊断后的第一次手术。护理流程是从为管理目的记录的事件数据中挖掘出来的,包括 ICD9-CM 计费代码和化疗治疗的程序。事件数据已经通过主题建模进行了预处理,以便根据并发程序创建复合事件。护理流程挖掘算法的结果允许在研究人群中发现电子时间表型。在此基础上,根据临床特征和肿瘤组织病理学特征,以及复发、转移发生和 5 年生存率对这些表型进行了进一步的特征描述。从临床角度来看,结果是非常显著的,因为表型描述了具有良好特征的病理类,并且护理流程与现有的临床指南非常匹配。因此,该分析有助于得出可以为临床医生和医院决策者提供信息的真实世界证据。