Dagliati A, Sacchi L, Zambelli A, Tibollo V, Pavesi L, Holmes J H, Bellazzi R
Department of Electrical, Computer and Biomedical Engineering and Centre for Health Technologies, University of Pavia, Italy.
Department of Electrical, Computer and Biomedical Engineering and Centre for Health Technologies, University of Pavia, Italy.
J Biomed Inform. 2017 Feb;66:136-147. doi: 10.1016/j.jbi.2016.12.012. Epub 2017 Jan 3.
In this work we present a careflow mining approach designed to analyze heterogeneous longitudinal data and to identify phenotypes in a patient cohort. The main idea underlying our approach is to combine methods derived from sequential pattern mining and temporal data mining to derive frequent healthcare histories (careflows) in a population of patients. This approach was applied to an integrated data repository containing clinical and administrative data of more than 4000 breast cancer patients. We used the mined histories to identify sub-cohorts of patients grouped according to healthcare activities pathways, then we characterized these sub-cohorts with clinical data. In this way, we were able to perform temporal electronic phenotyping of electronic health records (EHR) data.
在这项工作中,我们提出了一种护理流程挖掘方法,旨在分析异质纵向数据并识别患者队列中的表型。我们方法背后的主要思想是结合从序列模式挖掘和时态数据挖掘派生的方法,以在患者群体中得出频繁的医疗保健历史(护理流程)。该方法应用于一个集成数据存储库,其中包含4000多名乳腺癌患者的临床和管理数据。我们使用挖掘出的历史记录来识别根据医疗保健活动路径分组的患者亚组,然后用临床数据对这些亚组进行特征描述。通过这种方式,我们能够对电子健康记录(EHR)数据进行时态电子表型分析。