The University of Melbourne, Australia.
The University of Melbourne, Australia.
J Biomed Inform. 2022 Jun;130:104081. doi: 10.1016/j.jbi.2022.104081. Epub 2022 May 4.
Process mining is a discipline sitting between data mining and process science, whose goal is to provide theoretical methods and software tools to analyse process execution data, known as event logs. Although process mining was originally conceived to facilitate business process management activities, research studies have shown the benefit of leveraging process mining in healthcare contexts. However, applying process mining tools to analyse healthcare process execution data is not straightforward. In this paper, we show a methodology to: i) prepare general practice healthcare process data for conducting a process mining analysis; ii) select and apply suitable process mining solutions for successfully executing the analysis; and iii) extract valuable insights from the obtained results, alongside leads for traditional data mining analysis. By doing so, we identified two major challenges when using process mining solutions for analysing healthcare process data, and highlighted benefits and limitations of the state-of-the-art process mining techniques when dealing with highly variable processes and large data-sets. While we provide solutions to the identified challenges, the overarching goal of this study was to detect differences between the patients' health services utilization pattern observed in 2020-during the COVID-19 pandemic and mandatory lock-downs -and the one observed in the prior four years, 2016 to 2019. By using a combination of process mining techniques and traditional data mining, we were able to demonstrate that vaccinations in Victoria did not drop drastically-as other interactions did. On the contrary, we observed a surge of influenza and pneumococcus vaccinations in 2020, as opposed to other research findings of similar studies conducted in different geographical areas.
流程挖掘是数据挖掘和流程科学之间的一门学科,其目标是提供理论方法和软件工具来分析流程执行数据,即事件日志。尽管流程挖掘最初是为了促进业务流程管理活动而构想的,但研究表明,在医疗保健环境中利用流程挖掘具有益处。然而,将流程挖掘工具应用于分析医疗保健流程执行数据并不简单。在本文中,我们展示了一种方法,用于:i)准备一般实践医疗保健流程数据以进行流程挖掘分析;ii)选择和应用合适的流程挖掘解决方案以成功执行分析;以及 iii)从获得的结果中提取有价值的见解,并为传统数据挖掘分析提供线索。通过这样做,我们确定了在使用流程挖掘解决方案分析医疗保健流程数据时的两个主要挑战,并强调了在处理高度变化的流程和大数据集时,最新的流程挖掘技术的优势和局限性。虽然我们为所识别的挑战提供了解决方案,但本研究的总体目标是检测在 2020 年(在 COVID-19 大流行和强制性封锁期间)观察到的患者健康服务利用模式与前四年(2016 年至 2019 年)观察到的模式之间的差异。通过结合流程挖掘技术和传统数据挖掘,我们能够证明维多利亚州的疫苗接种并没有像其他交互作用那样急剧下降。相反,我们观察到 2020 年流感和肺炎球菌疫苗接种的激增,与在不同地理区域进行的类似研究的其他研究结果相反。