Trescato Isotta, Tavazzi Erica, Vettoretti Martina, Gatta Roberto, Vasta Rosario, Chio Adriano, Camillo Barbara Di
IEEE J Biomed Health Inform. 2024 Dec;28(12):7553-7564. doi: 10.1109/JBHI.2024.3453602. Epub 2024 Dec 5.
DYNAMITE, an acronym for DYNamic Archetypal analysis for MIning disease TrajEctories, is a new methodology developed specifically to model disease progression by exploiting information available in longitudinal clinical datasets. First, archetypal analysis is applied to data organised in matrix form, with the aim of finding extreme and representative disease states (archetypes) linked to the original data through convex coefficients. Then, each original observation is associated with a single archetype based on their similarity; finally, an event log is created encoding the progression of disease states for each patient in terms of archetype states. In the last stage of the procedure, archetypal analysis is coupled with process mining, which allows the event log archetypes to be visualised graphically as sequences of disease states, allowing the clinical trajectories of patients to be extracted and examined. As a proof of concept, we applied the proposed method to data from a cohort of amyotrophic lateral sclerosis patients whose progression was monitored using the 12-item ALSFRS-R questionnaire. Without any a priori knowledge, DYNAMITE identified six archetypes clearly describing different types and severity of impairment and provided reliable clinical trajectories consistent with the prognosis of amyotrophic lateral sclerosis patients. DYNAMITE offers high interpretability at every stage of the analysis, which makes it particularly suitable for use in healthcare where explainability is paramount, and enables analysis of clinical trajectories at both individual and population levels.
DYNAMITE是“用于挖掘疾病轨迹的动态原型分析”(DYNamic Archetypal analysis for MIning disease TrajEctories)的首字母缩写,是一种专门开发的新方法,通过利用纵向临床数据集中的可用信息来模拟疾病进展。首先,将原型分析应用于以矩阵形式组织的数据,目的是通过凸系数找到与原始数据相关的极端和代表性疾病状态(原型)。然后,根据每个原始观察值与单个原型的相似性进行关联;最后,创建一个事件日志,根据原型状态对每个患者的疾病状态进展进行编码。在该过程的最后阶段,将原型分析与过程挖掘相结合,这使得事件日志原型能够以疾病状态序列的形式进行图形化可视化,从而提取和检查患者的临床轨迹。作为概念验证,我们将所提出的方法应用于一组肌萎缩侧索硬化症患者的数据,这些患者的病情进展使用12项ALSFRS-R问卷进行监测。在没有任何先验知识的情况下,DYNAMITE识别出六个原型,清晰地描述了不同类型和严重程度的损伤,并提供了与肌萎缩侧索硬化症患者预后一致的可靠临床轨迹。DYNAMITE在分析的每个阶段都具有很高的可解释性,这使其特别适合在可解释性至关重要的医疗保健领域使用,并能够在个体和群体层面分析临床轨迹。