Department of Computer Science and Engineering, McKelvey School of Engineering, Washington University in St Louis, St. Louis, Missouri, United States of America.
Institute for Informatics, Data Science and Biostatistics (I2DB), Washington University School of Medicine, St. Louis, Missouri, United States of America.
PLoS One. 2024 Nov 14;19(11):e0313425. doi: 10.1371/journal.pone.0313425. eCollection 2024.
Dementia is characterized by a decline in memory and thinking that is significant enough to impair function in activities of daily living. Patients seen in dementia specialty clinics are highly heterogenous with a variety of different symptoms that progress at different rates. Recent research has focused on finding data-driven subtypes for revealing new insights into dementia's underlying heterogeneity, rather than assuming that the cohort is homogenous. However, current studies on dementia subtyping have the following limitations: (i) focusing on AD-related dementia only and not examining heterogeneity within dementia as a whole, (ii) using only cross-sectional baseline visit information for clustering and (iii) predominantly relying on expensive imaging biomarkers as features for clustering. In this study, we seek to overcome such limitations, using a data-driven unsupervised clustering algorithm named SillyPutty, in combination with hierarchical clustering on cognitive assessment scores to estimate subtypes within a real-world clinical dementia cohort. We use a longitudinal patient data set for our clustering analysis, instead of relying only on baseline visits, allowing us to explore the ongoing temporal relationship between subtypes and disease progression over time. Results showed that subtypes with very mild or mild dementia were more heterogenous in their cognitive profiles and risk of disease progression.
痴呆症的特征是记忆力和思维能力显著下降,以至于影响日常生活活动的功能。在痴呆症专科诊所就诊的患者高度异质,具有各种不同的症状,且进展速度不同。最近的研究集中在寻找数据驱动的亚型,以揭示痴呆症潜在异质性的新见解,而不是假设队列是同质的。然而,目前关于痴呆症亚型的研究存在以下局限性:(i)仅关注 AD 相关痴呆症,而不检查整个痴呆症的异质性,(ii)仅使用横断面基线访视信息进行聚类,以及(iii)主要依赖昂贵的成像生物标志物作为聚类的特征。在这项研究中,我们试图克服这些局限性,使用一种名为 SillyPutty 的数据驱动无监督聚类算法,结合认知评估分数的层次聚类,以估计真实临床痴呆队列中的亚型。我们使用纵向患者数据集进行聚类分析,而不是仅依赖基线访视,从而可以探索亚型与疾病进展之间随时间推移的持续时间关系。结果表明,非常轻度或轻度痴呆症的亚型在认知特征和疾病进展风险方面更加异质。