School of Pharmacy, University of Eastern Finland, Kuopio, Finland.
Kuopio Research Centre of Geriatric Care, University of Eastern Finland, Kuopio, Finland.
J Alzheimers Dis. 2024;101(4):1393-1403. doi: 10.3233/JAD-240490.
Multimorbidity is common in older adults and complicates diagnosing and care for this population.
We investigated co-occurrence patterns (clustering) of medical conditions in persons with Alzheimer's disease (AD) and their matched controls.
The register-based Medication use and Alzheimer's disease study (MEDALZ) includes 70,718 community-dwelling persons with incident AD diagnosed during 2005-2011 in Finland and a matched comparison cohort. Latent Dirichlet Allocation was used to cluster the comorbidities (ICD-10 diagnosis codes). Modeling was performed separately for AD and control cohorts. We experimented with different numbers of clusters (also known as topics in the field of Natural Language Processing) ranging from five to 20.
In both cohorts, 17 of the 20 most frequent diagnoses were the same. Based on a qualitative assessment by medical experts, the cluster patterns were not affected by the number of clusters, but the best interpretability was observed in the 10-cluster model. Quantitative assessment of the optimal number of clusters by log-likelihood estimate did not imply a specific optimal number of clusters. Multidimensional scaling visualized the variability in cluster size and (dis)similarity between the clusters with more overlapping of clusters and variation in group size seen in the AD cohort.
Early signs and symptoms of AD were more commonly clustered together in the AD cohort than in the comparison cohort. This study experimented with using natural language processing techniques for clustering patterns from an epidemiological study. From the computed clusters, it was possible to qualitatively identify multimorbidity that differentiates AD cases and controls.
多种疾病在老年人中很常见,这使得对该人群的诊断和治疗变得复杂。
我们研究了阿尔茨海默病(AD)患者及其匹配对照者的合并疾病(共病)的共现模式(聚类)。
基于登记的药物使用和阿尔茨海默病研究(MEDALZ)包括 70718 名在芬兰于 2005-2011 年间确诊为 AD 的居住在社区的患者和一个匹配的对照组。使用潜在狄利克雷分配(Latent Dirichlet Allocation)对合并症(ICD-10 诊断代码)进行聚类。分别对 AD 和对照组进行建模。我们尝试了不同数量的聚类(在自然语言处理领域也称为主题),从 5 个到 20 个不等。
在两个队列中,20 个最常见的诊断中有 17 个是相同的。基于医学专家的定性评估,聚类模式不受聚类数量的影响,但在 10 聚类模型中观察到最佳的可解释性。通过对数似然估计对最佳聚类数量的定量评估并不意味着存在特定的最佳聚类数量。多维尺度可视化了聚类大小的可变性和聚类之间的(不)相似性,AD 队列中的聚类重叠更多,组大小变化更大。
AD 的早期迹象和症状在 AD 队列中比在对照组中更常聚类在一起。本研究尝试使用自然语言处理技术对来自流行病学研究的聚类模式进行聚类。从计算出的聚类中,可以定性地识别出区分 AD 病例和对照组的多种合并症。