Hendricks Renee M, Khasawneh Mohammad T
Department of Systems Science and Industrial Engineering, Binghamton University, Binghamton, NY 13902, USA.
Aging Dis. 2021 Oct 1;12(7):1567-1586. doi: 10.14336/AD.2021.0519. eCollection 2021 Oct.
One way to understand the Parkinson's disease (PD) population is to investigate the similarities and differences among patients through cluster analysis, which may lead to defined, patient subgroups for diagnosis, progression tracking and treatment planning. This paper provides a systematic review of PD patient clustering research, evaluating the variables included in clustering, the cluster methods applied, the resulting patient subgroups, and evaluation metrics. A search was conducted from 1999 to 2021 on the PubMed database, using various search terms including: Parkinson's disease, cluster, and analysis. The majority of studies included a variety of clinical scale scores for clustering, of which many provide a numerical, but ordinal, categorical value. Even though the scale scores are ordinal, these were treated as numerical values with numerical and continuous values being the focus of the clustering, with limited attention to categorical variables, such as gender and family history, which may also provide useful insights into disease diagnosis, progression, and treatment. The results pointed to two to five patient clusters, with similarities among the age of onset and disease duration. The studies lacked the use of existing clustering evaluation metrics which points to a need for a thorough, analysis framework, and consensus on the appropriate variables to include in cluster analysis. Accurate cluster analysis may assist with determining if PD patients' symptoms can be treated based on a subgroup of features, if personalized care is required, or if a mix of individualized and group-based care is the best approach.
了解帕金森病(PD)患者群体的一种方法是通过聚类分析来研究患者之间的异同,这可能会形成明确的患者亚组,用于诊断、病情跟踪和治疗规划。本文对PD患者聚类研究进行了系统综述,评估了聚类中包含的变量、应用的聚类方法、产生的患者亚组以及评估指标。在1999年至2021年期间,使用包括“帕金森病”“聚类”和“分析”等各种检索词,在PubMed数据库中进行了检索。大多数研究纳入了各种临床量表评分进行聚类,其中许多评分提供的是数值,但为有序分类值。尽管量表评分是有序的,但这些评分被视为数值,数值和连续值成为聚类的重点,而对分类变量(如性别和家族史)的关注有限,而这些变量可能也能为疾病诊断、病情进展和治疗提供有用的见解。结果显示有两到五个患者聚类,在发病年龄和病程方面存在相似性。这些研究缺乏对现有聚类评估指标的使用,这表明需要一个全面的分析框架,并就聚类分析中应纳入的适当变量达成共识。准确的聚类分析可能有助于确定是否可以基于一组特征对PD患者的症状进行治疗、是否需要个性化护理,或者个性化护理与基于群体的护理相结合是否是最佳方法。