Kim Era, Oh Wonsuk, Pieczkiewicz David S, Castro M Regina, Caraballo Pedro J, Simon Gyorgy J
Institute for Health Informatics, University of Minnesota, Minneapolis, MN.
Mayo Clinic, Rochester, MN.
AMIA Annu Symp Proc. 2014 Nov 14;2014:1815-24. eCollection 2014.
Type 2 Diabetes Mellitus is a progressive disease with increased risk of developing serious complications. Identifying subpopulations and their relevant risk factors can contribute to the prevention and effective management of diabetes. We use a novel divisive hierarchical clustering technique to identify clinically interesting subpopulations in a large cohort of Olmsted County, MN residents. Our results show that our clustering algorithm successfully identified clinically interesting clusters consisting of patients with higher or lower risk of diabetes than the general population. The proposed algorithm offers fine control over the granularity of the clustering, has the ability to seamlessly discover and incorporate interactions among the risk factors, and can handle non-proportional hazards, as well. It has the potential to significantly impact clinical practice by recognizing patients with specific risk factors who may benefit from an alternative management approach potentially leading to the prevention of diabetes and its complications.
2型糖尿病是一种渐进性疾病,发生严重并发症的风险会增加。识别亚群及其相关风险因素有助于糖尿病的预防和有效管理。我们使用一种新颖的分裂层次聚类技术,在明尼苏达州奥姆斯特德县的一大群居民中识别具有临床意义的亚群。我们的结果表明,我们的聚类算法成功识别出了临床上有意义的聚类,这些聚类中的患者患糖尿病的风险高于或低于一般人群。所提出的算法对聚类的粒度有很好的控制能力,能够无缝发现并纳入风险因素之间的相互作用,并且还能处理非比例风险。它有可能通过识别具有特定风险因素的患者来显著影响临床实践,这些患者可能从替代管理方法中受益,这可能会预防糖尿病及其并发症。