Xu Zhenxing, Wang Fei, Adekkanattu Prakash, Bose Budhaditya, Vekaria Veer, Brandt Pascal, Jiang Guoqian, Kiefer Richard C, Luo Yuan, Pacheco Jennifer A, Rasmussen Luke V, Xu Jie, Alexopoulos George, Pathak Jyotishman
Weill Cornell Medicine New York New York USA.
University of Washington Seattle Washington USA.
Learn Health Syst. 2020 Aug 3;4(4):e10241. doi: 10.1002/lrh2.10241. eCollection 2020 Oct.
To identify depression subphenotypes from Electronic Health Records (EHRs) using machine learning methods, and analyze their characteristics with respect to patient demographics, comorbidities, and medications.
Using EHRs from the INSIGHT Clinical Research Network (CRN) database, multiple machine learning (ML) algorithms were applied to analyze 11 275 patients with depression to discern depression subphenotypes with distinct characteristics.
Using the computational approaches, we derived three depression subphenotypes: Phenotype_A (n = 2791; 31.35%) included patients who were the oldest (mean (SD) age, 72.55 (14.93) years), had the most comorbidities, and took the most medications. The most common comorbidities in this cluster of patients were hyperlipidemia, hypertension, and diabetes. Phenotype_B (mean (SD) age, 68.44 (19.09) years) was the largest cluster (n = 4687; 52.65%), and included patients suffering from moderate loss of body function. Asthma, fibromyalgia, and Chronic Pain and Fatigue (CPF) were common comorbidities in this subphenotype. Phenotype_C (n = 1452; 16.31%) included patients who were younger (mean (SD) age, 63.47 (18.81) years), had the fewest comorbidities, and took fewer medications. Anxiety and tobacco use were common comorbidities in this subphenotype.
Computationally deriving depression subtypes can provide meaningful insights and improve understanding of depression as a heterogeneous disorder. Further investigation is needed to assess the utility of these derived phenotypes to inform clinical trial design and interpretation in routine patient care.
使用机器学习方法从电子健康记录(EHR)中识别抑郁症亚表型,并分析其在患者人口统计学、合并症和药物治疗方面的特征。
使用来自洞察临床研究网络(CRN)数据库的EHR,应用多种机器学习(ML)算法分析11275例抑郁症患者,以辨别具有不同特征的抑郁症亚表型。
使用计算方法,我们得出了三种抑郁症亚表型:表型_A(n = 2791;31.35%)包括年龄最大(平均(标准差)年龄,72.55(14.93)岁)、合并症最多且用药最多的患者。该组患者中最常见的合并症是高脂血症、高血压和糖尿病。表型_B(平均(标准差)年龄,68.44(19.09)岁)是最大的一组(n = 4687;52.65%),包括身体功能中度丧失的患者。哮喘、纤维肌痛和慢性疼痛与疲劳(CPF)是该亚表型中常见的合并症。表型_C(n = 1452;16.31%)包括年龄较小(平均(标准差)年龄,63.47(18.81)岁)、合并症最少且用药较少的患者。焦虑和吸烟是该亚表型中常见的合并症。
通过计算得出抑郁症亚型可以提供有意义的见解,并增进对抑郁症这种异质性疾病的理解。需要进一步研究以评估这些得出的表型在为常规患者护理中的临床试验设计和解释提供信息方面的效用。