Respiratory Institute, Cleveland Clinic, Cleveland, OH; Lerner Research Institute, Cleveland Clinic, Cleveland, OH.
Respiratory Institute, Cleveland Clinic, Cleveland, OH.
Chest. 2021 May;159(5):1747-1757. doi: 10.1016/j.chest.2020.12.051. Epub 2021 Jan 10.
Asthma exacerbations result in significant health and economic burden, but are difficult to predict.
Can machine learning (ML) models with large-scale outpatient data predict asthma exacerbations?
We analyzed data extracted from electronic health records (EHRs) of asthma patients treated at the Cleveland Clinic from 2010 through 2018. Demographic information, comorbidities, laboratory values, and asthma medications were included as covariates. Three different models were built with logistic regression, random forests, and a gradient boosting decision tree to predict: (1) nonsevere asthma exacerbation requiring oral glucocorticoid burst, (2) ED visits, and (3) hospitalizations.
Of 60,302 patients, 19,772 (32.8%) had at least one nonsevere exacerbation requiring oral glucocorticoid burst, 1,748 (2.9%) requiring and ED visit and 902 (1.5%) requiring hospitalization. Nonsevere exacerbation, ED visit, and hospitalization were predicted best by light gradient boosting machine, an algorithm used in ML to fit predictive analytic models, and had an area under the receiver operating characteristic curve of 0.71 (95% CI, 0.70-0.72), 0.88 (95% CI, 0.86-0.89), and 0.85 (95% CI, 0.82-0.88), respectively. Risk factors for all three outcomes included age, long-acting β agonist, high-dose inhaled glucocorticoid, or chronic oral glucocorticoid therapy. In subgroup analysis of 9,448 patients with spirometry data, low FEV and FEV to FVC ratio were identified as top risk factors for asthma exacerbation, ED visits, and hospitalization. However, adding pulmonary function tests did not improve models' prediction performance.
Models built with an ML algorithm from real-world outpatient EHR data accurately predicted asthma exacerbation and can be incorporated into clinical decision tools to enhance outpatient care and to prevent adverse outcomes.
哮喘恶化会导致严重的健康和经济负担,但难以预测。
使用大规模门诊数据的机器学习(ML)模型能否预测哮喘恶化?
我们分析了克利夫兰诊所 2010 年至 2018 年期间接受治疗的哮喘患者的电子健康记录(EHR)中提取的数据。将人口统计学信息、合并症、实验室值和哮喘药物作为协变量。使用逻辑回归、随机森林和梯度提升决策树构建了三种不同的模型,以预测:(1)需要口服糖皮质激素冲击的非重度哮喘恶化,(2)急诊就诊和(3)住院治疗。
在 60302 名患者中,有 19772 名(32.8%)至少发生了一次需要口服糖皮质激素冲击的非重度恶化,有 1748 名(2.9%)需要急诊就诊,有 902 名(1.5%)需要住院治疗。轻度梯度提升机对非重度恶化、急诊就诊和住院治疗的预测效果最好,这是一种用于 ML 的算法,用于拟合预测分析模型,其接受者操作特征曲线下面积分别为 0.71(95%CI,0.70-0.72)、0.88(95%CI,0.86-0.89)和 0.85(95%CI,0.82-0.88)。所有三种结果的危险因素包括年龄、长效β激动剂、高剂量吸入糖皮质激素或慢性口服糖皮质激素治疗。在 9448 名有肺功能数据的患者亚组分析中,发现低 FEV 和 FEV/FVC 比值是哮喘恶化、急诊就诊和住院治疗的首要危险因素。然而,添加肺功能测试并没有提高模型的预测性能。
使用真实世界门诊 EHR 数据构建的 ML 算法模型可以准确预测哮喘恶化,并可纳入临床决策工具,以加强门诊护理并预防不良后果。