Tibble Holly, Sheikh Aziz, Tsanas Athanasios
Usher Institute, The University of Edinburgh, Edinburgh, UK.
Asthma UK Centre for Applied Research, Edinburgh, UK.
NPJ Prim Care Respir Med. 2025 Apr 23;35(1):24. doi: 10.1038/s41533-025-00428-8.
Primary care consultations provide an opportunity for patients and clinicians to assess asthma attack risk. Using a data-driven risk prediction tool with routinely collected health records may be an efficient way to aid promotion of effective self-management, and support clinical decision making. Longitudinal Scottish primary care data for 21,250 asthma patients were used to predict the risk of asthma attacks in the following year. A selection of machine learning algorithms (i.e., Naïve Bayes Classifier, Logistic Regression, Random Forests, and Extreme Gradient Boosting), hyperparameters, training data enrichment methods were explored, and validated in a random unseen data partition. Our final Logistic Regression model achieved the best performance when no training data enrichment was applied. Around 1 in 3 (36.2%) predicted high-risk patients had an attack within one year of consultation, compared to approximately 1 in 16 in the predicted low-risk group (6.7%). The model was well calibrated, with a calibration slope of 1.02 and an intercept of 0.004, and the Area under the Curve was 0.75. This model has the potential to increase the efficiency of routine asthma care by creating new personalized care pathways mapped to predicted risk of asthma attacks, such as priority ranking patients for scheduled consultations and interventions. Furthermore, it could be used to educate patients about their individual risk and risk factors, and promote healthier lifestyle changes, use of self-management plans, and early emergency care seeking following rapid symptom deterioration.
基层医疗咨询为患者和临床医生评估哮喘发作风险提供了机会。使用基于常规收集的健康记录的数据驱动风险预测工具,可能是促进有效自我管理和支持临床决策的有效方法。利用来自21250名哮喘患者的苏格兰纵向基层医疗数据来预测下一年哮喘发作的风险。我们探索了一系列机器学习算法(即朴素贝叶斯分类器、逻辑回归、随机森林和极端梯度提升)、超参数、训练数据增强方法,并在一个随机的未见数据分区中进行了验证。在未应用训练数据增强时,我们最终的逻辑回归模型表现最佳。在预测为高风险的患者中,约三分之一(36.2%)在咨询后一年内发作,而在预测为低风险的组中,这一比例约为十六分之一(6.7%)。该模型校准良好,校准斜率为1.02,截距为0.004,曲线下面积为0.75。通过创建与预测的哮喘发作风险相匹配的新的个性化护理路径,如为预约咨询和干预对患者进行优先级排序,该模型有可能提高常规哮喘护理的效率。此外,它可用于告知患者其个人风险和风险因素,促进更健康的生活方式改变、自我管理计划的使用,以及在症状迅速恶化后尽早寻求急救护理。