Usher Institute, Edinburgh Medical School, The University of Edinburgh, Edinburgh, UK.
Usher Institute, Edinburgh Medical School, The University of Edinburgh, Edinburgh, UK
BMJ Open. 2020 Jul 23;10(7):e036099. doi: 10.1136/bmjopen-2019-036099.
Most asthma attacks and subsequent deaths are potentially preventable. We aim to develop a prognostic tool for identifying patients at high risk of asthma attacks in primary care by leveraging advances in machine learning.
Current prognostic tools use logistic regression to develop a risk scoring model for asthma attacks. We propose to build on this by systematically applying various well-known machine learning techniques to a large longitudinal deidentified primary care database, the Optimum Patient Care Research Database, and comparatively evaluate their performance with the existing logistic regression model and against each other. Machine learning algorithms vary in their predictive abilities based on the dataset and the approach to analysis employed. We will undertake feature selection, classification (both one-class and two-class classifiers) and performance evaluation. Patients who have had actively treated clinician-diagnosed asthma, aged 8-80 years and with 3 years of continuous data, from 2016 to 2018, will be selected. Risk factors will be obtained from the first year, while the next 2 years will form the outcome period, in which the primary endpoint will be the occurrence of an asthma attack.
We have obtained approval from OPCRD's Anonymous Data Ethics Protocols and Transparency (ADEPT) Committee. We will seek ethics approval from The University of Edinburgh's Research Ethics Group (UREG). We aim to present our findings at scientific conferences and in peer-reviewed journals.
大多数哮喘发作和随后的死亡都是可以预防的。我们旨在通过利用机器学习的进步,为初级保健中识别哮喘发作高危患者开发一种预测工具。
目前的预测工具使用逻辑回归来开发哮喘发作的风险评分模型。我们建议在此基础上,通过系统地将各种知名的机器学习技术应用于大型纵向去识别初级保健数据库——最佳患者护理研究数据库,并将其性能与现有的逻辑回归模型和彼此进行比较评估。机器学习算法根据数据集和所采用的分析方法在预测能力上存在差异。我们将进行特征选择、分类(单类和二类分类器)和性能评估。选择 2016 年至 2018 年期间,有经过积极治疗的临床医生诊断为哮喘、年龄在 8 至 80 岁之间且有 3 年连续数据的患者。风险因素将从第一年获得,而接下来的 2 年将构成结果期,主要终点将是哮喘发作的发生。
我们已经获得了 OPCRD 的匿名数据伦理协议和透明度 (ADEPT) 委员会的批准。我们将寻求爱丁堡大学研究伦理小组 (UREG) 的伦理批准。我们旨在在科学会议和同行评议期刊上展示我们的研究结果。