Flinders Digital Health Research Centre, College of Nursing & Health Sciences, Flinders University, Adelaide SA 5001, Australia.
Chifley Business School, Torrens University, Australia, Adelaide, SA 5000, Australia.
Int J Environ Res Public Health. 2021 Mar 19;18(6):3187. doi: 10.3390/ijerph18063187.
Effective cardiovascular disease (CVD) prevention relies on timely identification and intervention for individuals at risk. Conventional formula-based techniques have been demonstrated to over- or under-predict the risk of CVD in the Australian population. This study assessed the ability of machine learning models to predict CVD mortality risk in the Australian population and compare performance with the well-established Framingham model. Data is drawn from three Australian cohort studies: the North West Adelaide Health Study (NWAHS), the Australian Diabetes, Obesity, and Lifestyle study, and the Melbourne Collaborative Cohort Study (MCCS). Four machine learning models for predicting 15-year CVD mortality risk were developed and compared to the 2008 Framingham model. Machine learning models performed significantly better compared to the Framingham model when applied to the three Australian cohorts. Machine learning based models improved prediction by 2.7% to 5.2% across three Australian cohorts. In an aggregated cohort, machine learning models improved prediction by up to 5.1% (area-under-curve (AUC) 0.852, 95% CI 0.837-0.867). Net reclassification improvement (NRI) was up to 26% with machine learning models. Machine learning based models also showed improved performance when stratified by sex and diabetes status. Results suggest a potential for improving CVD risk prediction in the Australian population using machine learning models.
有效的心血管疾病(CVD)预防依赖于对高危个体进行及时的识别和干预。基于常规公式的技术已被证明在预测澳大利亚人群 CVD 风险时存在过高或过低的情况。本研究评估了机器学习模型在预测澳大利亚人群 CVD 死亡率风险方面的能力,并将其性能与成熟的弗雷明汉模型进行了比较。数据来自三个澳大利亚队列研究:西北阿德莱德健康研究(NWAHS)、澳大利亚糖尿病、肥胖和生活方式研究以及墨尔本合作队列研究(MCCS)。开发了四个用于预测 15 年 CVD 死亡率风险的机器学习模型,并与 2008 年的弗雷明汉模型进行了比较。与弗雷明汉模型相比,当应用于三个澳大利亚队列时,机器学习模型的表现明显更好。基于机器学习的模型在三个澳大利亚队列中提高了 2.7%至 5.2%的预测精度。在一个综合队列中,机器学习模型将预测精度提高了 5.1%(曲线下面积(AUC)为 0.852,95%CI 为 0.837-0.867)。基于机器学习的模型的净重新分类改善(NRI)高达 26%。当按性别和糖尿病状况分层时,基于机器学习的模型也显示出了更好的性能。结果表明,机器学习模型有可能改善澳大利亚人群的 CVD 风险预测。