DiSPeA-University of Urbino Carlo Bo, Piazza della Repubblica 13, Urbino, 61029, Italy.
Istituto Auxologico Italiano IRCCS, Milan, Italy.
J Med Syst. 2022 Dec 29;47(1):1. doi: 10.1007/s10916-022-01900-5.
Many modifiable and non-modifiable risk factors have been associated with hypertension. However, current screening programs are still failing in identifying individuals at higher risk of hypertension. Given the major impact of high blood pressure on cardiovascular events and mortality, there is an urgent need to find new strategies to improve hypertension detection. We aimed to explore whether a machine learning (ML) algorithm can help identifying individuals predictors of hypertension. We analysed the data set generated by the questionnaires administered during the World Hypertension Day from 2015 to 2019. A total of 20206 individuals have been included for analysis. We tested five ML algorithms, exploiting different balancing techniques. Moreover, we computed the performance of the medical protocol currently adopted in the screening programs. Results show that a gain of sensitivity reflects in a loss of specificity, bringing to a scenario where there is not an algorithm and a configuration which properly outperforms against the others. However, Random Forest provides interesting performances (0.818 sensitivity - 0.629 specificity) compared with medical protocols (0.906 sensitivity - 0.230 specificity). Detection of hypertension at a population level still remains challenging and a machine learning approach could help in making screening programs more precise and cost effective, when based on accurate data collection. More studies are needed to identify new features to be acquired and to further improve the performances of ML models.
许多可改变和不可改变的危险因素与高血压有关。然而,目前的筛查计划仍然未能识别出高血压风险较高的个体。鉴于高血压对心血管事件和死亡率的重大影响,迫切需要寻找新的策略来改善高血压的检测。我们旨在探讨机器学习 (ML) 算法是否有助于识别高血压的个体预测因子。我们分析了 2015 年至 2019 年世界高血压日期间问卷调查生成的数据。共有 20206 人纳入分析。我们测试了五种利用不同平衡技术的 ML 算法。此外,我们还计算了目前在筛查计划中采用的医疗方案的性能。结果表明,敏感性的提高反映在特异性的降低,导致没有一种算法和配置能够比其他算法和配置更好地表现。然而,与医疗方案(0.906 敏感性-0.230 特异性)相比,随机森林提供了有趣的性能(0.818 敏感性-0.629 特异性)。在人群水平上检测高血压仍然具有挑战性,当基于准确的数据收集时,机器学习方法可以帮助使筛查计划更加精确和具有成本效益。需要进一步研究以确定要获取的新特征,并进一步提高 ML 模型的性能。