• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用非实验室风险因素和机器学习筛查高血压:印度尼西亚的一项回顾性横断面研究。

Screening hypertension using non-laboratory risk factors with machine learning: a retrospective cross-sectional study in Indonesia.

作者信息

Estiko Reza Ishak, Widyantoro Bambang, Juzar Dafsah Arifa, Yusup Ramdhan Maulana, Rakhmat Iqbal Fauzi, Rijanto Estiko

机构信息

Prof. Dr. Margono Soekarjo General Hospital, Purwokerto, Central Java, Indonesia.

Department of Cardiology and Vascular Medicine, Faculty of Medicine, Universitas Indonesia/National Cardiovascular Center Harapan Kita, Jakarta, Special Capital Region of Jakarta, Indonesia.

出版信息

BMJ Open. 2025 Aug 27;15(8):e092364. doi: 10.1136/bmjopen-2024-092364.

DOI:10.1136/bmjopen-2024-092364
PMID:40866068
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12406909/
Abstract

OBJECTIVE

This study aimed to screen for hypertension in a vast Indonesian population using machine learning (ML) and 11 non-laboratory risk factors, validating the results through internal and external validations.

SETTING AND PARTICIPANTS

From the initial 1 782 365 participants aged 15 and above registered at the Integrated Counseling Post primary care centres across Indonesia from 2014 to 2017, incomplete data and outliers were excluded, and 268 210 participants were included in our analysis. The dataset was split deterministically into a dataset for training using 10-fold internal cross-validation of 204 315 participants and another dataset for external validation of 63 895 participants.

DESIGN

This retrospective cross-sectional study used three ML algorithms, that is, random forest, gradient boosting and extreme gradient boosting (XGBoost), and compared them against logistic regression as a benchmark to screen hypertension based on the WHO and International Society of Hypertension criteria. The importance of the risk factors was ranked. By partly using continuous versus categorical age, waist circumference (WC) and body mass index (BMI) risk factors, we evaluated the screening performance regarding sensitivity and area under the receiver operating characteristic curve (AUC).

RESULTS

The external validations revealed that the XGBoost model performed the best in hypertension screening. The external validation, which partly uses continuous variables, provides 0.97 sensitivity and 0.75 AUC, indicating excellent screening capability. The importance rank of the risk factors was consecutively family history of hypertension (FH-HTN), age, WC, BMI, occupation, education, sex, smoking, low physical activity, lack of fruit or vegetable intake and alcohol consumption.

CONCLUSIONS

By using 11 easy-to-collect non-laboratory risk factors, the ML model successfully screens for hypertension with better performance than the benchmark. Using the numerical variables of age, WC and BMI yields a better discrimination capability than the categorical variables. FH-HTN and age are the two top risk factors for the development of hypertension. This study is a useful academic exercise and shows ML's importance in handling large data sets.

摘要

目的

本研究旨在利用机器学习(ML)和11个非实验室风险因素在广大印度尼西亚人群中筛查高血压,并通过内部和外部验证来验证结果。

设置与参与者

在2014年至2017年期间于印度尼西亚各地综合咨询初级保健中心登记的1782365名15岁及以上的初始参与者中,排除了不完整数据和异常值,268210名参与者纳入我们的分析。数据集被确定性地分为用于204315名参与者的10倍内部交叉验证训练的数据集和用于63895名参与者外部验证的另一个数据集。

设计

这项回顾性横断面研究使用了三种ML算法,即随机森林、梯度提升和极端梯度提升(XGBoost),并将它们与逻辑回归作为基准进行比较,以根据世界卫生组织和国际高血压学会标准筛查高血压。对风险因素的重要性进行了排名。通过部分使用连续与分类的年龄、腰围(WC)和体重指数(BMI)风险因素,我们评估了筛查在敏感性和受试者工作特征曲线下面积(AUC)方面的性能。

结果

外部验证显示,XGBoost模型在高血压筛查中表现最佳。部分使用连续变量的外部验证提供了0.97的敏感性和0.75的AUC,表明具有出色的筛查能力。风险因素的重要性排名依次为高血压家族史(FH-HTN)、年龄、WC、BMI、职业、教育程度、性别、吸烟、低体力活动、缺乏水果或蔬菜摄入以及饮酒。

结论

通过使用11个易于收集的非实验室风险因素,ML模型成功筛查高血压,其性能优于基准。使用年龄、WC和BMI的数值变量比分类变量具有更好的辨别能力。FH-HTN和年龄是高血压发生的两个首要风险因素。本研究是一项有用的学术实践,显示了ML在处理大数据集方面的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/8f2b7ad6e767/bmjopen-15-8-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/305bfa0a9bcc/bmjopen-15-8-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/17c6ea640425/bmjopen-15-8-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/363befc2e5f3/bmjopen-15-8-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/8f2b7ad6e767/bmjopen-15-8-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/305bfa0a9bcc/bmjopen-15-8-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/17c6ea640425/bmjopen-15-8-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/363befc2e5f3/bmjopen-15-8-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3461/12406909/8f2b7ad6e767/bmjopen-15-8-g004.jpg

相似文献

1
Screening hypertension using non-laboratory risk factors with machine learning: a retrospective cross-sectional study in Indonesia.利用非实验室风险因素和机器学习筛查高血压:印度尼西亚的一项回顾性横断面研究。
BMJ Open. 2025 Aug 27;15(8):e092364. doi: 10.1136/bmjopen-2024-092364.
2
Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.基于机器学习的算法用于预测胫骨平台骨折治疗后2年和5年全膝关节置换风险的研究进展
Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.
3
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.使用LightGBM预测非糖尿病人群的胰岛素抵抗及其临床价值的队列验证:横断面和回顾性队列研究
JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
6
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
7
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
8
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
9
Development and validation of a machine learning model based on laboratory parameters for preoperative prediction of Ki-67 expression in gliomas.基于实验室参数的机器学习模型用于术前预测胶质瘤中Ki-67表达的开发与验证
J Neurosurg. 2025 Mar 28:1-13. doi: 10.3171/2024.11.JNS241673.
10
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.

本文引用的文献

1
Health Transformation in Indonesia Through Health Digitalization Strengthening: A Perspective From Occupational Health.印度尼西亚通过健康数字化强化实现卫生转型:职业健康视角。
J UOEH. 2024;46(1):73-77. doi: 10.7888/juoeh.46.73.
2
Age and vascular aging: an unexplored frontier.年龄与血管衰老:一个未被探索的领域。
Front Cardiovasc Med. 2023 Nov 9;10:1278795. doi: 10.3389/fcvm.2023.1278795. eCollection 2023.
3
Risk prediction model of self-reported hypertension for telemedicine based on the sociodemographic, occupational and health-related characteristics of seafarers: a cross-sectional epidemiological study.
基于海员社会人口学、职业和与健康相关特征的自我报告高血压远程医疗风险预测模型:一项横断面流行病学研究。
BMJ Open. 2023 Oct 4;13(10):e070146. doi: 10.1136/bmjopen-2022-070146.
4
Leakage and the reproducibility crisis in machine-learning-based science.基于机器学习的科学中的漏洞与可重复性危机。
Patterns (N Y). 2023 Aug 4;4(9):100804. doi: 10.1016/j.patter.2023.100804. eCollection 2023 Sep 8.
5
Predicting the risk of hypertension using machine learning algorithms: A cross sectional study in Ethiopia.使用机器学习算法预测高血压风险:在埃塞俄比亚的一项横断面研究。
PLoS One. 2023 Aug 24;18(8):e0289613. doi: 10.1371/journal.pone.0289613. eCollection 2023.
6
The relationship between the intake of fruits, vegetables, and dairy products with hypertension: findings from the STEPS study.水果、蔬菜和乳制品摄入量与高血压之间的关系:STEPS研究结果
BMC Nutr. 2023 Aug 17;9(1):99. doi: 10.1186/s40795-023-00756-3.
7
Does the Disparity Patterning Differ between Diagnosed and Undiagnosed Hypertension among Adults? Evidence from Indonesia.成年人中已诊断和未诊断高血压之间的差异模式是否不同?来自印度尼西亚的证据。
Healthcare (Basel). 2023 Mar 10;11(6):816. doi: 10.3390/healthcare11060816.
8
Evaluation of a decided sample size in machine learning applications.机器学习应用中确定样本量的评估。
BMC Bioinformatics. 2023 Feb 14;24(1):48. doi: 10.1186/s12859-023-05156-9.
9
A personal history of research on hypertension From an encounter with hypertension to the development of hypertension practice based on out-of-clinic blood pressure measurements.高血压研究的个人史 从偶然遇到高血压到基于诊室外血压测量的高血压实践发展。
Hypertens Res. 2022 Nov;45(11):1726-1742. doi: 10.1038/s41440-022-01011-1. Epub 2022 Sep 8.
10
Machine Learning for Hypertension Prediction: a Systematic Review.机器学习在高血压预测中的应用:系统评价。
Curr Hypertens Rep. 2022 Nov;24(11):523-533. doi: 10.1007/s11906-022-01212-6. Epub 2022 Jun 22.