• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于常规健康检查数据的基于机器学习的高血压事件预测:在韩国和日本的 2 个独立全国队列中的推导和验证。

Machine Learning-Based Prediction for Incident Hypertension Based on Regular Health Checkup Data: Derivation and Validation in 2 Independent Nationwide Cohorts in South Korea and Japan.

机构信息

Department of Biomedical Engineering, Kyung Hee University, Yongin, Republic of Korea.

Center for Digital Health, Medical Science Research Institute, Kyung Hee University College of Medicine, Seoul, Republic of Korea.

出版信息

J Med Internet Res. 2024 Nov 5;26:e52794. doi: 10.2196/52794.

DOI:10.2196/52794
PMID:39499554
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11576616/
Abstract

BACKGROUND

Worldwide, cardiovascular diseases are the primary cause of death, with hypertension as a key contributor. In 2019, cardiovascular diseases led to 17.9 million deaths, predicted to reach 23 million by 2030.

OBJECTIVE

This study presents a new method to predict hypertension using demographic data, using 6 machine learning models for enhanced reliability and applicability. The goal is to harness artificial intelligence for early and accurate hypertension diagnosis across diverse populations.

METHODS

Data from 2 national cohort studies, National Health Insurance Service-National Sample Cohort (South Korea, n=244,814), conducted between 2002 and 2013 were used to train and test machine learning models designed to anticipate incident hypertension within 5 years of a health checkup involving those aged ≥20 years, and Japanese Medical Data Center cohort (Japan, n=1,296,649) were used for extra validation. An ensemble from 6 diverse machine learning models was used to identify the 5 most salient features contributing to hypertension by presenting a feature importance analysis to confirm the contribution of each future.

RESULTS

The Adaptive Boosting and logistic regression ensemble showed superior balanced accuracy (0.812, sensitivity 0.806, specificity 0.818, and area under the receiver operating characteristic curve 0.901). The 5 key hypertension indicators were age, diastolic blood pressure, BMI, systolic blood pressure, and fasting blood glucose. The Japanese Medical Data Center cohort dataset (extra validation set) corroborated these findings (balanced accuracy 0.741 and area under the receiver operating characteristic curve 0.824). The ensemble model was integrated into a public web portal for predicting hypertension onset based on health checkup data.

CONCLUSIONS

Comparative evaluation of our machine learning models against classical statistical models across 2 distinct studies emphasized the former's enhanced stability, generalizability, and reproducibility in predicting hypertension onset.

摘要

背景

在全球范围内,心血管疾病是主要的死亡原因,高血压是一个关键因素。2019 年,心血管疾病导致 1790 万人死亡,预计到 2030 年将达到 2300 万。

目的

本研究提出了一种使用人口统计学数据预测高血压的新方法,使用 6 种机器学习模型以提高可靠性和适用性。目标是利用人工智能对不同人群进行早期和准确的高血压诊断。

方法

使用来自 2 项全国队列研究的数据,即 2002 年至 2013 年进行的国家健康保险服务-国家样本队列(韩国,n=244814)和日本医疗数据中心队列(日本,n=1296649),对设计用于预测健康检查后 5 年内发生高血压的机器学习模型进行训练和测试,这些健康检查涉及年龄≥20 岁的人群。使用 6 种不同的机器学习模型的集成来识别导致高血压的 5 个最重要的特征,通过特征重要性分析来确认每个特征的贡献。

结果

自适应提升和逻辑回归集成模型表现出较高的平衡准确性(0.812、敏感性 0.806、特异性 0.818 和受试者工作特征曲线下面积 0.901)。5 个关键的高血压指标是年龄、舒张压、BMI、收缩压和空腹血糖。日本医疗数据中心队列数据集(额外的验证集)证实了这些发现(平衡准确性为 0.741,受试者工作特征曲线下面积为 0.824)。该集成模型已集成到一个公共网络门户中,用于根据健康检查数据预测高血压的发病。

结论

对我们的机器学习模型与 2 项不同研究中的经典统计模型的比较评估强调了前者在预测高血压发病方面的增强稳定性、通用性和可重复性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57d3/11576616/f1036d8cd972/jmir_v26i1e52794_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57d3/11576616/f1036d8cd972/jmir_v26i1e52794_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57d3/11576616/f1036d8cd972/jmir_v26i1e52794_fig1.jpg

相似文献

1
Machine Learning-Based Prediction for Incident Hypertension Based on Regular Health Checkup Data: Derivation and Validation in 2 Independent Nationwide Cohorts in South Korea and Japan.基于常规健康检查数据的基于机器学习的高血压事件预测:在韩国和日本的 2 个独立全国队列中的推导和验证。
J Med Internet Res. 2024 Nov 5;26:e52794. doi: 10.2196/52794.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
5
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
6
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.使用LightGBM预测非糖尿病人群的胰岛素抵抗及其临床价值的队列验证:横断面和回顾性队列研究
JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238.
7
Machine Learning-Based Analysis of Lifestyle Risk Factors for Atherosclerotic Cardiovascular Disease: Retrospective Case-Control Study.基于机器学习的动脉粥样硬化性心血管疾病生活方式风险因素分析:回顾性病例对照研究
JMIR Med Inform. 2025 Aug 7;13:e74415. doi: 10.2196/74415.
8
Development and Validation of a Robust and Interpretable Early Triaging Support System for Patients Hospitalized With COVID-19: Predictive Algorithm Modeling and Interpretation Study.开发和验证用于 COVID-19 住院患者的强大且可解释的早期分诊支持系统:预测算法建模和解释研究。
J Med Internet Res. 2024 Jan 11;26:e52134. doi: 10.2196/52134.
9
Machine learning-based model for predicting all-cause mortality in severe pneumonia.基于机器学习的重症肺炎全因死亡率预测模型。
BMJ Open Respir Res. 2025 Mar 22;12(1):e001983. doi: 10.1136/bmjresp-2023-001983.
10
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.

引用本文的文献

1
Development of an early prediction model for risk of influenza A and influenza B based on complete blood count examination.基于全血细胞计数检查的甲型和乙型流感风险早期预测模型的开发。
BMC Infect Dis. 2025 Sep 1;25(1):1088. doi: 10.1186/s12879-025-11502-4.
2
Artificial intelligence models predicting abnormal uterine bleeding after COVID-19 vaccination.预测新冠病毒疫苗接种后异常子宫出血的人工智能模型。
Sci Rep. 2025 Feb 27;15(1):7081. doi: 10.1038/s41598-025-91882-4.
3
Machine Learning-Based Prediction of Substance Use in Adolescents in Three Independent Worldwide Cohorts: Algorithm Development and Validation Study.

本文引用的文献

1
Prevalence, Awareness, Treatment, and Control of Type 2 Diabetes in South Korea (1998 to 2022): Nationwide Cross-Sectional Study.韩国 2 型糖尿病的患病率、知晓率、治疗率和控制率(1998 年至 2022 年):全国性横断面研究。
JMIR Public Health Surveill. 2024 Aug 27;10:e59571. doi: 10.2196/59571.
2
Resilient Artificial Intelligence in Health: Synthesis and Research Agenda Toward Next-Generation Trustworthy Clinical Decision Support.健康领域的弹性人工智能:迈向下一代值得信赖的临床决策支持的综合与研究议程。
J Med Internet Res. 2024 Jun 28;26:e50295. doi: 10.2196/50295.
3
Short- and long-term neuropsychiatric outcomes in long COVID in South Korea and Japan.
基于机器学习对全球三个独立队列中青少年物质使用情况的预测:算法开发与验证研究
J Med Internet Res. 2025 Feb 24;27:e62805. doi: 10.2196/62805.
韩国和日本长新冠的短期和长期神经精神结局。
Nat Hum Behav. 2024 Aug;8(8):1530-1544. doi: 10.1038/s41562-024-01895-8. Epub 2024 Jun 25.
4
Pitfalls in Developing Machine Learning Models for Predicting Cardiovascular Diseases: Challenge and Solutions.机器学习模型在预测心血管疾病中的陷阱:挑战与解决方案。
J Med Internet Res. 2024 Jul 26;26:e47645. doi: 10.2196/47645.
5
Acute and post-acute respiratory complications of SARS-CoV-2 infection: population-based cohort study in South Korea and Japan.SARS-CoV-2 感染的急性和后期呼吸道并发症:韩国和日本基于人群的队列研究。
Nat Commun. 2024 May 27;15(1):4499. doi: 10.1038/s41467-024-48825-w.
6
Machine learning to predict the occurrence of thyroid nodules: towards a quantitative approach for judicious utilization of thyroid ultrasonography.机器学习预测甲状腺结节的发生:为甲状腺超声检查的合理应用提供定量方法。
Front Endocrinol (Lausanne). 2024 May 7;15:1385836. doi: 10.3389/fendo.2024.1385836. eCollection 2024.
7
Machine Learning-Based Prediction of Suicidal Thinking in Adolescents by Derivation and Validation in 3 Independent Worldwide Cohorts: Algorithm Development and Validation Study.基于机器学习的青少年自杀思维预测:在 3 个独立的全球队列中的推导和验证:算法开发和验证研究。
J Med Internet Res. 2024 May 17;26:e55913. doi: 10.2196/55913.
8
Evaluation metrics and statistical tests for machine learning.机器学习的评估指标和统计检验。
Sci Rep. 2024 Mar 13;14(1):6086. doi: 10.1038/s41598-024-56706-x.
9
Machine Learning-Based Prediction of Suicidality in Adolescents With Allergic Rhinitis: Derivation and Validation in 2 Independent Nationwide Cohorts.基于机器学习的青少年变应性鼻炎自杀倾向预测:在 2 个独立的全国性队列中的推导和验证。
J Med Internet Res. 2024 Feb 14;26:e51473. doi: 10.2196/51473.
10
Trends in hypertension prevalence, awareness, treatment, and control in South Korea, 1998-2021: a nationally representative serial study.韩国 1998-2021 年高血压患病率、知晓率、治疗率和控制率的趋势:一项全国代表性的连续研究。
Sci Rep. 2023 Dec 8;13(1):21724. doi: 10.1038/s41598-023-49055-8.