• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于电子病历数据的高血压和高血脂预测可解释模型(DHDIP)

DHDIP: An interpretable model for hypertension and hyperlipidemia prediction based on EMR data.

机构信息

College of Big Data Statistics, Guizhou University of Finance and Economics, Guiyang 550025, PR China; College of Statistics and Data Science, Xinjiang University of Finance and Economics, Urumqi 830012, PR China.

College of Statistics and Data Science, Xinjiang University of Finance and Economics, Urumqi 830012, PR China.

出版信息

Comput Methods Programs Biomed. 2022 Nov;226:107088. doi: 10.1016/j.cmpb.2022.107088. Epub 2022 Aug 28.

DOI:10.1016/j.cmpb.2022.107088
PMID:36096022
Abstract

BACKGROUND AND OBJECTIVE

Traditional hypertension and hyperlipidemia prediction models suffer from uneven modeling data sources, small sample sizes, and a lack of uniform standards for the index system, resulting in the model failing to fulfill clinical applications. To address this issue, this work will offer DHDIP, an interpretable hypertension and hyperlipidemia prediction model based on EMR data.

METHODS

First, we will select massive high-dimensional, unstructured EMR data as a unified modeling data source, and propose a pre-processing algorithm for EMR data to solve the problem that EMR data cannot be directly processed by machine learning algorithms. Second, a variety of mainstream models such as XGBoost, CatBoost, and RandomForest are selected for modeling, and the best adaptation algorithms are identified by performance comparison. Finally, the SHAP framework was introduced into the DHDIP model, thus identifying the main factors contributing to hypertension and hyperlipidemia, effectively enhancing the interpretability of the model.

RESULTS

The DHDIP model's MSE value is 0.0285, and its LOSS value is 0.0054, both of which are better than previous studies.

CONCLUSION

The model balances performance and interpretability. Multi-objective learning allows for a more thorough analysis and prediction of the condition, which not only lowers the cost of disease prediction but also aids physicians in clinical diagnosis. In addition, the datasets and source code are available from this link: https://github.com/Xiaoyao-Jia/DHDIP.

摘要

背景与目的

传统的高血压和高血脂预测模型存在建模数据源不均衡、样本量小以及指标体系缺乏统一标准等问题,导致模型无法满足临床应用的需求。针对这一问题,本研究提出了基于电子病历数据的可解释高血压和高血脂预测模型 DHDIP。

方法

首先,我们将选择海量的高维、非结构化的电子病历数据作为统一的建模数据源,并提出一种电子病历数据的预处理算法,以解决电子病历数据无法直接被机器学习算法处理的问题。其次,我们选择了 XGBoost、CatBoost 和 RandomForest 等多种主流模型进行建模,并通过性能比较确定最佳的适应算法。最后,我们将 SHAP 框架引入 DHDIP 模型中,从而识别出导致高血压和高血脂的主要因素,有效提高了模型的可解释性。

结果

DHDIP 模型的 MSE 值为 0.0285,LOSS 值为 0.0054,均优于以往的研究。

结论

该模型在性能和可解释性之间取得了平衡。多目标学习可以更彻底地分析和预测病情,不仅降低了疾病预测的成本,还可以帮助医生进行临床诊断。此外,数据集和源代码可以从以下链接获取:https://github.com/Xiaoyao-Jia/DHDIP。

相似文献

1
DHDIP: An interpretable model for hypertension and hyperlipidemia prediction based on EMR data.基于电子病历数据的高血压和高血脂预测可解释模型(DHDIP)
Comput Methods Programs Biomed. 2022 Nov;226:107088. doi: 10.1016/j.cmpb.2022.107088. Epub 2022 Aug 28.
2
Predicting Mortality in Intensive Care Unit Patients With Heart Failure Using an Interpretable Machine Learning Model: Retrospective Cohort Study.利用可解释机器学习模型预测重症监护病房心力衰竭患者的死亡率:回顾性队列研究。
J Med Internet Res. 2022 Aug 9;24(8):e38082. doi: 10.2196/38082.
3
Interpretable machine learning for 28-day all-cause in-hospital mortality prediction in critically ill patients with heart failure combined with hypertension: A retrospective cohort study based on medical information mart for intensive care database-IV and eICU databases.用于预测心力衰竭合并高血压重症患者28天全因院内死亡率的可解释机器学习:一项基于重症监护医学信息集市数据库-IV和电子重症监护病房数据库的回顾性队列研究
Front Cardiovasc Med. 2022 Oct 12;9:994359. doi: 10.3389/fcvm.2022.994359. eCollection 2022.
4
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
5
Early Prediction of Clinical Response to Etanercept Treatment in Juvenile Idiopathic Arthritis Using Machine Learning.使用机器学习对幼年特发性关节炎患者依那西普治疗的临床反应进行早期预测
Front Pharmacol. 2020 Jul 31;11:1164. doi: 10.3389/fphar.2020.01164. eCollection 2020.
6
A Noninvasive, Economical, and Instant-Result Method to Diagnose and Monitor Type 2 Diabetes Using Pulse Wave: Case-Control Study.一种使用脉搏波诊断和监测 2 型糖尿病的无创、经济、即时结果方法:病例对照研究。
JMIR Mhealth Uhealth. 2019 Apr 23;7(4):e11959. doi: 10.2196/11959.
7
Survival prediction models: an introduction to discrete-time modeling.生存预测模型:离散时间建模简介。
BMC Med Res Methodol. 2022 Jul 26;22(1):207. doi: 10.1186/s12874-022-01679-6.
8
Predicting dementia with routine care EMR data.利用常规护理电子病历数据预测痴呆症。
Artif Intell Med. 2020 Jan;102:101771. doi: 10.1016/j.artmed.2019.101771. Epub 2019 Dec 5.
9
Prediction Model of Osteonecrosis of the Femoral Head After Femoral Neck Fracture: Machine Learning-Based Development and Validation Study.股骨颈骨折后股骨头坏死的预测模型:基于机器学习的开发与验证研究
JMIR Med Inform. 2021 Nov 19;9(11):e30079. doi: 10.2196/30079.
10
Construction and Evaluation of Robust Interpretation Models for Breast Cancer Metastasis Prediction.构建稳健的乳腺癌转移预测解释模型及其评估。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1344-1353. doi: 10.1109/TCBB.2021.3120673. Epub 2022 Jun 3.

引用本文的文献

1
Transforming Healthcare: The AI Revolution in the Comprehensive Care of Hypertension.变革医疗保健:高血压综合护理中的人工智能革命
Clin Pract. 2024 Jul 10;14(4):1357-1374. doi: 10.3390/clinpract14040109.
2
Plant disease prescription recommendation based on electronic medical records and sentence embedding retrieval.基于电子病历和句子嵌入检索的植物病害处方推荐
Plant Methods. 2023 Aug 26;19(1):91. doi: 10.1186/s13007-023-01070-6.