• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用稀疏平衡支持向量机从电子健康记录中发现 2 型糖尿病。

Discovering the Type 2 Diabetes in Electronic Health Records Using the Sparse Balanced Support Vector Machine.

出版信息

IEEE J Biomed Health Inform. 2020 Jan;24(1):235-246. doi: 10.1109/JBHI.2019.2899218. Epub 2019 Feb 13.

DOI:10.1109/JBHI.2019.2899218
PMID:30762572
Abstract

The diagnosis of type 2 diabetes (T2D) at an early stage has a key role for an adequate T2D integrated management system and patient's follow-up. Recent years have witnessed an increasing amount of available electronic health record (EHR) data and machine learning (ML) techniques have been considerably evolving. However, managing and modeling this amount of information may lead to several challenges, such as overfitting, model interpretability, and computational cost. Starting from these motivations, we introduced an ML method called sparse balanced support vector machine (SB-SVM) for discovering T2D in a novel collected EHR dataset (named Federazione Italiana Medici di Medicina Generale dataset). In particular, among all the EHR features related to exemptions, examination, and drug prescriptions, we have selected only those collected before T2D diagnosis from an uniform age group of subjects. We demonstrated the reliability of the introduced approach with respect to other ML and deep learning approaches widely employed in the state-of-the-art for solving this task. Results evidence that the SB-SVM overcomes the other state-of-the-art competitors providing the best compromise between predictive performance and computation time. Additionally, the induced sparsity allows to increase the model interpretability, while implicitly managing high-dimensional data and the usual unbalanced class distribution.

摘要

在早期诊断 2 型糖尿病 (T2D) 对于完善的 T2D 综合管理系统和患者随访至关重要。近年来,电子健康记录 (EHR) 数据不断增加,机器学习 (ML) 技术也得到了极大的发展。然而,管理和建模如此大量的信息可能会带来一些挑战,例如过拟合、模型可解释性和计算成本。基于这些动机,我们引入了一种名为稀疏平衡支持向量机 (SB-SVM) 的 ML 方法,用于在一个新收集的 EHR 数据集 (称为意大利全科医生联合会数据集) 中发现 T2D。具体来说,在与豁免、检查和药物处方相关的所有 EHR 特征中,我们只选择了在 T2D 诊断前从一个统一年龄组的受试者中收集的特征。我们证明了所提出的方法相对于其他广泛应用于该任务的 ML 和深度学习方法的可靠性。结果表明,SB-SVM 克服了其他最先进的竞争对手,在预测性能和计算时间之间提供了最佳的折衷。此外,诱导稀疏性允许提高模型的可解释性,同时隐式管理高维数据和常见的不平衡类分布。

相似文献

1
Discovering the Type 2 Diabetes in Electronic Health Records Using the Sparse Balanced Support Vector Machine.利用稀疏平衡支持向量机从电子健康记录中发现 2 型糖尿病。
IEEE J Biomed Health Inform. 2020 Jan;24(1):235-246. doi: 10.1109/JBHI.2019.2899218. Epub 2019 Feb 13.
2
A machine learning-based framework to identify type 2 diabetes through electronic health records.一种基于机器学习的通过电子健康记录识别2型糖尿病的框架。
Int J Med Inform. 2017 Jan;97:120-127. doi: 10.1016/j.ijmedinf.2016.09.014. Epub 2016 Oct 1.
3
Early temporal prediction of Type 2 Diabetes Risk Condition from a General Practitioner Electronic Health Record: A Multiple Instance Boosting Approach.基于全科医生电子健康记录的2型糖尿病风险状况早期时间预测:一种多实例增强方法。
Artif Intell Med. 2020 May;105:101847. doi: 10.1016/j.artmed.2020.101847. Epub 2020 May 6.
4
Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes: An Application of Machine Learning Using Electronic Health Records.2型糖尿病进展预测模型的逆向工程与评估:基于电子健康记录的机器学习应用
J Diabetes Sci Technol. 2015 Dec 20;10(1):6-18. doi: 10.1177/1932296815620200.
5
Identifying people at risk of developing type 2 diabetes: A comparison of predictive analytics techniques and predictor variables.识别有患 2 型糖尿病风险的人群:预测分析技术和预测变量的比较。
Int J Med Inform. 2018 Nov;119:22-38. doi: 10.1016/j.ijmedinf.2018.08.008. Epub 2018 Aug 28.
6
Deep generative learning for automated EHR diagnosis of traditional Chinese medicine.基于深度学习的中医电子病历自动化诊断
Comput Methods Programs Biomed. 2019 Jun;174:17-23. doi: 10.1016/j.cmpb.2018.05.008. Epub 2018 May 4.
7
Risk prediction using natural language processing of electronic mental health records in an inpatient forensic psychiatry setting.利用电子心理健康记录的自然语言处理进行住院法医精神病学环境中的风险预测。
J Biomed Inform. 2018 Oct;86:49-58. doi: 10.1016/j.jbi.2018.08.007. Epub 2018 Aug 14.
8
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.电子健康记录表型分析改善了美国普通人群中2型糖尿病的检测和筛查:一项横断面、非选择性、回顾性研究。
J Biomed Inform. 2016 Apr;60:162-8. doi: 10.1016/j.jbi.2015.12.006. Epub 2015 Dec 17.
9
Single Nucleotide Polymorphism relevance learning with Random Forests for Type 2 diabetes risk prediction.基于随机森林的单核苷酸多态性相关性学习在 2 型糖尿病风险预测中的应用。
Artif Intell Med. 2018 Apr;85:43-49. doi: 10.1016/j.artmed.2017.09.005. Epub 2017 Sep 22.
10
Predicting long-term type 2 diabetes with support vector machine using oral glucose tolerance test.使用口服葡萄糖耐量试验的支持向量机预测长期 2 型糖尿病。
PLoS One. 2019 Dec 11;14(12):e0219636. doi: 10.1371/journal.pone.0219636. eCollection 2019.

引用本文的文献

1
Machine learning and artificial intelligence in type 2 diabetes prediction: a comprehensive 33-year bibliometric and literature analysis.机器学习与人工智能在2型糖尿病预测中的应用:一项为期33年的全面文献计量学与文献分析
Front Digit Health. 2025 Mar 27;7:1557467. doi: 10.3389/fdgth.2025.1557467. eCollection 2025.
2
Twenty-Five Years of Evolution and Hurdles in Electronic Health Records and Interoperability in Medical Research: Comprehensive Review.电子健康记录在医学研究中的25年发展历程与障碍及互操作性:全面综述
J Med Internet Res. 2025 Jan 9;27:e59024. doi: 10.2196/59024.
3
Classification of AO/OTA 31A/B femur fractures in X-ray images using YOLOv8 and advanced data augmentation techniques.
使用YOLOv8和先进的数据增强技术对X射线图像中的AO/OTA 31A/B型股骨骨折进行分类。
Bone Rep. 2024 Sep 16;22:101801. doi: 10.1016/j.bonr.2024.101801. eCollection 2024 Sep.
4
Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records.利用五年队列电子健康记录预测 2 型糖尿病新发病例的大型语言多模态模型。
Sci Rep. 2024 Sep 6;14(1):20774. doi: 10.1038/s41598-024-71020-2.
5
A machine learning approach using conditional normalizing flow to address extreme class imbalance problems in personal health records.一种使用条件归一化流来解决个人健康记录中极端类别不平衡问题的机器学习方法。
BioData Min. 2024 May 25;17(1):14. doi: 10.1186/s13040-024-00366-0.
6
Forecasting and what-if analysis of new positive COVID-19 cases during the first three waves in Italy.预测和意大利前三波 COVID-19 新增阳性病例的情景分析。
Med Biol Eng Comput. 2023 Aug;61(8):2051-2066. doi: 10.1007/s11517-023-02831-0. Epub 2023 Jun 14.
7
Machine learning using the extreme gradient boosting (XGBoost) algorithm predicts 5-day delta of SOFA score at ICU admission in COVID-19 patients.使用极端梯度提升(XGBoost)算法的机器学习可预测新冠肺炎患者入住重症监护病房(ICU)时序贯器官衰竭评估(SOFA)评分的5天变化值。
J Intensive Med. 2021 Oct 22;1(2):110-116. doi: 10.1016/j.jointm.2021.09.002. eCollection 2021 Oct.
8
Uncovering the potential of innovation ecosystems in the healthcare sector after the COVID-19 crisis.揭示 COVID-19 危机后医疗保健领域创新生态系统的潜力。
Health Policy. 2023 Jan;127:80-86. doi: 10.1016/j.healthpol.2022.12.001. Epub 2022 Dec 5.
9
Machine Learning for Predicting Hyperglycemic Cases Induced by PD-1/PD-L1 Inhibitors.基于机器学习预测 PD-1/PD-L1 抑制剂引起的高血糖病例
J Healthc Eng. 2022 Aug 19;2022:6278854. doi: 10.1155/2022/6278854. eCollection 2022.
10
Harnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World.利用智能互联健康的力量应对新冠疫情:物联网、人工智能、机器人技术和区块链打造更美好的世界。
IEEE Internet Things J. 2021 Apr 19;8(16):12826-12846. doi: 10.1109/JIOT.2021.3073904. eCollection 2021 Aug 15.