• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

2型糖尿病进展预测模型的逆向工程与评估:基于电子健康记录的机器学习应用

Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes: An Application of Machine Learning Using Electronic Health Records.

作者信息

Anderson Jeffrey P, Parikh Jignesh R, Shenfeld Daniel K, Ivanov Vladimir, Marks Casey, Church Bruce W, Laramie Jason M, Mardekian Jack, Piper Beth Anne, Willke Richard J, Rublee Dale A

机构信息

GNS Healthcare, Cambridge, MA, USA

GNS Healthcare, Cambridge, MA, USA.

出版信息

J Diabetes Sci Technol. 2015 Dec 20;10(1):6-18. doi: 10.1177/1932296815620200.

DOI:10.1177/1932296815620200
PMID:26685993
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4738229/
Abstract

BACKGROUND

Application of novel machine learning approaches to electronic health record (EHR) data could provide valuable insights into disease processes. We utilized this approach to build predictive models for progression to prediabetes and type 2 diabetes (T2D).

METHODS

Using a novel analytical platform (Reverse Engineering and Forward Simulation [REFS]), we built prediction model ensembles for progression to prediabetes or T2D from an aggregated EHR data sample. REFS relies on a Bayesian scoring algorithm to explore a wide model space, and outputs a distribution of risk estimates from an ensemble of prediction models. We retrospectively followed 24 331 adults for transitions to prediabetes or T2D, 2007-2012. Accuracy of prediction models was assessed using an area under the curve (AUC) statistic, and validated in an independent data set.

RESULTS

Our primary ensemble of models accurately predicted progression to T2D (AUC = 0.76), and was validated out of sample (AUC = 0.78). Models of progression to T2D consisted primarily of established risk factors (blood glucose, blood pressure, triglycerides, hypertension, lipid disorders, socioeconomic factors), whereas models of progression to prediabetes included novel factors (high-density lipoprotein, alanine aminotransferase, C-reactive protein, body temperature; AUC = 0.70).

CONCLUSIONS

We constructed accurate prediction models from EHR data using a hypothesis-free machine learning approach. Identification of established risk factors for T2D serves as proof of concept for this analytical approach, while novel factors selected by REFS represent emerging areas of T2D research. This methodology has potentially valuable downstream applications to personalized medicine and clinical research.

摘要

背景

将新型机器学习方法应用于电子健康记录(EHR)数据可为疾病进程提供有价值的见解。我们利用这种方法构建了预测模型,以预测糖尿病前期和2型糖尿病(T2D)的进展情况。

方法

我们使用一个新型分析平台(逆向工程与正向模拟[REFS]),从汇总的EHR数据样本中构建了糖尿病前期或T2D进展的预测模型集成。REFS依靠贝叶斯评分算法来探索广阔的模型空间,并从预测模型集成中输出风险估计分布。我们对24331名成年人进行了回顾性随访,观察他们在2007年至2012年间向糖尿病前期或T2D的转变情况。使用曲线下面积(AUC)统计量评估预测模型的准确性,并在独立数据集中进行验证。

结果

我们的主要模型集成准确预测了T2D的进展(AUC = 0.76),并在样本外得到验证(AUC = 0.78)。T2D进展模型主要由既定风险因素(血糖、血压、甘油三酯、高血压、脂质紊乱、社会经济因素)组成,而糖尿病前期进展模型则包括新因素(高密度脂蛋白、丙氨酸转氨酶、C反应蛋白、体温;AUC = 0.70)。

结论

我们使用无假设机器学习方法从EHR数据构建了准确的预测模型。确定T2D的既定风险因素是这种分析方法的概念验证,而REFS选择的新因素代表了T2D研究的新兴领域。这种方法在个性化医疗和临床研究方面具有潜在的重要下游应用。

相似文献

1
Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes: An Application of Machine Learning Using Electronic Health Records.2型糖尿病进展预测模型的逆向工程与评估:基于电子健康记录的机器学习应用
J Diabetes Sci Technol. 2015 Dec 20;10(1):6-18. doi: 10.1177/1932296815620200.
2
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.电子健康记录表型分析改善了美国普通人群中2型糖尿病的检测和筛查:一项横断面、非选择性、回顾性研究。
J Biomed Inform. 2016 Apr;60:162-8. doi: 10.1016/j.jbi.2015.12.006. Epub 2015 Dec 17.
3
Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study.基于电子病历中的屈光数据预测中国学龄儿童近视进展:一项回顾性、多中心机器学习研究。
PLoS Med. 2018 Nov 6;15(11):e1002674. doi: 10.1371/journal.pmed.1002674. eCollection 2018 Nov.
4
A machine learning-based framework to identify type 2 diabetes through electronic health records.一种基于机器学习的通过电子健康记录识别2型糖尿病的框架。
Int J Med Inform. 2017 Jan;97:120-127. doi: 10.1016/j.ijmedinf.2016.09.014. Epub 2016 Oct 1.
5
Incorporating temporal EHR data in predictive models for risk stratification of renal function deterioration.将时间性电子健康记录数据纳入肾功能恶化风险分层的预测模型中。
J Biomed Inform. 2015 Feb;53:220-8. doi: 10.1016/j.jbi.2014.11.005. Epub 2014 Nov 15.
6
An interpretable predictive deep learning platform for pediatric metabolic diseases.一个可解释的预测性深度学习平台,用于儿科代谢疾病。
J Am Med Inform Assoc. 2024 May 20;31(6):1227-1238. doi: 10.1093/jamia/ocae049.
7
Machine Learning for Predicting the Risk of Transition from Prediabetes to Diabetes.机器学习预测糖尿病前期向糖尿病的转变风险。
Diabetes Technol Ther. 2022 Nov;24(11):842-847. doi: 10.1089/dia.2022.0210. Epub 2022 Oct 11.
8
Prediction of Type 2 Diabetes Based on Machine Learning Algorithm.基于机器学习算法的 2 型糖尿病预测。
Int J Environ Res Public Health. 2021 Mar 23;18(6):3317. doi: 10.3390/ijerph18063317.
9
Prediction of progression from pre-diabetes to diabetes: Development and validation of a machine learning model.从前期糖尿病到糖尿病的进展预测:机器学习模型的开发和验证。
Diabetes Metab Res Rev. 2020 Feb;36(2):e3252. doi: 10.1002/dmrr.3252. Epub 2020 Jan 14.
10
Exploring Prediabetes Pathways Using Explainable AI on Data from Electronic Medical Records.利用电子病历中的可解释人工智能探索前驱糖尿病途径。
Stud Health Technol Inform. 2024 Aug 22;316:736-740. doi: 10.3233/SHTI240519.

引用本文的文献

1
Applications of Artificial Intelligence and Machine Learning in Prediabetes: A Scoping Review.人工智能和机器学习在糖尿病前期的应用:一项范围综述
J Diabetes Sci Technol. 2025 Jul 8:19322968251351995. doi: 10.1177/19322968251351995.
2
The Use of Machine Learning for Analyzing Real-World Data in Disease Prediction and Management: Systematic Review.机器学习在疾病预测与管理中分析真实世界数据的应用:系统评价
JMIR Med Inform. 2025 Jun 19;13:e68898. doi: 10.2196/68898.
3
Machine learning and artificial intelligence in type 2 diabetes prediction: a comprehensive 33-year bibliometric and literature analysis.机器学习与人工智能在2型糖尿病预测中的应用:一项为期33年的全面文献计量学与文献分析
Front Digit Health. 2025 Mar 27;7:1557467. doi: 10.3389/fdgth.2025.1557467. eCollection 2025.
4
The relationship between epigenetic biomarkers and the risk of diabetes and cancer: a machine learning modeling approach.表观遗传生物标志物与糖尿病和癌症风险之间的关系:一种机器学习建模方法。
Front Public Health. 2025 Mar 21;13:1509458. doi: 10.3389/fpubh.2025.1509458. eCollection 2025.
5
Development and validation of a chronic kidney disease progression model using patient-level simulations.利用患者水平模拟开发和验证慢性肾脏病进展模型。
Ren Fail. 2024 Dec;46(2):2406402. doi: 10.1080/0886022X.2024.2406402. Epub 2024 Oct 21.
6
Enhancing severe hypoglycemia prediction in type 2 diabetes mellitus through multi-view co-training machine learning model for imbalanced dataset.通过多视图协同训练机器学习模型对 2 型糖尿病严重低血糖进行预测,解决数据集不平衡问题。
Sci Rep. 2024 Sep 30;14(1):22741. doi: 10.1038/s41598-024-69844-z.
7
Machine learning-based evaluation of prognostic factors for mortality and relapse in patients with acute lymphoblastic leukemia: a comparative simulation study.基于机器学习的急性淋巴细胞白血病患者死亡率和复发率预后因素评估:一项比较模拟研究。
BMC Med Inform Decis Mak. 2024 Sep 16;24(1):261. doi: 10.1186/s12911-024-02645-6.
8
A Comprehensive Investigation: Developing the Pharmaceutical Industry through Artificial Intelligence.一项全面调查:通过人工智能发展制药行业
Curr Drug Discov Technol. 2024 Sep 5. doi: 10.2174/0115701638313233240830132804.
9
PyCaret for Predicting Type 2 Diabetes: A Phenotype- and Gender-Based Approach with the "Nurses' Health Study" and the "Health Professionals' Follow-Up Study" Datasets.使用“护士健康研究”和“卫生专业人员随访研究”数据集,基于表型和性别的方法,用PyCaret预测2型糖尿病
J Pers Med. 2024 Jul 29;14(8):804. doi: 10.3390/jpm14080804.
10
Predictive modeling of multi-class diabetes mellitus using machine learning and filtering iraqi diabetes data dynamics.基于机器学习的多类别糖尿病预测建模及伊拉克糖尿病数据动态过滤
PLoS One. 2024 May 16;19(5):e0300785. doi: 10.1371/journal.pone.0300785. eCollection 2024.

本文引用的文献

1
Novel predictive models for metabolic syndrome risk: a "big data" analytic approach.代谢综合征风险的新型预测模型:一种“大数据”分析方法。
Am J Manag Care. 2014 Jun 1;20(6):e221-8.
2
Changes in diabetes-related complications in the United States, 1990-2010.美国 1990-2010 年糖尿病相关并发症的变化。
N Engl J Med. 2014 Apr 17;370(16):1514-23. doi: 10.1056/NEJMoa1310799.
3
Diagnosis and classification of diabetes mellitus.糖尿病的诊断与分类
Diabetes Care. 2014 Jan;37 Suppl 1:S81-90. doi: 10.2337/dc14-S081.
4
Liver aminotransferases and risk of incident type 2 diabetes: a systematic review and meta-analysis.肝氨基转移酶与 2 型糖尿病发病风险:系统评价和荟萃分析。
Am J Epidemiol. 2013 Jul 15;178(2):159-71. doi: 10.1093/aje/kws469. Epub 2013 May 31.
5
Prediction models for risk of developing type 2 diabetes: systematic literature search and independent external validation study.预测 2 型糖尿病发病风险的模型:系统文献检索和独立外部验证研究。
BMJ. 2012 Sep 18;345:e5900. doi: 10.1136/bmj.e5900.
6
Clinical practice. Diagnosis of diabetes.临床实践。糖尿病的诊断。
N Engl J Med. 2012 Aug 9;367(6):542-50. doi: 10.1056/NEJMcp1103643.
7
Prediabetes: a high-risk state for diabetes development.糖尿病前期:糖尿病发展的高危状态。
Lancet. 2012 Jun 16;379(9833):2279-90. doi: 10.1016/S0140-6736(12)60283-9. Epub 2012 Jun 9.
8
Analyzing partially missing confounder information in comparative effectiveness and safety research of therapeutics.分析治疗学的比较有效性和安全性研究中部分缺失的混杂因素信息。
Pharmacoepidemiol Drug Saf. 2012 May;21 Suppl 2(0 2):13-20. doi: 10.1002/pds.3248.
9
The emerging role of HDL in glucose metabolism.HDL 在糖代谢中的新兴作用。
Nat Rev Endocrinol. 2012 Jan 24;8(4):237-45. doi: 10.1038/nrendo.2011.235.
10
Interaction between cholesteryl ester transfer protein and hepatic lipase encoding genes and the risk of type 2 diabetes: results from the Telde study.载脂蛋白酯酶和肝脂肪酶基因与 2 型糖尿病风险的相互作用:特内里费研究的结果。
PLoS One. 2011;6(11):e27208. doi: 10.1371/journal.pone.0027208. Epub 2011 Nov 3.