Predicting diabetes-related hospitalizations based on electronic health records.

Affiliation

Center for Information and Systems Engineering, Boston University, Boston, MA, USA.

Publication information

Stat Methods Med Res. 2019 Dec;28(12):3667-3682. doi: 10.1177/0962280218810911. Epub 2018 Nov 25.

DOI:10.1177/0962280218810911

PMID:30474497

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7537810/

Abstract

The objective is to derive a predictive model that identifies patients likely to be hospitalized during the following year due to complications attributed to Type II diabetes. A variety of supervised machine learning classification methods were tested, and a new method was developed that discovers hidden patient clusters in the positive class (hospitalized) while simultaneously deriving sparse linear support vector machine classifiers to separate positive samples from negative ones (non-hospitalized). The convergence of the new method was established, and theoretical guarantees were proved on how the classifiers it produces generalize to a test set not seen during training. The methods were tested on a large set of patients from the Boston Medical Center, the largest safety net hospital in New England. The new joint clustering/classification method achieves an accuracy of 89% (measured as area under the ROC curve) and yields informative clusters that help interpret the classification results, thus increasing physicians' trust in the algorithmic output and providing some guidance towards preventive measures. While other methods can increase accuracy to 92%, this comes at the cost of increased computation and a lack of interpretability. The analysis shows that even a modest probability of preventive actions being effective (more than 19%) suffices to generate significant hospital care savings. The proposed predictive models can help avert hospitalizations, improve health outcomes, and drastically reduce hospital expenditures. The scope for savings is significant: it has been estimated that in the USA alone, about $5.8 billion is spent each year on diabetes-related hospitalizations that could be prevented.
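The abstract reports accuracy as area under the ROC curve (AUC). As a minimal sketch of what that metric measures, the snippet below computes AUC as the probability that a randomly chosen hospitalized patient receives a higher risk score than a randomly chosen non-hospitalized one, with ties counted as one half. The function name `roc_auc` and the toy labels and scores are assumptions made up for illustration; they are not the paper's data or code.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC via pairwise comparison of positive and negative scores.

    labels: 1 = positive (hospitalized), 0 = negative (non-hospitalized).
    scores: classifier outputs, higher meaning higher predicted risk.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive correctly ranked above negative
            elif p == n:
                wins += 0.5   # a tie counts as half a correct ranking
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, invented for illustration only.
labels = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.8, 0.3, 0.2, 0.1]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

The pairwise loop is quadratic in the number of samples, which is fine for a demonstration; on a large patient cohort a rank-based implementation would be the more practical choice.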

Similar articles

Predicting Chronic Disease Hospitalizations from Electronic Health Records: An Interpretable Classification Approach.

Proc IEEE Inst Electr Electron Eng. 2018 Apr;106(4):690-707. doi: 10.1109/JPROC.2017.2789319. Epub 2018 Feb 6.

Prediction of hospitalization due to heart diseases by supervised learning methods.

Int J Med Inform. 2015 Mar;84(3):189-97. doi: 10.1016/j.ijmedinf.2014.10.002. Epub 2014 Oct 16.

Federated learning of predictive models from federated Electronic Health Records.

Int J Med Inform. 2018 Apr;112:59-67. doi: 10.1016/j.ijmedinf.2018.01.007. Epub 2018 Jan 12.

Development and Validation of an Electronic Health Record-Based Machine Learning Model to Estimate Delirium Risk in Newly Hospitalized Patients Without Known Cognitive Impairment.

JAMA Netw Open. 2018 Aug 3;1(4):e181018. doi: 10.1001/jamanetworkopen.2018.1018.

Development and Validation of Machine Learning Models for Prediction of 1-Year Mortality Utilizing Electronic Medical Record Data Available at the End of Hospitalization in Multicondition Patients: a Proof-of-Concept Study.

J Gen Intern Med. 2018 Jun;33(6):921-928. doi: 10.1007/s11606-018-4316-y. Epub 2018 Jan 30.

Predicting the onset of type 2 diabetes using wide and deep learning with electronic health records.

Comput Methods Programs Biomed. 2019 Dec;182:105055. doi: 10.1016/j.cmpb.2019.105055. Epub 2019 Aug 27.

Comparison of Machine Learning Methods With Traditional Models for Use of Administrative Claims With Electronic Medical Records to Predict Heart Failure Outcomes.

JAMA Netw Open. 2020 Jan 3;3(1):e1918962. doi: 10.1001/jamanetworkopen.2019.18962.

Prediction of Future Chronic Opioid Use Among Hospitalized Patients.

J Gen Intern Med. 2018 Jun;33(6):898-905. doi: 10.1007/s11606-018-4335-8. Epub 2018 Feb 5.

Prospective and External Evaluation of a Machine Learning Model to Predict In-Hospital Mortality of Adults at Time of Admission.

JAMA Netw Open. 2020 Feb 5;3(2):e1920733. doi: 10.1001/jamanetworkopen.2019.20733.

Cited by

DRPM: An advanced predictive model for early diabetes detection and risk stratification.

Mol Ther Nucleic Acids. 2025 May 27;36(3):102576. doi: 10.1016/j.omtn.2025.102576. eCollection 2025 Sep 9.

Machine learning for high-risk hospitalization prediction in outpatient individuals with diabetes at a tertiary hospital.

Arch Endocrinol Metab. 2025 Apr 15;69(2):e230348. doi: 10.20945/2359-4292-2024-0317.

Diabetes and Hospitalizations Among Mexican Americans Aged 75 Years and Older.

J Prim Care Community Health. 2024 Jan-Dec;15:21501319241266108. doi: 10.1177/21501319241266108.

Predicting polycystic ovary syndrome with machine learning algorithms from electronic health records.

Front Endocrinol (Lausanne). 2024 Jan 30;15:1298628. doi: 10.3389/fendo.2024.1298628. eCollection 2024.

Personalized hypertension treatment recommendations by a data-driven model.

BMC Med Inform Decis Mak. 2023 Mar 1;23(1):44. doi: 10.1186/s12911-023-02137-z.

Machine learning models for prediction of HF and CKD development in early-stage type 2 diabetes patients.

Sci Rep. 2022 Nov 21;12(1):20012. doi: 10.1038/s41598-022-24562-2.

Characterization of Symptoms and Symptom Clusters for Type 2 Diabetes Using a Large Nationwide Electronic Health Record Database.

Diabetes Spectr. 2022 Spring;35(2):159-170. doi: 10.2337/ds21-0064. Epub 2022 Jan 11.

Machine learning and deep learning predictive models for type 2 diabetes: a systematic review.

Diabetol Metab Syndr. 2021 Dec 20;13(1):148. doi: 10.1186/s13098-021-00767-9.

Improving Risk Identification of Adverse Outcomes in Chronic Heart Failure Using SMOTE+ENN and Machine Learning.

Risk Manag Healthc Policy. 2021 Jun 8;14:2453-2463. doi: 10.2147/RMHP.S310295. eCollection 2021.

Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data.

NPJ Digit Med. 2021 Feb 12;4(1):24. doi: 10.1038/s41746-021-00394-8.
