• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在临床环境中,逻辑回归与优化的机器学习算法具有相似的性能:应用于区分年轻成年人的1型和2型糖尿病。

Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults.

作者信息

Lynam Anita L, Dennis John M, Owen Katharine R, Oram Richard A, Jones Angus G, Shields Beverley M, Ferrat Lauric A

机构信息

Institute of Biomedical and Clinical Science, College of Medicine and Health, University of Exeter, Exeter, EX2 5DW UK.

Oxford Centre for Diabetes Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Oxford, OX3 7LE UK.

出版信息

Diagn Progn Res. 2020 Jun 4;4:6. doi: 10.1186/s41512-020-00075-2. eCollection 2020.

DOI:10.1186/s41512-020-00075-2
PMID:32607451
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7318367/
Abstract

BACKGROUND

There is much interest in the use of prognostic and diagnostic prediction models in all areas of clinical medicine. The use of machine learning to improve prognostic and diagnostic accuracy in this area has been increasing at the expense of classic statistical models. Previous studies have compared performance between these two approaches but their findings are inconsistent and many have limitations. We aimed to compare the discrimination and calibration of seven models built using logistic regression and optimised machine learning algorithms in a clinical setting, where the number of potential predictors is often limited, and externally validate the models.

METHODS

We trained models using logistic regression and six commonly used machine learning algorithms to predict if a patient diagnosed with diabetes has type 1 diabetes (versus type 2 diabetes). We used seven predictor variables (age, BMI, GADA islet-autoantibodies, sex, total cholesterol, HDL cholesterol and triglyceride) using a UK cohort of adult participants (aged 18-50 years) with clinically diagnosed diabetes recruited from primary and secondary care ( = 960, 14% with type 1 diabetes). Discrimination performance (ROC AUC), calibration and decision curve analysis of each approach was compared in a separate external validation dataset ( = 504, 21% with type 1 diabetes).

RESULTS

Average performance obtained in internal validation was similar in all models (ROC AUC ≥ 0.94). In external validation, there were very modest reductions in discrimination with AUC ROC remaining ≥ 0.93 for all methods. Logistic regression had the numerically highest value in external validation (ROC AUC 0.95). Logistic regression had good performance in terms of calibration and decision curve analysis. Neural network and gradient boosting machine had the best calibration performance. Both logistic regression and support vector machine had good decision curve analysis for clinical useful threshold probabilities.

CONCLUSION

Logistic regression performed as well as optimised machine algorithms to classify patients with type 1 and type 2 diabetes. This study highlights the utility of comparing traditional regression modelling to machine learning, particularly when using a small number of well understood, strong predictor variables.

摘要

背景

临床医学各领域对预后和诊断预测模型的应用都极为关注。利用机器学习提高该领域的预后和诊断准确性的做法日益增多,而经典统计模型的应用则有所减少。以往研究比较了这两种方法的性能,但结果并不一致,且许多研究存在局限性。我们旨在比较在临床环境中使用逻辑回归和优化的机器学习算法构建的七个模型的区分度和校准度,临床环境中潜在预测变量的数量通常有限,并对模型进行外部验证。

方法

我们使用逻辑回归和六种常用的机器学习算法训练模型,以预测被诊断为糖尿病的患者是否患有1型糖尿病(与2型糖尿病相对)。我们使用了七个预测变量(年龄、体重指数、谷氨酸脱羧酶胰岛自身抗体、性别、总胆固醇、高密度脂蛋白胆固醇和甘油三酯),研究对象为来自英国初级和二级医疗机构招募的成年参与者(年龄在18至50岁之间)的队列,这些参与者均患有临床诊断的糖尿病(n = 960,14%为1型糖尿病)。在一个单独的外部验证数据集中(n = 504,21%为1型糖尿病)比较了每种方法的区分性能(ROC曲线下面积)、校准度和决策曲线分析。

结果

内部验证中所有模型获得的平均性能相似(ROC曲线下面积≥0.94)。在外部验证中,所有方法的区分度虽有非常小的降低,但ROC曲线下面积仍≥0.93。逻辑回归在外部验证中的数值最高(ROC曲线下面积为0.95)。逻辑回归在校准度和决策曲线分析方面表现良好。神经网络和梯度提升机具有最佳的校准性能。逻辑回归和支持向量机在临床有用阈值概率的决策曲线分析方面均表现良好。

结论

在对1型和2型糖尿病患者进行分类时,逻辑回归的表现与优化的机器学习算法相当。本研究强调了将传统回归建模与机器学习进行比较的实用性,特别是在使用少量易于理解的强预测变量时。

相似文献

1
Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults.在临床环境中,逻辑回归与优化的机器学习算法具有相似的性能:应用于区分年轻成年人的1型和2型糖尿病。
Diagn Progn Res. 2020 Jun 4;4:6. doi: 10.1186/s41512-020-00075-2. eCollection 2020.
2
Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?机器学习算法能否预测丹麦膝关节置换登记处的早期翻修 TKA?
Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.
3
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
4
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
5
Development and validation of a prediction model for coronary heart disease risk in depressed patients aged 20 years and older using machine learning algorithms.使用机器学习算法开发并验证针对20岁及以上抑郁症患者冠心病风险的预测模型。
Front Cardiovasc Med. 2025 Jan 9;11:1504957. doi: 10.3389/fcvm.2024.1504957. eCollection 2024.
6
Machine Learning Can be Used to Predict Function but Not Pain After Surgery for Thumb Carpometacarpal Osteoarthritis.机器学习可用于预测拇指腕掌关节炎手术后的功能而非疼痛。
Clin Orthop Relat Res. 2022 Jul 1;480(7):1271-1284. doi: 10.1097/CORR.0000000000002105. Epub 2022 Jan 18.
7
Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.基于机器学习的算法用于预测胫骨平台骨折治疗后2年和5年全膝关节置换风险的研究进展
Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.
8
Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality.开发、验证和评估一种简单的机器学习模型以预测肝硬化死亡率。
JAMA Netw Open. 2020 Nov 2;3(11):e2023780. doi: 10.1001/jamanetworkopen.2020.23780.
9
Development of a 5-Year Risk Prediction Model for Transition From Prediabetes to Diabetes Using Machine Learning: Retrospective Cohort Study.使用机器学习开发一个用于预测糖尿病前期转变为糖尿病的5年风险预测模型:回顾性队列研究。
J Med Internet Res. 2025 May 9;27:e73190. doi: 10.2196/73190.
10
Machine Learning-Based Prediction for 4-Year Risk of Metabolic Syndrome in Adults: A Retrospective Cohort Study.基于机器学习的成年人代谢综合征4年风险预测:一项回顾性队列研究。
Risk Manag Healthc Policy. 2021 Oct 20;14:4361-4368. doi: 10.2147/RMHP.S328180. eCollection 2021.

引用本文的文献

1
Constructing a binary prediction model with incomplete data: Variable selection to balance fairness and precision.构建具有不完整数据的二元预测模型:平衡公平性和精度的变量选择。
Psychol Methods. 2025 Aug 14. doi: 10.1037/met0000786.
2
Integrating Ga-PSMA-11 PET/CT with Clinical Risk Factors for Enhanced Prostate Cancer Progression Prediction.整合镓-PSMA-11 PET/CT与临床风险因素以增强前列腺癌进展预测
Cancers (Basel). 2025 Jul 9;17(14):2285. doi: 10.3390/cancers17142285.
3
A microRNA-based dynamic risk score for type 1 diabetes.一种基于微小RNA的1型糖尿病动态风险评分

本文引用的文献

1
Use of machine learning to analyse routinely collected intensive care unit data: a systematic review.运用机器学习分析常规收集的重症监护病房数据:系统评价。
Crit Care. 2019 Aug 22;23(1):284. doi: 10.1186/s13054-019-2564-9.
2
Evaluation of Machine-Learning Algorithms for Predicting Opioid Overdose Risk Among Medicare Beneficiaries With Opioid Prescriptions.评估机器学习算法在预测有阿片类药物处方的医疗保险受益人群中阿片类药物过量风险中的应用。
JAMA Netw Open. 2019 Mar 1;2(3):e190968. doi: 10.1001/jamanetworkopen.2019.0968.
3
A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models.
Nat Med. 2025 Jun 5. doi: 10.1038/s41591-025-03730-7.
4
Development of a machine learning-based model to predict urethral recurrence following radical cystectomy: a multicentre retrospective study and updated meta-analysis.基于机器学习的模型预测根治性膀胱切除术后尿道复发的研究:一项多中心回顾性研究及更新的荟萃分析
Sci Rep. 2025 Jun 4;15(1):19573. doi: 10.1038/s41598-025-04893-6.
5
Advancing Pediatric Growth Assessment with Machine Learning: Overcoming Challenges in Early Diagnosis and Monitoring.利用机器学习推进儿科生长评估:克服早期诊断和监测中的挑战。
Children (Basel). 2025 Feb 28;12(3):317. doi: 10.3390/children12030317.
6
Comparison of artificial intelligence and logistic regression models for mortality prediction in acute respiratory distress syndrome: a systematic review and meta-analysis.人工智能与逻辑回归模型在急性呼吸窘迫综合征死亡率预测中的比较:一项系统评价和荟萃分析
Intensive Care Med Exp. 2025 Feb 21;13(1):23. doi: 10.1186/s40635-024-00706-8.
7
Use of machine learning algorithms to construct models of symptom burden cluster risk in breast cancer patients undergoing chemotherapy.使用机器学习算法构建接受化疗的乳腺癌患者症状负担聚类风险模型。
Support Care Cancer. 2025 Feb 13;33(3):190. doi: 10.1007/s00520-025-09236-9.
8
Applications of Artificial Intelligence and Machine Learning in Emergency Medicine Triage - A Systematic Review.人工智能和机器学习在急诊医学分诊中的应用——一项系统综述
Med Arch. 2024;78(3):198-206. doi: 10.5455/medarh.2024.78.198-206.
9
Statistical models versus machine learning approach for competing risks in proctological surgery.直肠外科手术中竞争风险的统计模型与机器学习方法
Updates Surg. 2025 Apr;77(2):333-341. doi: 10.1007/s13304-025-02109-0. Epub 2025 Jan 25.
10
2. Diagnosis and Classification of Diabetes: Standards of Care in Diabetes-2025.2. 糖尿病的诊断与分类:《2025年糖尿病防治标准》
Diabetes Care. 2025 Jan 1;48(Supplement_1):S27-S49. doi: 10.2337/dc25-S002.
系统评价显示,机器学习在临床预测模型中并未优于逻辑回归。
J Clin Epidemiol. 2019 Jun;110:12-22. doi: 10.1016/j.jclinepi.2019.02.004. Epub 2019 Feb 11.
4
Development and Validation of an Electronic Health Record-Based Machine Learning Model to Estimate Delirium Risk in Newly Hospitalized Patients Without Known Cognitive Impairment.基于电子病历的机器学习模型开发与验证:用于预测无已知认知障碍的新入院患者发生谵妄的风险。
JAMA Netw Open. 2018 Aug 3;1(4):e181018. doi: 10.1001/jamanetworkopen.2018.1018.
5
Development of a prediction model for pancreatic cancer in patients with type 2 diabetes using logistic regression and artificial neural network models.使用逻辑回归和人工神经网络模型开发2型糖尿病患者胰腺癌预测模型。
Cancer Manag Res. 2018 Nov 26;10:6317-6324. doi: 10.2147/CMAR.S180791. eCollection 2018.
6
A comparison of logistic regression models with alternative machine learning methods to predict the risk of in-hospital mortality in emergency medical admissions via external validation.通过外部验证比较逻辑回归模型与替代机器学习方法,以预测急诊入院患者住院内死亡风险。
Health Informatics J. 2020 Mar;26(1):34-44. doi: 10.1177/1460458218813600. Epub 2018 Nov 29.
7
Identifying people at risk of developing type 2 diabetes: A comparison of predictive analytics techniques and predictor variables.识别有患 2 型糖尿病风险的人群:预测分析技术和预测变量的比较。
Int J Med Inform. 2018 Nov;119:22-38. doi: 10.1016/j.ijmedinf.2018.08.008. Epub 2018 Aug 28.
8
Decision curve analysis: a technical note.决策曲线分析:技术说明
Ann Transl Med. 2018 Aug;6(15):308. doi: 10.21037/atm.2018.07.02.
9
Big Data and Predictive Analytics: Recalibrating Expectations.大数据与预测分析:重新校准期望
JAMA. 2018 Jul 3;320(1):27-28. doi: 10.1001/jama.2018.5602.
10
Big Data and Machine Learning in Health Care.医疗保健中的大数据与机器学习
JAMA. 2018 Apr 3;319(13):1317-1318. doi: 10.1001/jama.2017.18391.