Development and validation of an insulin resistance model for a population without diabetes mellitus and its clinical implication: a prospective cohort study.

Authors

Tsai Shang-Feng, Yang Chao-Tung, Liu Wei-Ju, Lee Chia-Lin

Affiliations

Department of Post-Baccalaureate Medicine, College of Medicine, National Chung Hsing University, Taichung, Taiwan.

School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.

Publication information

EClinicalMedicine. 2023 Apr 4;58:101934. doi: 10.1016/j.eclinm.2023.101934. eCollection 2023 Apr.

DOI:10.1016/j.eclinm.2023.101934

PMID:37090441

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10119497/

Abstract

BACKGROUND

Insulin resistance (IR) is associated with diabetes mellitus (DM), cardiovascular (CV) disease, and mortality. Few studies have used machine learning to predict IR in the non-diabetic population.

METHODS

In this prospective cohort study, we trained a model to predict IR in non-diabetic populations using the US National Health and Nutrition Examination Survey (NHANES; January 1, 1999 to December 31, 2012) database and the Taiwan MAJOR (January 1, 2008 to December 31, 2017) database. Participants in NHANES and MAJOR were excluded if they were aged <18 years, had incomplete laboratory data, or had diabetes mellitus. To investigate the clinical implications of the trained model for CV and all-cause mortality, we tested it on the Taiwan Biobank (TWB) database (December 10, 2008 to November 30, 2018). We then used SHapley Additive exPlanations (SHAP) values to explain differences across the machine learning models.
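SHAP attributions are built on Shapley values, so a small worked example may help in reading the feature-level explanations. The sketch below computes exact Shapley values by enumerating feature orderings. It is a toy illustration only: `toy_score`, its coefficients, and the input values are hypothetical and are not the study's model or data. Production SHAP tooling uses faster algorithms than this brute-force enumeration.

```python
from itertools import permutations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict(x) relative to predict(baseline).

    A feature that is "absent" from a coalition keeps its baseline value.
    Cost grows as n!, so this is only practical for a handful of features.
    """
    n = len(x)
    phi = [0.0] * n
    for order in permutations(range(n)):
        current = list(baseline)
        previous = predict(current)
        for i in order:
            current[i] = x[i]            # switch feature i from baseline to x
            value = predict(current)
            phi[i] += value - previous   # marginal contribution in this order
            previous = value
    n_orders = factorial(n)
    return [total / n_orders for total in phi]


def toy_score(features):
    """Hypothetical score with made-up coefficients (NOT the study's model).

    The first two inputs interact; the third is ignored by the score.
    """
    a, b, _unused = features
    return 2.0 * a + 3.0 * b + a * b


x = [1.0, 1.0, 5.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(toy_score, x, baseline)
print(phi)  # [2.5, 3.5, 0.0]
```

The attributions sum to `toy_score(x) - toy_score(baseline)`, the interaction term is split evenly between the two interacting features, and the unused feature receives zero.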

FINDINGS

Of all participants (combined NHANES and MJ databases), we randomly selected 14,705 participants for the training group and 4018 participants for the validation group. In the validation group, the areas under the curve (AUC) of all models were >0.80 (highest: XGBoost, 0.87). In the test group, all AUCs were also >0.80 (highest: XGBoost, 0.88). Among the 9 features (age, gender, race, body mass index (BMI), fasting plasma glucose (FPG), glycohemoglobin, triglyceride, total cholesterol, and high-density lipoprotein cholesterol), BMI had the highest feature importance for IR (0.43 for the XGBoost algorithm and 0.47 for the random forest (RF) algorithm). All participants from the TWB database were separated into an IR group and a non-IR group according to the XGBoost algorithm. The Kaplan-Meier survival curves differed significantly between the IR and non-IR groups (p < 0.0001 for CV mortality and p = 0.0006 for all-cause mortality). Therefore, in addition to predicting IR, the XGBoost model has clear clinical implications for CV and all-cause mortality.
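For readers less familiar with the AUC values quoted above, the following minimal sketch shows one standard way to compute the metric: as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The labels and scores are made-up illustrative numbers, not study data, and `roc_auc` is a hypothetical helper name.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: iterable of 0/1 class labels (1 = positive, e.g. IR present).
    scores: iterable of model scores, higher meaning "more likely positive".
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")

    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0   # positive ranked above negative
            elif p == q:
                wins += 0.5   # tie counts as half
    return wins / (len(positives) * len(negatives))


# Illustrative values only.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(labels, scores))  # 0.75
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, so values of 0.87 to 0.88 indicate that the model ranks a true IR case above a non-IR case in roughly 87 to 88 percent of such pairs.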

INTERPRETATION

Our machine learning model predicts IR in non-diabetic individuals with high accuracy using only 9 easily obtained features. The participants it classifies as having IR also have significantly higher CV and all-cause mortality. The model can be applied to both Asian and Caucasian populations in clinical practice.

FUNDING

Taichung Veterans General Hospital, Taiwan, and the Japan Society for the Promotion of Science (KAKENHI grant number JP21KK0293).
