纳入饮食摄入量的机器学习算法在妊娠期糖尿病预测中的应用。

Application of machine learning algorithm incorporating dietary intake in prediction of gestational diabetes mellitus.

作者信息

Ding Tianze, Liu Peijie, Jia Jie, Wu Hui, Zhu Jie, Yang Kefeng

机构信息

Department of Clinical Nutrition, Xin Hua Hospital Affiliated to School of Medicine, Shanghai Jiao Tong University, Shanghai, China.

Department of Clinical Nutrition, College of Heath Science and Technology, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.

出版信息

Endocr Connect. 2024 Nov 21;13(12). doi: 10.1530/EC-24-0169. Print 2024 Dec 1.

DOI:10.1530/EC-24-0169

PMID:39393404

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11623027/

Abstract

INTRODUCTION

Gestational diabetes mellitus (GDM) significantly affects pregnancy outcomes. Therefore, it is crucial to develop prediction models since they can guide timely interventions to reduce the incidence of GDM and its associated adverse effects.

METHODS

A total of 554 pregnant women were selected and their sociodemographic characteristics, clinical data and dietary data were collected. Dietary data were investigated by a validated semi-quantitative food frequency questionnaire (FFQ). We applied random forest mean decrease impurity for feature selection and the models are built using logistic regression, XGBoost, and LightGBM algorithms. The prediction performance of different models was compared by accuracy, sensitivity, specificity, area under curve (AUC) and Hosmer-Lemeshow test.

RESULTS

Blood glucose, age, pre-pregnancy body mass index (BMI), triglycerides and high-density lipoprotein cholesterol (HDL) were the top five features according to the feature selection. Among the three algorithms, XGBoost performed best with an AUC of 0.788, LightGBM came second (AUC = 0.749), and logistic regression performed the worst (AUC = 0.712). In addition, XGBoost and LightGBM both achieved a fairly good performance when dietary information was included, surpassing their performance on the non-dietary dataset (0.788 vs 0.718 in XGBoost; 0.749 vs 0.726 in LightGBM).

CONCLUSION

XGBoost and LightGBM algorithms outperform logistic regression in predicting GDM among Chinese pregnant women. In addition, dietary data may have a positive effect on improving model performance, which deserves more in-depth investigation with larger sample size.

摘要

引言

妊娠期糖尿病（GDM）会显著影响妊娠结局。因此，开发预测模型至关重要，因为它们可以指导及时干预，以降低GDM的发生率及其相关不良影响。

方法

共选取554名孕妇，收集她们的社会人口学特征、临床数据和饮食数据。饮食数据通过经过验证的半定量食物频率问卷（FFQ）进行调查。我们应用随机森林平均减少杂质进行特征选择，并使用逻辑回归、XGBoost和LightGBM算法构建模型。通过准确性、敏感性、特异性、曲线下面积（AUC）和Hosmer-Lemeshow检验比较不同模型的预测性能。

结果

根据特征选择，血糖、年龄、孕前体重指数（BMI）、甘油三酯和高密度脂蛋白胆固醇（HDL）是前五个特征。在这三种算法中，XGBoost表现最佳，AUC为0.788，LightGBM次之（AUC = 0.749），逻辑回归表现最差（AUC = 0.712）。此外，当纳入饮食信息时，XGBoost和LightGBM均取得了相当好的性能，超过了它们在非饮食数据集上的性能（XGBoost中为0.788对0.718；LightGBM中为0.749对0.726）。

结论

在中国孕妇中，XGBoost和LightGBM算法在预测GDM方面优于逻辑回归。此外，饮食数据可能对提高模型性能有积极作用，值得进行更大样本量的更深入研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5224/11623027/2fab7d3422e5/EC-24-0169fig1.jpg

相似文献

Application of machine learning algorithm incorporating dietary intake in prediction of gestational diabetes mellitus.纳入饮食摄入量的机器学习算法在妊娠期糖尿病预测中的应用。

Endocr Connect. 2024 Nov 21;13(12). doi: 10.1530/EC-24-0169. Print 2024 Dec 1.

Early prediction of postpartum dyslipidemia in gestational diabetes using machine learning models.使用机器学习模型对妊娠期糖尿病患者产后血脂异常进行早期预测。

Sci Rep. 2025 Mar 7;15(1):8028. doi: 10.1038/s41598-025-92299-9.

Comparison of Machine Learning Methods and Conventional Logistic Regressions for Predicting Gestational Diabetes Using Routine Clinical Data: A Retrospective Cohort Study.使用常规临床数据的机器学习方法与传统逻辑回归预测妊娠期糖尿病的比较：一项回顾性队列研究。

J Diabetes Res. 2020 Jun 12;2020:4168340. doi: 10.1155/2020/4168340. eCollection 2020.

Machine learning risk score for prediction of gestational diabetes in early pregnancy in Tianjin, China.基于中国天津地区的早期妊娠孕妇机器学习风险评分预测妊娠期糖尿病

Diabetes Metab Res Rev. 2021 Jul;37(5):e3397. doi: 10.1002/dmrr.3397. Epub 2020 Sep 9.

Machine Learning-Derived Prenatal Predictive Risk Model to Guide Intervention and Prevent the Progression of Gestational Diabetes Mellitus to Type 2 Diabetes: Prediction Model Development Study.机器学习衍生的产前预测风险模型，用于指导干预并预防妊娠期糖尿病进展为2型糖尿病：预测模型开发研究

JMIR Diabetes. 2022 Jul 5;7(3):e32366. doi: 10.2196/32366.

Establishment and validation of a heart failure risk prediction model for elderly patients after coronary rotational atherectomy based on machine learning.基于机器学习的老年患者冠状动脉旋磨术后心力衰竭风险预测模型的建立与验证

PeerJ. 2024 Jan 31;12:e16867. doi: 10.7717/peerj.16867. eCollection 2024.

Predicting Gestational Diabetes Mellitus in the first trimester using machine learning algorithms: a cross-sectional study at a hospital fertility health center in Iran.使用机器学习算法预测孕早期的妊娠期糖尿病：伊朗一家医院生育健康中心的横断面研究

BMC Med Inform Decis Mak. 2025 Jan 3;25(1):3. doi: 10.1186/s12911-024-02799-3.

Establishment of gestational diabetes risk prediction model and clinical verification.妊娠期糖尿病风险预测模型的建立及临床验证。

J Endocrinol Invest. 2024 May;47(5):1281-1287. doi: 10.1007/s40618-023-02249-3. Epub 2023 Dec 12.

Predicting low birth weight risks in pregnant women in Brazil using machine learning algorithms: data from the Araraquara cohort study.使用机器学习算法预测巴西孕妇的低出生体重风险：来自阿拉拉夸拉队列研究的数据。

BMC Pregnancy Childbirth. 2025 Mar 19;25(1):320. doi: 10.1186/s12884-025-07351-3.

Prediction of gestational diabetes mellitus at the first trimester: machine-learning algorithms.预测妊娠期糖尿病：基于机器学习算法的研究。

Arch Gynecol Obstet. 2024 Jun;309(6):2557-2566. doi: 10.1007/s00404-023-07131-4. Epub 2023 Jul 21.

本文引用的文献

Risk prediction models of gestational diabetes mellitus before 16 gestational weeks.16 孕周前妊娠糖尿病风险预测模型。

BMC Pregnancy Childbirth. 2022 Dec 1;22(1):889. doi: 10.1186/s12884-022-05219-4.

Development and Validation of Risk Prediction Models for Gestational Diabetes Mellitus Using Four Different Methods.使用四种不同方法的妊娠期糖尿病风险预测模型的开发与验证

Metabolites. 2022 Oct 29;12(11):1040. doi: 10.3390/metabo12111040.

Study on the correlation between homocysteine-related dietary patterns and gestational diabetes mellitus:a reduced-rank regression analysis study.同型半胱氨酸相关饮食模式与妊娠期糖尿病的相关性研究：降秩回归分析研究。

BMC Pregnancy Childbirth. 2022 Apr 10;22(1):306. doi: 10.1186/s12884-022-04656-5.

The Relative Validity and Reproducibility of Food Frequency Questionnaires in the China Kadoorie Biobank Study.食物频率问卷在“中国慢性病前瞻性研究”中的相对有效性和可重复性。

Nutrients. 2022 Feb 14;14(4):794. doi: 10.3390/nu14040794.

A Clinical Update on Gestational Diabetes Mellitus.妊娠期糖尿病的临床新进展。

Endocr Rev. 2022 Sep 26;43(5):763-793. doi: 10.1210/endrev/bnac003.

Nonalcoholic fatty liver disease and early prediction of gestational diabetes mellitus using machine learning methods.非酒精性脂肪肝疾病和使用机器学习方法对妊娠期糖尿病的早期预测。

Clin Mol Hepatol. 2022 Jan;28(1):105-116. doi: 10.3350/cmh.2021.0174. Epub 2021 Oct 15.

A risk prediction model of gestational diabetes mellitus before 16 gestational weeks in Chinese pregnant women.中国孕妇 16 孕周前妊娠糖尿病风险预测模型。

Diabetes Res Clin Pract. 2021 Sep;179:109001. doi: 10.1016/j.diabres.2021.109001. Epub 2021 Aug 12.

Association of thyroid disorders with gestational diabetes mellitus: a meta-analysis.甲状腺疾病与妊娠糖尿病的关系：一项荟萃分析。

Endocrine. 2021 Sep;73(3):550-560. doi: 10.1007/s12020-021-02712-2. Epub 2021 May 13.

Early Prediction of Gestational Diabetes Mellitus in the Chinese Population via Advanced Machine Learning.基于先进机器学习的中国人群妊娠期糖尿病早期预测。

J Clin Endocrinol Metab. 2021 Mar 8;106(3):e1191-e1205. doi: 10.1210/clinem/dgaa899.

Incidence and Risk Factors of Gestational Diabetes Mellitus: A Prospective Cohort Study in Qingdao, China.妊娠期糖尿病的发病率及危险因素：中国青岛的一项前瞻性队列研究

Front Endocrinol (Lausanne). 2020 Sep 11;11:636. doi: 10.3389/fendo.2020.00636. eCollection 2020.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

纳入饮食摄入量的机器学习算法在妊娠期糖尿病预测中的应用。

Application of machine learning algorithm incorporating dietary intake in prediction of gestational diabetes mellitus.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

CONCLUSION

引言

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献