Yang Shuheng, Weiskirchen Ralf, Zheng Wenjing, Hu Xiangxu, Zou Aibiao, Liu Zhiguo, Wang Hualin
School of Life Science and Technology, Wuhan Polytechnic University, Wuhan, China.
Institute of Molecular Pathobiochemistry, Experimental Gene Therapy and Clinical Chemistry (IFMPEGKC), RWTH University Hospital, Aachen, Germany.
Front Nutr. 2025 Jan 7;11:1520779. doi: 10.3389/fnut.2024.1520779. eCollection 2024.
The incidence of type 2 diabetes mellitus (T2DM) has increased in recent years. Alongside traditional pharmacological treatments, nutritional therapy has emerged as a crucial aspect of T2DM management. Inulin, a fructan-type soluble fiber that promotes the growth of probiotic species like and , is commonly used in nutritional interventions for T2DM. However, it remains unclear which type of T2DM patients are suitable for inulin intervention. The aim of this study was to predict the effectiveness of inulin treatment for T2DM using a machine learning model.
Original data were obtained from a previous study. After screening T2DM patients, feature election was conducted using LASSO regression, and a machine learning model was developed using XGBoost. The model's performance was evaluated based on accuracy, specificity, positive predictive value, negative predictive value and further analyzed using receiver operating curves, calibration curves, and decision curves.
Out of the 758 T2DM patients included, 477 had their glycated hemoglobin (HbA1c) levels reduced to less than 6.5% after inulin intervention, resulting in an incidence rate of 62.93%. LASSO regression identified six key factors in patients prior to inulin treatment. The SHAP values for interpretation ranked the characteristic variables in descending order of importance: HbA1c, difference between fasting and 2 h-postprandial glucose levels, fasting blood glucose, high-density lipoprotein, age, and body mass index. The XGBoost prediction model demonstrated a training set accuracy of 0.819, specificity of 0.913, positive predictive value of 0.818, and negative predictive value of 0.820. The testing set showed an accuracy of 0.709, specificity of 0.909, positive predictive value of 0.705, and negative predictive value of 0.710.
The XGBoost-SHAP framework for predicting the impact of inulin intervention in T2DM treatment proves to be effective. It allows for the comparison of prediction effect based on different features of an individual, assessment of prediction abilities for different individuals given their features, and establishes a connection between machine learning and nutritional intervention in T2DM treatment. This offers valuable insights for researchers in this field.
近年来,2型糖尿病(T2DM)的发病率有所上升。除了传统的药物治疗外,营养治疗已成为T2DM管理的一个关键方面。菊粉是一种果聚糖型可溶性纤维,可促进如[具体益生菌种类1]和[具体益生菌种类2]等益生菌的生长,常用于T2DM的营养干预。然而,尚不清楚哪种类型的T2DM患者适合菊粉干预。本研究的目的是使用机器学习模型预测菊粉治疗T2DM的有效性。
原始数据来自先前的一项研究。在筛选T2DM患者后,使用LASSO回归进行特征选择,并使用XGBoost开发机器学习模型。基于准确性、特异性、阳性预测值、阴性预测值评估模型性能,并使用受试者工作特征曲线、校准曲线和决策曲线进行进一步分析。
在纳入的758例T2DM患者中,477例在菊粉干预后糖化血红蛋白(HbA1c)水平降至6.5%以下,发生率为62.93%。LASSO回归确定了菊粉治疗前患者的六个关键因素。用于解释的SHAP值按重要性降序排列特征变量:HbA1c、空腹血糖与餐后2小时血糖水平之差、空腹血糖、高密度脂蛋白、年龄和体重指数。XGBoost预测模型在训练集上的准确率为0.819,特异性为0.913,阳性预测值为0.818,阴性预测值为0.820。测试集的准确率为0.709,特异性为0.909,阳性预测值为0.705,阴性预测值为0.710。
用于预测菊粉干预对T2DM治疗影响的XGBoost-SHAP框架被证明是有效的。它允许根据个体的不同特征比较预测效果,根据个体特征评估不同个体的预测能力,并在T2DM治疗中建立机器学习与营养干预之间的联系。这为该领域的研究人员提供了有价值的见解。