Shu Yufeng, Chen Zhe, Chi Jingshu, Cheng Sha, Li Huan, Liu Peng, Luo Ju
Department of Gastroenterology, Third Xiangya Hospital, Central South University., Changsha, Hunan, People's Republic of China.
Department of Gerontology, The Affiliated Changsha Central Hospital, Hengyang Medical School, University of South China., Changsha, Hunan, People's Republic of China.
J Multidiscip Healthc. 2024 Aug 8;17:3835-3847. doi: 10.2147/JMDH.S470429. eCollection 2024.
Whether machine learning (ML) can assist in the diagnosis of Crohn's disease (CD) and intestinal tuberculosis (ITB) remains to be explored.
We collected clinical data from 241 patients, and 51 parameters were included. Six ML methods were tested, including logistic regression, decision tree, k-nearest neighbor, multinomial NB, multilayer perceptron, and XGBoost. SHAP and LIME were subsequently introduced as interpretability methods. The ML model was tested in a real-world clinical practice and compared with a multidisciplinary team (MDT) meeting.
XGBoost displays the best performance among the six ML models. The diagnostic AUROC and the accuracy of XGBoost were 0.946 and 0.884, respectively. The top three clinical features affecting our ML model's result prediction were T-spot, pulmonary tuberculosis, and onset age. The ML model's accuracy, sensitivity, and specificity in clinical practice were 0.860, 0.833, and 0.871, respectively. The agreement rate and kappa coefficient of the ML and MDT methods were 90.7% and 0.780, respectively (P<0.001).
We developed an ML model based on XGBoost. The ML model could provide effective and efficient differential diagnoses of ITB and CD with diagnostic bases. The ML model performs well in real-world clinical practice, and the agreement between the ML model and MDT is strong.
机器学习(ML)能否辅助诊断克罗恩病(CD)和肠结核(ITB)仍有待探索。
我们收集了241例患者的临床数据,纳入51项参数。测试了六种机器学习方法,包括逻辑回归、决策树、k近邻、多项式朴素贝叶斯、多层感知器和极端梯度提升(XGBoost)。随后引入SHAP和LIME作为可解释性方法。该机器学习模型在实际临床实践中进行测试,并与多学科团队(MDT)会诊进行比较。
XGBoost在六种机器学习模型中表现最佳。XGBoost的诊断曲线下面积(AUROC)和准确率分别为0.946和0.884。影响我们机器学习模型结果预测的前三个临床特征是结核感染T细胞检测(T-spot)、肺结核和发病年龄。该机器学习模型在临床实践中的准确率、敏感性和特异性分别为0.860、0.833和0.871。机器学习方法与MDT方法的一致率和kappa系数分别为90.7%和0.780(P<0.001)。
我们开发了一种基于XGBoost的机器学习模型。该机器学习模型可以为肠结核和克罗恩病提供有效且高效的鉴别诊断及诊断依据。该机器学习模型在实际临床实践中表现良好,与MDT之间的一致性很强。