Yagin Fatma Hilal, Colak Cemil, Algarni Abdulmohsen, Algarni Ali, Al-Hashem Fahaid, Ardigò Luca Paolo
Department of Biostatistics, Faculty of Medicine, Malatya Turgut Ozal University, 44210 Malatya, Turkey.
Department of Biostatistics, and Medical Informatics, Faculty of Medicine, Inonu University, 44280 Malatya, Turkey.
Medicina (Kaunas). 2025 Apr 30;61(5):833. doi: 10.3390/medicina61050833.
: Rheumatoid arthritis (RA) is a chronic autoimmune disease characterised by joint inflammation and pain. Metabolomics approaches, which are high-throughput profiling of small molecule metabolites in plasma or serum in RA patients, have so far provided biomarker discovery in the literature for clinical subgroups, risk factors, and predictors of treatment response using classical statistical approaches or machine learning models. Despite these recent developments, an explainable artificial intelligence (XAI)-based methodology has not been used to identify RA metabolomic biomarkers and distinguish patients with RA. This study constructed a XAI-based EBM model using global plasma metabolomics profiling to identify metabolites predictive of RA patients and to develop a classification model that can distinguish RA patients from healthy controls. : Global plasma metabolomics data were analysed from RA patients (49 samples) and healthy individuals (10 samples). SMOTE technique was used for class imbalance in data preprocessing. EBM, LightGBM, and AdaBoost algorithms were applied to generate a discriminatory model between RA and controls. Comprehensive performance metrics were calculated, and the interpretability of the optimal model was assessed using global and local feature descriptions. : A total of 59 samples were analysed, 49 from RA patients, and 10 from healthy subjects. The EBM generated better results than LightGBM and AdaBoost by attaining an AUC of 0.901 (95% CI: 0.847-0.955) with 87.8% sensitivity which helps prevent false negative early RA diagnosis. The primary biomarkers EBM-based XAI identified were -acetyleucine, pyruvic acid, and glycerol-3-phosphate. EBM global explanation analysis indicated that elevated pyruvic acid levels were significantly correlated with RA, whereas -acetyleucine exhibited a nonlinear relationship, implying possible protective effects at specific concentrations. : This study underscores the promise of XAI and evidence-based medicine methodology in developing biomarkers for RA through metabolomics. The discovered metabolites offer significant insights into RA pathophysiology and may function as diagnostic biomarkers or therapeutic targets. Incorporating EBM methodologies integrated with XAI improves model transparency and increases the therapeutic applicability of predictive models for RA diagnosis/management. Furthermore, the transparent structure of the EBM model empowers clinicians to understand and verify the reasoning behind each prediction, thereby fostering trust in AI-assisted decision-making and facilitating the integration of metabolomic insights into routine clinical practice.
类风湿性关节炎(RA)是一种以关节炎症和疼痛为特征的慢性自身免疫性疾病。代谢组学方法是对RA患者血浆或血清中的小分子代谢物进行高通量分析,目前已通过经典统计方法或机器学习模型在文献中为临床亚组、风险因素和治疗反应预测指标发现了生物标志物。尽管有这些最新进展,但基于可解释人工智能(XAI)的方法尚未用于识别RA代谢组学生物标志物和区分RA患者。本研究构建了一个基于XAI的循证医学(EBM)模型,利用全球血浆代谢组学分析来识别RA患者的预测性代谢物,并开发一个能够区分RA患者和健康对照的分类模型。
对RA患者(49个样本)和健康个体(10个样本)的全球血浆代谢组学数据进行了分析。在数据预处理中使用SMOTE技术处理类别不平衡问题。应用EBM、LightGBM和AdaBoost算法生成RA与对照组之间的鉴别模型。计算了综合性能指标,并使用全局和局部特征描述评估了最优模型的可解释性。
共分析了59个样本,其中49个来自RA患者,10个来自健康受试者。EBM取得了比LightGBM和AdaBoost更好的结果,AUC为0.901(95%CI:0.847 - 0.955),灵敏度为87.8%,有助于防止早期RA诊断出现假阴性。基于EBM的XAI识别出的主要生物标志物为乙酰亮氨酸、丙酮酸和3 - 磷酸甘油。EBM全局解释分析表明,丙酮酸水平升高与RA显著相关,而乙酰亮氨酸表现出非线性关系,这意味着在特定浓度下可能具有保护作用。
本研究强调了XAI和循证医学方法在通过代谢组学为RA开发生物标志物方面的前景。发现的代谢物为RA病理生理学提供了重要见解,可能作为诊断生物标志物或治疗靶点。将EBM方法与XAI相结合,提高了模型的透明度,增加了预测模型在RA诊断/管理中的治疗适用性。此外,EBM模型的透明结构使临床医生能够理解和验证每个预测背后的推理,从而增强对人工智能辅助决策的信任,并促进代谢组学见解融入常规临床实践。