Suppr超能文献

使用基于基因特征的机器学习模型预测腰痛患者的治疗结果。

Predicting Treatment Outcomes in Patients with Low Back Pain Using Gene Signature-Based Machine Learning Models.

作者信息

Lian Youzhi, Shi Yinyu, Shang Haibin, Zhan Hongsheng

机构信息

Baoshan Hospital Affiliated to Shanghai University of Chinese Medicine, Shanghai, 201999, China.

Baoshan District Integrated Traditional Chinese and Western Medicine Hospital, Shanghai, 201999, China.

出版信息

Pain Ther. 2025 Feb;14(1):359-373. doi: 10.1007/s40122-024-00700-8. Epub 2024 Dec 25.

Abstract

INTRODUCTION

Low back pain (LBP) is a significant global health burden, with variable treatment outcomes and an unclear underlying molecular mechanism. Effective prediction of treatment responses remains a challenge. In this study, we aimed to develop gene signature-based machine learning models using transcriptomic data from peripheral immune cells to predict treatment outcomes in patients with LBP.

METHODS

The transcriptomic data of patients with LBP from peripheral immune cells were retrieved from the GEO database. Patients with LBP were recruited, and treatment outcomes were assessed after 3 months. Patients were classified into two groups: those with resolved pain and those with persistent pain. Differentially expressed genes (DEGs) between the two groups were identified through bioinformatic analysis. Key genes were selected using five machine learning models, including Lasso, Elastic Net, Random Forest, SVM, and GBM. These key genes were then used to train 45 machine learning models by combining nine different algorithms: Logistic Regression, K-Nearest Neighbors, Support Vector Machine, Decision Tree, Random Forest, Gradient Boosting Machine, Multilayer Perceptron, Naive Bayes, and Linear Discriminant Analysis. Five-fold cross-validation was employed to ensure robust model evaluation and minimize overfitting. In each fold, the dataset was split into training and validation sets, with model performance assessed using multiple metrics including accuracy, precision, recall, and F1 score. The final model performance was reported as the mean and standard deviation across all five folds, providing a more reliable estimate of the models' ability to predict LBP treatment outcomes using gene expression data from peripheral immune cells.

RESULTS

A total of 61 DEGs were identified between patients with resolved and persistent pain. From these genes, 45 machine learning models were constructed using different combinations of feature selection methods and classification algorithms. The Elastic Net with Logistic Regression achieved the highest accuracy of 88.7% ± 8.0% (mean ± standard deviation), followed closely by Elastic Net with Linear Discriminant Analysis (88.7% ± 7.5%) and Lasso with Multilayer Perceptron (87.7% ± 6.7%). Overall, 15 models demonstrated robust performance with accuracy > 80%, suggesting the reliability of our machine learning approach in predicting LBP treatment outcomes. The SHapley Additive exPlanations (SHAP) method was used to visualize the contribution of core genes to model performance, highlighting their roles in predicting treatment outcomes.

CONCLUSION

The study demonstrates the potential of using transcriptomic data from peripheral immune cells and machine learning models to predict treatment outcomes in patients with LBP. The identification of key genes and the high accuracy of certain models provide a basis for future personalized treatment strategies in LBP management. Visualizing gene importance with SHAP adds interpretability to the predictive models, enhancing their clinical relevance.

摘要

引言

腰痛(LBP)是一项重大的全球健康负担,治疗结果各异,潜在分子机制尚不明确。有效预测治疗反应仍是一项挑战。在本研究中,我们旨在利用外周免疫细胞的转录组数据开发基于基因特征的机器学习模型,以预测腰痛患者的治疗结果。

方法

从GEO数据库中检索腰痛患者外周免疫细胞的转录组数据。招募腰痛患者,并在3个月后评估治疗结果。患者被分为两组:疼痛缓解组和持续疼痛组。通过生物信息学分析确定两组之间的差异表达基因(DEG)。使用包括套索回归(Lasso)、弹性网络(Elastic Net)、随机森林(Random Forest)、支持向量机(SVM)和梯度提升机(GBM)在内的五种机器学习模型选择关键基因。然后,通过结合九种不同算法:逻辑回归、K近邻、支持向量机、决策树、随机森林、梯度提升机、多层感知器、朴素贝叶斯和线性判别分析,使用这些关键基因训练45个机器学习模型。采用五折交叉验证以确保稳健的模型评估并最小化过拟合。在每一折中,将数据集分为训练集和验证集,使用包括准确率、精确率、召回率和F1分数在内的多个指标评估模型性能。最终模型性能报告为所有五折的平均值和标准差,从而更可靠地估计模型利用外周免疫细胞基因表达数据预测腰痛治疗结果的能力。

结果

在疼痛缓解和持续疼痛的患者之间共鉴定出61个DEG。从这些基因中,使用特征选择方法和分类算法的不同组合构建了45个机器学习模型。弹性网络与逻辑回归组合达到了最高准确率88.7%±8.0%(平均值±标准差),紧随其后的是弹性网络与线性判别分析(88.7%±7.5%)以及套索回归与多层感知器(87.7%±6.7%)。总体而言,15个模型表现出稳健性能,准确率>80%,表明我们的机器学习方法在预测腰痛治疗结果方面的可靠性。使用SHapley加法解释(SHAP)方法可视化核心基因对模型性能的贡献,突出了它们在预测治疗结果中的作用。

结论

该研究证明了利用外周免疫细胞的转录组数据和机器学习模型预测腰痛患者治疗结果的潜力。关键基因的鉴定以及某些模型的高准确率为未来腰痛管理中的个性化治疗策略提供了基础。用SHAP可视化基因重要性增加了预测模型的可解释性,提高了它们的临床相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38e8/11751268/562710d99f32/40122_2024_700_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验