Suppr超能文献

基于多种机器学习模型的经皮肾镜取石术后无结石率的预测价值

Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models.

作者信息

Liu Zhao Rong, Yu Zhan Jiang, Zhou Jie, Huang Jian Biao

机构信息

Jiangxi Cancer Hospital & Institute, Jiangxi Clinical Research Center for Cancer, The Second Affiliated Hospital of Nanchang Medical College, Nanchang, China.

Department of Urology, Yudu County People's Hospital, Yudu, China.

出版信息

Front Med (Lausanne). 2025 Aug 19;12:1559613. doi: 10.3389/fmed.2025.1559613. eCollection 2025.

Abstract

PURPOSE

This study aimed to develop three types of machine learning (ML) models based on gradient boosting decision tree (GBDT), random forest (RF), and extreme gradient boosting (XGBoost) to explore their predictive value for the stone-free rate after percutaneous nephrolithotomy (PCNL).

PATIENTS AND METHODS

A retrospective analysis was conducted on 160 patients who underwent PCNL. The patients were randomly divided into a training set and a test set in a 7:3 ratio. Clinical data were collected, and univariate analysis was performed to identify important data significantly associated with the stone-free rate after PCNL. Three ML models (GBDT, RF, and XGBoost) were developed using the training set. The predictive performance of these models was evaluated using the area under the curve (AUC) of the receiver operating characteristic (ROC) on the test set, confusion matrix, specificity, sensitivity, accuracy, and F1 score. For the top-performing prediction model, the study further employed the SHapley Additive exPlanations (SHAP) method to enhance model interpretability by elucidating the contribution of individual features to the prediction outcomes and ranking the relative importance of the predictive data. Finally, the clinical utility of the model was assessed through decision curve analysis (DCA), which quantified the net clinical benefit of applying the model across various risk thresholds.

RESULTS

Postoperative statistics indicated a stone-free rate of 70.6% ( = 113) among the patients. The data significantly associated with the absence of residual stones included the number of stones, stone diameter, stone CT value, history of previous stone surgery, stone location, and stone shape ( < 0.05). All three models demonstrated strong predictive effects in the validation set, with the GBDT model showing superior performance [AUC: 0.836 (95% CI: 0.785-0.873); accuracy: 0.854; sensitivity: 0.853; specificity: 0.857] compared to the XGBoost [AUC: 0.830 (95% CI: 0.792-0.868); accuracy: 0.771; sensitivity: 0.824; specificity: 0.643] and RF models [AUC: 0.803 (95% CI: 0.763-0.837); accuracy: 0.792; sensitivity: 0.824; specificity: 0.714]. The F1 scores for the GBDT, RF, and XGBoost models were 0.892, 0.836, and 0.849, respectively. The DCA decision curve analysis confirmed that the GBDT model offers a favorable net clinical benefit. In addition, the SHAP analysis identified the number of stones and the stone CT value as the most critical features influencing the model's predictions, contributing significantly to its overall predictive performance.

CONCLUSION

The prediction models developed based on three machine learning algorithms can accurately predict the stone-free rate after PCNL in patients with urinary tract stones. Among these, the GBDT model can effectively identify patients who are most likely to achieve successful outcomes from PCNL based on demographic and stone characteristics, thereby assisting in clinical treatment decision-making.

摘要

目的

本研究旨在开发基于梯度提升决策树(GBDT)、随机森林(RF)和极端梯度提升(XGBoost)的三种机器学习(ML)模型,以探讨它们对经皮肾镜取石术(PCNL)后无石率的预测价值。

患者与方法

对160例行PCNL的患者进行回顾性分析。患者按7:3的比例随机分为训练集和测试集。收集临床数据,并进行单因素分析以确定与PCNL后无石率显著相关的重要数据。使用训练集开发三种ML模型(GBDT、RF和XGBoost)。使用测试集上的受试者操作特征(ROC)曲线下面积(AUC)、混淆矩阵、特异性、敏感性、准确性和F1分数评估这些模型的预测性能。对于表现最佳的预测模型,该研究进一步采用SHapley加性解释(SHAP)方法,通过阐明个体特征对预测结果的贡献并对预测数据的相对重要性进行排序,来提高模型的可解释性。最后,通过决策曲线分析(DCA)评估模型的临床实用性,该分析量化了在各种风险阈值下应用该模型的净临床益处。

结果

术后统计显示患者的无石率为70.6%(n = 113)。与无残留结石显著相关的数据包括结石数量、结石直径、结石CT值、既往结石手术史、结石位置和结石形状(P < 0.05)。所有三种模型在验证集中均显示出强大的预测效果,与XGBoost模型[AUC:0.830(95%CI:0.792 - 0.868);准确性:0.771;敏感性:0.824;特异性:0.643]和RF模型[AUC:0.803(95%CI:0.763 - 0.837);准确性:0.792;敏感性:0.824;特异性:0.714]相比,GBDT模型表现更优[AUC:0.836(95%CI:0.785 - 0.873);准确性:0.854;敏感性:0.853;特异性:0.857]。GBDT、RF和XGBoost模型的F1分数分别为0.892、0.836和0.849。DCA决策曲线分析证实GBDT模型提供了良好的净临床益处。此外,SHAP分析确定结石数量和结石CT值是影响模型预测的最关键特征,对其整体预测性能有显著贡献。

结论

基于三种机器学习算法开发的预测模型能够准确预测尿路结石患者PCNL后的无石率。其中,GBDT模型可以根据人口统计学和结石特征有效识别最有可能从PCNL中获得成功结果的患者,从而辅助临床治疗决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/395d/12401985/b5f8e19c24ee/fmed-12-1559613-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验