Department of Urology, UCSF, San Francisco, California, USA.
J Endourol. 2024 Aug;38(8):809-816. doi: 10.1089/end.2023.0457.
The absence of predictive markers for kidney stone recurrence poses a challenge for the clinical management of stone disease. The unpredictability of stone events is also a significant limitation for clinical trials, where many patients must be enrolled to obtain sufficient stone events for analysis. In this study, we sought to use machine learning methods to identify a novel algorithm to predict stone recurrence. Patients enrolled in the Registry for Stones of the Kidney and Ureter (ReSKU), a registry of nephrolithiasis patients collected between 2015-2020, with at least one prospectively collected 24-hour urine test (Litholink 24-hour urine test; Labcorp) were included in the training set. A validation set was obtained from chart review of stone patients not enrolled in ReSKU with 24-hour urine data. Stone events were defined as either an office visit where a patient reports symptomatic passage of stones or a surgical procedure for stone removal. Seven prediction classification methods were evaluated. Predictive analyses and receiver operator characteristics (ROC) curve generation were performed in R. A training set of 423 kidney stone patients with stone event data and 24-hour urine samples were trained using the prediction classification methods. The highest performing prediction model was a Logistic Regression with ElasticNet machine learning model (area under curve [AUC] = 0.65). Restricting analysis to high confidence predictions significantly improved model accuracy (AUC = 0.82). The prediction model was validated on a validation set of 172 stone patients with stone event data and 24-hour urine samples. Prediction accuracy in the validation set demonstrated moderate discriminative ability (AUC = 0.64). Repeat modeling was performed with four of the highest scoring features, and ROC analyses demonstrated minimal loss in accuracy (AUC = 0.63). Machine-learning models based on 24-hour urine data can predict stone recurrences with a moderate degree of accuracy.
肾结石复发的预测标志物缺失给结石病的临床管理带来了挑战。结石事件的不可预测性也是临床试验的一个重大限制,因为需要招募许多患者才能获得足够的结石事件进行分析。在这项研究中,我们试图使用机器学习方法来确定一种新的算法来预测结石复发。
在 2015-2020 年间收集的肾结石和输尿管结石登记处(ReSKU)登记处中,至少有一项前瞻性收集的 24 小时尿液测试(Litholink 24 小时尿液测试;Labcorp)的患者被纳入训练集。验证集是从未纳入 ReSKU 的结石患者的图表回顾中获得的,这些患者有 24 小时尿液数据。结石事件定义为患者报告有症状的结石排出或进行结石清除的手术。评估了七种预测分类方法。预测分析和接收者操作特征(ROC)曲线生成在 R 中进行。
使用预测分类方法对 423 名肾结石患者的结石事件数据和 24 小时尿液样本进行了训练。表现最好的预测模型是带有弹性网络机器学习模型的逻辑回归(曲线下面积[AUC] = 0.65)。将分析限制在高置信度预测可以显著提高模型准确性(AUC = 0.82)。在有结石事件数据和 24 小时尿液样本的验证集中对预测模型进行了验证。验证集中的预测准确性表现出中等的辨别能力(AUC = 0.64)。对四个得分最高的特征进行了重复建模,ROC 分析表明准确性几乎没有损失(AUC = 0.63)。
基于 24 小时尿液数据的机器学习模型可以在一定程度上准确预测结石复发。