Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, No. 155, Sec. 2, Li-Nong St., Beitou District, Taipei, 112304, Taiwan.
Department of Obstetrics and Gynecology, MacKay Memorial Hospital, Taipei, Taiwan.
Support Care Cancer. 2024 Jul 24;32(8):544. doi: 10.1007/s00520-024-08757-z.
Muscle radiodensity loss after surgery and adjuvant chemotherapy is associated with poor outcomes in ovarian cancer. Assessing muscle radiodensity is a real-world clinical challenge owing to the requirement for computed tomography (CT) with consistent protocols and labor-intensive processes. This study aimed to use interpretable machine learning (ML) to predict muscle radiodensity loss.
This study included 723 patients with ovarian cancer who underwent primary debulking surgery and platinum-based chemotherapy between 2010 and 2019 at two tertiary centers (579 in cohort 1 and 144 in cohort 2). Muscle radiodensity was assessed from pre- and post-treatment CT acquired with consistent protocols, and a decrease in radiodensity ≥ 5% was defined as loss. Six ML models were trained, and their performances were evaluated using the area under the curve (AUC) and F1-score. The SHapley Additive exPlanations (SHAP) method was applied to interpret the ML models.
The CatBoost model achieved the highest AUC of 0.871 (95% confidence interval, 0.870-0.874) and F1-score of 0.688 (95% confidence interval, 0.685-0.691) among the models in the training set and outperformed in the external validation set, with an AUC of 0.839 and F1-score of 0.673. Albumin change, ascites, and residual disease were the most important features associated with a higher likelihood of muscle radiodensity loss. The SHAP force plot provided an individualized interpretation of model predictions.
An interpretable ML model can assist clinicians in identifying ovarian cancer patients at risk of muscle radiodensity loss after treatment and understanding the contributors of muscle radiodensity loss.
手术后和辅助化疗后的肌肉放射性密度损失与卵巢癌预后不良相关。由于需要具有一致协议的计算机断层扫描(CT)和劳动密集型过程,因此评估肌肉放射性密度是一个现实世界中的临床挑战。本研究旨在使用可解释的机器学习(ML)来预测肌肉放射性密度损失。
这项研究包括 723 名在 2010 年至 2019 年期间在两个三级中心接受初次减瘤手术和铂类化疗的卵巢癌患者(队列 1 中 579 名,队列 2 中 144 名)。使用具有一致协议的治疗前和治疗后 CT 评估肌肉放射性密度,并且将放射性密度降低≥5%定义为损失。训练了六个 ML 模型,并使用曲线下面积(AUC)和 F1 分数评估它们的性能。应用 SHapley Additive exPlanations(SHAP)方法来解释 ML 模型。
CatBoost 模型在训练集中获得了最高的 AUC 为 0.871(95%置信区间,0.870-0.874)和 F1 分数为 0.688(95%置信区间,0.685-0.691),在外部验证集中表现优于其他模型,AUC 为 0.839,F1 分数为 0.673。白蛋白变化、腹水和残留疾病是与肌肉放射性密度损失可能性增加相关的最重要特征。SHAP 力图提供了对模型预测的个体化解释。
可解释的 ML 模型可以帮助临床医生识别治疗后肌肉放射性密度损失风险较高的卵巢癌患者,并了解肌肉放射性密度损失的贡献因素。