Hamano Jun, Takeuchi Ayano, Keyaki Tomoya, Nose Hidemasa, Hayashi Kenichi
Palliative and Supportive Care, University of Tsukuba, Tsukuba, JPN.
Science and Engineering, Chuo University, Tokyo, JPN.
Cureus. 2024 Dec 22;16(12):e76227. doi: 10.7759/cureus.76227. eCollection 2024 Dec.
Accurate prognosis prediction for cancer patients in palliative care is critical for clinical decision-making and personalized care. Traditional statistical models have been complemented by machine learning approaches; however, their comparative effectiveness remains underexplored.
To assess the prognostic accuracy of statistical and machine learning models in predicting 30-day survival in patients with advanced cancer using objective data, such as the result of the blood test.
A secondary analysis of the Japan-Prognostic Assessment Tools Validation (J-ProVal) study was performed from September 2012 to April 2014. We used data from 58 palliative care services in Japan and enrolled 915 patients. Four models, fractional polynomial (FP) regression, Kernel Fisher discriminant analysis (KFDA), Kernel support vector machine (KSVM), and XGBoost, were compared using 17 objective clinical characteristics. Models were evaluated with the area under the receiver operating characteristic curve (AUC) as the primary metric.
The KSVM model demonstrated the highest predictive accuracy (AUC: 0.834), outperforming the FP model (AUC: 0.799). XGBoost showed comparatively lower performance; however, it was likely limited by the size of the dataset.
Machine learning, particularly KSVM, has high predictive accuracy in palliative care when sufficient data are available. However, our findings suggest that traditional statistical models offer advantages in stability and interpretability, underscoring the importance of tailored model selection based on data characteristics.
准确预测癌症姑息治疗患者的预后对于临床决策和个性化护理至关重要。传统统计模型已得到机器学习方法的补充;然而,它们的相对有效性仍未得到充分探索。
使用客观数据(如血液检测结果)评估统计模型和机器学习模型预测晚期癌症患者30天生存率的预后准确性。
对2012年9月至2014年4月进行的日本预后评估工具验证(J-ProVal)研究进行二次分析。我们使用了日本58家姑息治疗服务机构的数据,纳入了915名患者。使用17项客观临床特征比较了分数多项式(FP)回归、核Fisher判别分析(KFDA)、核支持向量机(KSVM)和XGBoost这四种模型。以受试者操作特征曲线下面积(AUC)作为主要指标对模型进行评估。
KSVM模型显示出最高的预测准确性(AUC:0.834),优于FP模型(AUC:0.799)。XGBoost的表现相对较低;然而,它可能受到数据集大小的限制。
当有足够数据时,机器学习,尤其是KSVM,在姑息治疗中具有较高的预测准确性。然而,我们的研究结果表明,传统统计模型在稳定性和可解释性方面具有优势,强调了根据数据特征选择合适模型的重要性。