Chen Luyao, Fu Zhehong, Dong Qianxi, Zheng Fuchun, Wang Zhipeng, Li Sheng, Zhan Xiangpeng, Dong Wentao, Song Yanping, Xu Songhui, Fu Bin, Xiong Situ
Jiangxi Provincial Key Laboratory of Urinary System Diseases, Department of Urology, The First Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, Jiangxi, China.
Department of Computer Science, Columbia University, New York, NY.
Urology. 2024 Dec;194:180-188. doi: 10.1016/j.urology.2024.08.011. Epub 2024 Aug 15.
To construct and externally validate machine learning-based nomograms for predicting progression stages of initial prostate cancer (PCa) using biomarkers and clinicopathologic features.
Three hundred sixty-two inpatients diagnosed with PCa at the First Affiliated Hospital were randomly assigned to training and testing sets in a 3:7 ratio, while 136 PCa patients from People's Hospital formed the external validation set. Imaging and clinicopathologic information were collected. Optimal features distinguishing advanced prostate cancer (APC) and metastatic PCa (mPCa) were identified through logistic regression (LR). ML algorithms were employed to build and compare ML models. The best-performing algorithm established models for PCa progression stage. Models performance was evaluated using metrics, ROC curves, calibration, and decision curve analysis (DCA) in training, testing, and external validation sets.
Following LR analyses, PSA (P = .001), maximum tumor diameter (P = .026), Gleason score (P <.001), and RNF41 (P <.001) were optimal features for predicting APC, while ALP (P <.001), PSA (P <.001), and GS score (P = .024) were for mPCa. Among ML models, the LR models exhibited superior performance. Consequently, the LR algorithm was used for the APC-risk-nomogram and mPCa-risk-nomogram construction, with AUC values of 0.848, 0.814, 0.810, and 0.940, 0.913, 0.910, in the training, testing, and external validation sets, respectively. Calibration and DCA curves affirmed nomograms' consistency and net benefits for clinical decision-making.
In summary, ML-based APC-risk-nomogram and mPCa-risk-nomogram exhibit outstanding predictive performance for PCa progression stages. These nomograms can assist clinicians in finely categorizing newly diagnosed PCa patients, facilitating personalized treatment plans and prognosis assessment.
构建并外部验证基于机器学习的列线图,以利用生物标志物和临床病理特征预测初发前列腺癌(PCa)的进展阶段。
将第一附属医院362例诊断为PCa的住院患者按3:7的比例随机分为训练集和测试集,而来自人民医院的136例PCa患者组成外部验证集。收集影像和临床病理信息。通过逻辑回归(LR)确定区分晚期前列腺癌(APC)和转移性PCa(mPCa)的最佳特征。采用机器学习算法构建并比较机器学习模型。表现最佳的算法为PCa进展阶段建立模型。在训练集、测试集和外部验证集中,使用指标、ROC曲线、校准和决策曲线分析(DCA)评估模型性能。
经过LR分析,PSA(P = .001)、最大肿瘤直径(P = .026)、 Gleason评分(P < .001)和RNF41(P < .001)是预测APC的最佳特征,而碱性磷酸酶(ALP,P < .001)、PSA(P < .001)和GS评分(P = .024)是预测mPCa的最佳特征。在机器学习模型中,LR模型表现出卓越性能。因此,使用LR算法构建APC风险列线图和mPCa风险列线图,在训练集、测试集和外部验证集中的AUC值分别为0.848、0.814、0.810和0.940、0.913、0.910。校准曲线和DCA曲线证实了列线图在临床决策中的一致性和净效益。
总之,基于机器学习的APC风险列线图和mPCa风险列线图对PCa进展阶段具有出色的预测性能。这些列线图可帮助临床医生对新诊断的PCa患者进行精细分类,促进个性化治疗方案制定和预后评估。