Department of Orthopaedic Surgery, National Taiwan University Hospital, Taipei, Taiwan.
Department of Medical Education, National Taiwan University Hospital, Taipei, Taiwan.
Cancer Med. 2023 Jul;12(13):14264-14281. doi: 10.1002/cam4.6097. Epub 2023 Jun 12.
Survival is an important factor to consider when clinicians make treatment decisions for patients with skeletal metastasis. Several preoperative scoring systems (PSSs) have been developed to aid in survival prediction. Although we previously validated the Skeletal Oncology Research Group Machine-learning Algorithm (SORG-MLA) in Taiwanese patients of Han Chinese descent, the performance of other existing PSSs remains largely unknown outside their respective development cohorts. We aim to determine which PSS performs best in this unique population and provide a direct comparison between these models.
We retrospectively included 356 patients undergoing surgical treatment for extremity metastasis at a tertiary center in Taiwan to validate and compare eight PSSs. Discrimination (c-index), decision curve (DCA), calibration (ratio of observed:expected survivors), and overall performance (Brier score) analyses were conducted to evaluate these models' performance in our cohort.
The discriminatory ability of all PSSs declined in our Taiwanese cohort compared with their Western validations. SORG-MLA is the only PSS that still demonstrated excellent discrimination (c-indexes>0.8) in our patients. SORG-MLA also brought the most net benefit across a wide range of risk probabilities on DCA with its 3-month and 12-month survival predictions.
Clinicians should consider potential ethnogeographic variations of a PSS's performance when applying it onto their specific patient populations. Further international validation studies are needed to ensure that existing PSSs are generalizable and can be integrated into the shared treatment decision-making process. As cancer treatment keeps advancing, researchers developing a new prediction model or refining an existing one could potentially improve their algorithm's performance by using data gathered from more recent patients that are reflective of the current state of cancer care.
当临床医生为患有骨骼转移的患者做出治疗决策时,生存是一个重要的考虑因素。已经开发了几种术前评分系统(PSS)来帮助预测生存。尽管我们之前在台湾汉族患者中验证了骨骼肿瘤研究组机器学习算法(SORG-MLA),但其他现有 PSS 的性能在其各自的开发队列之外仍然知之甚少。我们旨在确定哪种 PSS 在这个独特的人群中表现最好,并在这些模型之间进行直接比较。
我们回顾性地纳入了 356 名在台湾一家三级中心接受手术治疗肢体转移的患者,以验证和比较 8 种 PSS。进行了区分度(c 指数)、决策曲线(DCA)、校准(观察到的幸存者与预期幸存者的比例)和整体性能(Brier 评分)分析,以评估这些模型在我们队列中的表现。
与西方验证相比,所有 PSS 的区分能力在我们的台湾队列中均下降。SORG-MLA 是唯一在我们的患者中仍表现出出色区分能力(c 指数>0.8)的 PSS。SORG-MLA 还通过其 3 个月和 12 个月的生存预测,在 DCA 上为广泛的风险概率带来了最大的净收益。
临床医生在将 PSS 应用于特定患者人群时,应考虑其性能的潜在种族地理差异。需要进一步的国际验证研究,以确保现有的 PSS 具有普遍性,可以纳入共同的治疗决策制定过程。随着癌症治疗的不断进步,研究人员开发新的预测模型或改进现有的模型,通过使用反映当前癌症治疗现状的最近患者的数据,可能会提高其算法的性能。