Reiazi Reza, Prajapati Surendra, Fru Leonardo Che, Lee Dongyeon, Salehpour Mohammad
Department of Radiation Physics, Division of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.
Diagnostics (Basel). 2025 Mar 20;15(6):786. doi: 10.3390/diagnostics15060786.
Generalizability and domain dependency are critical challenges in developing predictive models for healthcare, particularly in medical diagnostics and radiation oncology. Predictive models designed to assess tumor recurrence rely on comprehensive and high-quality datasets, encompassing treatment planning parameters, imaging protocols, and patient-specific data. However, domain dependency, arising from variations in dose calculation algorithms, computed tomography (CT) density conversion curves, imaging modalities, and institutional protocols, can significantly undermine model reliability and clinical utility. This study evaluated dose calculation differences in the head and neck cancer treatment plans of 19 patients using two treatment planning systems, Pinnacle 9.10 and RayStation 11, with similar dose calculation algorithms. Variations in the dose grid size and CT density conversion curves were assessed for their impact on domain dependency. Results showed that dose grid size differences had a more significant influence within RayStation than Pinnacle, while CT curve variations introduced potential domain discrepancies. The findings underscore the critical role of precise and standardized treatment planning in enhancing the reliability of predictive modeling for tumor recurrence assessment. Incorporating treatment planning parameters, such as dose distribution and target volumes, as explicit features in model training can mitigate the impact of domain dependency and enhance prediction accuracy. Solutions such as multi-institutional data harmonization and domain adaptation techniques are essential to improve model generalizability and robustness. These strategies support the better integration of predictive modeling into clinical workflows, ultimately optimizing patient outcomes and personalized treatment strategies.
在开发医疗保健预测模型时,尤其是在医学诊断和放射肿瘤学领域,模型的可推广性和领域依赖性是关键挑战。旨在评估肿瘤复发的预测模型依赖于全面且高质量的数据集,包括治疗计划参数、成像协议和患者特定数据。然而,由于剂量计算算法、计算机断层扫描(CT)密度转换曲线、成像方式和机构协议的差异而产生的领域依赖性,可能会严重损害模型的可靠性和临床实用性。本研究使用两个具有相似剂量计算算法的治疗计划系统Pinnacle 9.10和RayStation 11,评估了19例头颈部癌患者治疗计划中的剂量计算差异。评估了剂量网格大小和CT密度转换曲线的变化对领域依赖性的影响。结果表明,剂量网格大小差异在RayStation系统中比在Pinnacle系统中影响更大,而CT曲线变化会引入潜在的领域差异。这些发现强调了精确和标准化治疗计划在提高肿瘤复发评估预测模型可靠性方面的关键作用。将治疗计划参数(如剂量分布和靶区体积)作为显式特征纳入模型训练可以减轻领域依赖性的影响并提高预测准确性。多机构数据协调和领域适应技术等解决方案对于提高模型的可推广性和稳健性至关重要。这些策略有助于将预测模型更好地整合到临床工作流程中,最终优化患者治疗结果和个性化治疗策略。