Wu Huiqin, Liu Xiaohui, Peng Lihong, Yang Yuling, Zhou Zidong, Du Dongyang, Xu Hui, Lv Wenbing, Lu Lijun
Department of Medical Imaging, Guangdong Second Provincial General Hospital, Guangzhou, Guangdong, 518037, People's Republic of China.
School of Biomedical Engineering and Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, Guangdong, 510515, People's Republic of China.
Phys Med Biol. 2023 Nov 16;68(22). doi: 10.1088/1361-6560/ad03d1.
. To determine the optimal approach for identifying and mitigating batch effects in PET/CT radiomics features, and further improve the prognosis of patients with head and neck cancer (HNC), this study investigated the performance of three batch harmonization methods.. Unsupervised harmonization identified the batch labels by K-means clustering. Supervised harmonization regarding the image acquisition factors (center, manufacturer, scanner, filter kernel) as known/given batch labels, and Combat harmonization was then implemented separately and sequentially based on the batch labels, i.e. harmonizing features among batches determined by each factor individually or harmonizing features among batches determined by multiple factors successively. Extensive experiments were conducted to predict overall survival (OS) on public PET/CT datasets that contain 800 patients from 9 centers.. In the external validation cohort, results show that compared to original models without harmonization, Combat harmonization would be beneficial in OS prediction with C-index of 0.687-0.740 versus 0.684-0.767. Supervised harmonization slightly outperformed unsupervised harmonization in all models (C-index: 0.692-0.767 versus 0.684-0.750). Separate harmonization outperformed sequential harmonization in CT_m+clinic and CT_cm+clinic models with C-index of 0.752 and 0.722, respectively, while sequential harmonization involved clinical features in PET_rs+clinic model further improving the performance and achieving the highest C-index of 0.767.. Optimal batch determination especially sequential harmonization for Combat holds the potential to improve the prognostic power of radiomics model in multi-center HNC dataset with PET/CT imaging.
为了确定识别和减轻PET/CT放射组学特征中批次效应的最佳方法,并进一步改善头颈癌(HNC)患者的预后,本研究调查了三种批次归一化方法的性能。无监督归一化通过K均值聚类识别批次标签。将图像采集因素(中心、制造商、扫描仪、滤波核)作为已知/给定的批次标签进行有监督归一化,然后基于批次标签分别依次实施Combat归一化,即分别对由每个因素确定的批次之间的特征进行归一化,或依次对由多个因素确定的批次之间的特征进行归一化。在包含来自9个中心的800名患者的公共PET/CT数据集上进行了广泛的实验以预测总生存期(OS)。在外部验证队列中,结果表明,与未进行归一化的原始模型相比,Combat归一化在OS预测中具有优势,C指数为0.687 - 0.740,而原始模型为0.684 - 0.767。在所有模型中,有监督归一化略优于无监督归一化(C指数:0.692 - 0.767对0.684 - 0.750)。在CT_m + clinic和CT_cm + clinic模型中,单独归一化优于顺序归一化,C指数分别为0.752和0.722,而在PET_rs + clinic模型中顺序归一化纳入临床特征进一步提高了性能,达到了最高C指数0.767。最佳批次确定,尤其是Combat的顺序归一化,有可能提高多中心HNC数据集PET/CT成像放射组学模型的预后预测能力。