Dwivedi Pooja, Barage Sagar, Singh Rajshri, Jha Ashish, Choudhury Sayak, Agrawal Archi, Rangarajan Venkatesh
Department of Nuclear Medicine and Molecular Imaging, Advanced Centre for Treatment Research and Education in Cancer, Tata Memorial Centre, Kharghar, Navi Mumbai, 410210, India.
Amity Institute of Biotechnology, Amity University Maharashtra, Mumbai-Pune Expressway, Bhatan, Panvel, 410206, India.
Phys Eng Sci Med. 2025 Aug 18. doi: 10.1007/s13246-025-01625-y.
Radiomic biomarkers have demonstrated significant potential in non-invasively assessing tumor biology and providing essential insights for precision medicine. However, the clinical translation is often hindered by challenges in multicenter studies, primarily due to a lack of standardization, such as variations in scanner models, acquisition protocols, reconstruction techniques, etc. This study aims to assess the impact of various harmonization methods in multicenter, 18 F-FDG PET-based radiomics for the classification of lung cancer histological subtypes using a machine learning model. Retrospective data included 178 lung cancer cohorts, comprising 117 adenocarcinomas and 61 squamous cell carcinomas from three different centers. PET DICOM image data was preprocessed with 3D ROI segmentation of the lung tumor and healthy liver, followed by the extraction of 111 radiomic features. Subsequently, Z-Score, Quantile, and ComBat were applied to generate three different harmonized datasets. Feature distribution was analyzed, and the top ten features were selected using recursive feature elimination. An eXtreme gradient boosting model was built on each dataset, and performance was assessed using accuracy, precision, sensitivity, specificity, and AUC with a 95% confidence interval. Variations in radiomic feature distribution and feature selection were observed after applying different harmonization methods. During validation of the trained model, AUC improved from 0.556 [95% CI 0.551-0.563] in the unharmonized data to 0.719 [95% CI 0.710-0.720], 0.952 [95% CI 0.951-0.954], and 0.996 [95% CI 0.995-0.996] in Z-Score, Quantile, and ComBat harmonized data, respectively, for classifying adenocarcinoma and squamous cell carcinoma subtypes. The study indicates that feature selection was affected by the different harmonization methods. The ComBat method was shown to significantly enhance the performance of AI-assisted PET radiomics.
放射组学生物标志物在非侵入性评估肿瘤生物学以及为精准医学提供重要见解方面已显示出巨大潜力。然而,临床转化往往受到多中心研究中各种挑战的阻碍,主要原因是缺乏标准化,例如扫描仪型号、采集协议、重建技术等方面的差异。本研究旨在评估多种归一化方法在基于18F-FDG PET的多中心放射组学中对使用机器学习模型分类肺癌组织学亚型的影响。回顾性数据包括178例肺癌队列,其中来自三个不同中心的117例腺癌和61例鳞状细胞癌。PET DICOM图像数据经过肺肿瘤和健康肝脏的3D ROI分割预处理,随后提取111个放射组学特征。随后,应用Z-Score、分位数和ComBat生成三个不同的归一化数据集。分析特征分布,并使用递归特征消除选择前十大特征。在每个数据集上构建一个极端梯度提升模型,并使用准确度、精确率、灵敏度、特异性和AUC以及95%置信区间评估性能。应用不同的归一化方法后,观察到放射组学特征分布和特征选择存在差异。在训练模型的验证过程中,AUC从不归一化数据中的0.556 [95% CI 0.551-0.563]分别提高到Z-Score、分位数和ComBat归一化数据中的0.719 [95% CI 0.710-0.720]、0.952 [95% CI 0.951-0.954]和0.996 [95% CI 0.995-0.996],用于分类腺癌和鳞状细胞癌亚型。该研究表明,特征选择受不同归一化方法的影响。ComBat方法显示出显著提高人工智能辅助PET放射组学的性能。