Molecular Radiotherapy and Innovative Therapeutics, INSERM UMR1030, Gustave Roussy Cancer Campus, Paris-Saclay University, Villejuif, France.
Department of Radiotherapy, Gustave Roussy, Paris-Saclay University, 94805, Villejuif, France.
Sci Rep. 2020 Jul 23;10(1):12340. doi: 10.1038/s41598-020-69298-z.
Radiomics relies on the extraction of a wide variety of quantitative image-based features to provide decision support. Magnetic resonance imaging (MRI) contributes to the personalization of patient care but suffers from being highly dependent on acquisition and reconstruction parameters. Today, there are no guidelines regarding the optimal pre-processing of MR images in the context of radiomics, which is crucial for the generalization of published image-based signatures. This study aims to assess the impact of three different intensity normalization methods (Nyul, WhiteStripe, Z-Score) typically used in MRI together with two methods for intensity discretization (fixed bin size and fixed bin number). The impact of these methods was evaluated on first- and second-order radiomics features extracted from brain MRI, establishing a unified methodology for future radiomics studies. Two independent MRI datasets were used. The first one (DATASET1) included 20 institutional patients with WHO grade II and III gliomas who underwent post-contrast 3D axial T1-weighted (T1w-gd) and axial T2-weighted fluid attenuation inversion recovery (T2w-flair) sequences on two different MR devices (1.5 T and 3.0 T) with a 1-month delay. Jensen-Shannon divergence was used to compare pairs of intensity histograms before and after normalization. The stability of first-order and second-order features across the two acquisitions was analysed using the concordance correlation coefficient and the intra-class correlation coefficient. The second dataset (DATASET2) was extracted from the public TCIA database and included 108 patients with WHO grade II and III gliomas and 135 patients with WHO grade IV glioblastomas. The impact of normalization and discretization methods was evaluated based on a tumour grade classification task (balanced accuracy measurement) using five well-established machine learning algorithms. Intensity normalization highly improved the robustness of first-order features and the performances of subsequent classification models. For the T1w-gd sequence, the mean balanced accuracy for tumour grade classification was increased from 0.67 (95% CI 0.61-0.73) to 0.82 (95% CI 0.79-0.84, P = .006), 0.79 (95% CI 0.76-0.82, P = .021) and 0.82 (95% CI 0.80-0.85, P = .005), respectively, using the Nyul, WhiteStripe and Z-Score normalization methods compared to no normalization. The relative discretization makes unnecessary the use of intensity normalization for the second-order radiomics features. Even if the bin number for the discretization had a small impact on classification performances, a good compromise was obtained using the 32 bins considering both T1w-gd and T2w-flair sequences. No significant improvements in classification performances were observed using feature selection. A standardized pre-processing pipeline is proposed for the use of radiomics in MRI of brain tumours. For models based on first- and second-order features, we recommend normalizing images with the Z-Score method and adopting an absolute discretization approach. For second-order feature-based signatures, relative discretization can be used without prior normalization. In both cases, 32 bins for discretization are recommended. This study may pave the way for the multicentric development and validation of MR-based radiomics biomarkers.
放射组学依赖于提取各种定量的基于图像的特征,以提供决策支持。磁共振成像(MRI)有助于实现患者护理的个性化,但严重依赖于采集和重建参数。目前,在放射组学的背景下,还没有关于 MRI 图像最佳预处理的指南,这对于已发表的基于图像特征的推广至关重要。本研究旨在评估三种常用于 MRI 的不同强度归一化方法(Nyul、WhiteStripe、Z-Score)以及两种强度离散化方法(固定-bin 大小和固定-bin 数量)的影响。通过从脑 MRI 中提取一阶和二阶放射组学特征来评估这些方法的影响,为未来的放射组学研究建立统一的方法。使用了两个独立的 MRI 数据集。第一个数据集(DATASET1)包括 20 名在两个不同的 MRI 设备(1.5T 和 3.0T)上接受了对比后 3D 轴向 T1 加权(T1w-gd)和轴向 T2 加权液体衰减反转恢复(T2w-flair)序列的患有 WHO 分级 II 和 III 级胶质瘤的机构患者,延迟 1 个月。使用 Jensen-Shannon 散度比较归一化前后的强度直方图对。使用一致性相关系数和组内相关系数分析了两个采集之间的一阶和二阶特征的稳定性。第二个数据集(DATASET2)从公共 TCIA 数据库中提取,包括 108 名患有 WHO 分级 II 和 III 级胶质瘤和 135 名患有 WHO 分级 IV 级胶质母细胞瘤的患者。使用五种成熟的机器学习算法,基于肿瘤分级分类任务(平衡准确性测量)评估归一化和离散化方法的影响。强度归一化极大地提高了一阶特征的稳健性和后续分类模型的性能。对于 T1w-gd 序列,使用 Nyul、WhiteStripe 和 Z-Score 归一化方法分别将肿瘤分级分类的平均平衡准确性从 0.67(95%CI 0.61-0.73)提高到 0.82(95%CI 0.79-0.84,P = .006)、0.79(95%CI 0.76-0.82,P = .021)和 0.82(95%CI 0.80-0.85,P = .005)。对于二阶放射组学特征,相对离散化使得二阶放射组学特征的强度归一化变得不必要。即使离散化的 bin 数量对分类性能有很小的影响,但考虑到 T1w-gd 和 T2w-flair 序列,使用 32 个 bin 可以获得很好的折衷。使用特征选择未观察到分类性能的显著提高。为脑肿瘤的 MRI 提出了一种标准化的预处理管道。对于基于一阶和二阶特征的模型,我们建议使用 Z-Score 方法对图像进行归一化,并采用绝对离散化方法。对于基于二阶特征的特征,无需先进行归一化即可使用相对离散化。在这两种情况下,建议使用 32 个 bin 进行离散化。本研究可能为基于 MRI 的放射组学生物标志物的多中心开发和验证铺平道路。