Institute of information Systems, University of Applied Sciences Western Switzerland (HES-SO), TechnoArk 3, 3960, Sierre, Switzerland.
CLCC Eugene Marquis, INSERM, LTSI - UMR 1099, Univ Rennes, 35000, Rennes, France.
Sci Rep. 2020 Nov 12;10(1):19679. doi: 10.1038/s41598-020-76310-z.
In standard radiomics studies the features extracted from clinical images are mostly quantified with simple statistics such as the average or variance per Region of Interest (ROI). Such approaches may smooth out any intra-region heterogeneity and thus hide some tumor aggressiveness that may hamper predictions. In this paper we study the importance of feature aggregation within the standard radiomics workflow, which allows to take into account intra-region variations. Feature aggregation methods transform a collection of voxel values from feature response maps (over a ROI) into one or several scalar values that are usable for statistical or machine learning algorithms. This important step has been little investigated within the radiomics workflows, so far. In this paper, we compare several aggregation methods with standard radiomics approaches in order to assess the improvements in prediction capabilities. We evaluate the performance using an aggregation function based on Bags of Visual Words (BoVW), which allows for the preservation of piece-wise homogeneous information within heterogeneous regions and compared with standard methods. The different models are compared on a cohort of 214 head and neck cancer patients coming from 4 medical centers. Radiomics features were extracted from manually delineated tumors in clinical PET-FDG and CT images were analyzed. We compared the performance of standard radiomics models, the volume of the ROI alone and the BoVW model for survival analysis. The average concordance index was estimated with a five fold cross-validation. The performance was significantly better using the BoVW model 0.627 (95% CI: 0.616-0.637) as compared to standard radiomics0.505 (95% CI: 0.499-0.511), mean-var. 0.543 (95% CI: 0.536-0.549), mean0.547 (95% CI: 0.541-0.554), var.0.530 (95% CI: 0.524-0.536) or volume 0.577 (95% CI: 0.571-0.582). We conclude that classical aggregation methods are not optimal in case of heterogeneous tumors. We also showed that the BoVW model is a better alternative to extract consistent features in the presence of lesions composed of heterogeneous tissue.
在标准的放射组学研究中,从临床图像中提取的特征主要使用简单的统计方法进行量化,例如每个感兴趣区域(ROI)的平均值或方差。这种方法可能会平滑任何区域内的异质性,从而掩盖一些可能阻碍预测的肿瘤侵袭性。在本文中,我们研究了标准放射组学工作流程中特征聚合的重要性,这可以考虑到区域内的变化。特征聚合方法将来自特征响应图(ROI 上)的一组体素值转换为一个或多个可用于统计或机器学习算法的标量值。到目前为止,在放射组学工作流程中,这个重要的步骤还没有得到充分的研究。在本文中,我们比较了几种聚合方法与标准放射组学方法,以评估在预测能力方面的改进。我们使用基于视觉词袋(BoVW)的聚合函数来评估性能,该函数允许在异质区域内保留分段同质信息,并与标准方法进行比较。不同的模型在来自 4 个医疗中心的 214 名头颈部癌症患者的队列中进行了比较。从临床 PET-FDG 和 CT 图像中手动勾画的肿瘤中提取了放射组学特征,并对其进行了分析。我们比较了标准放射组学模型、ROI 体积和 BoVW 模型的生存分析性能。使用五折交叉验证估计平均一致性指数。与标准放射组学模型 0.505(95%CI:0.499-0.511)、均值方差 0.543(95%CI:0.536-0.549)、均值 0.547(95%CI:0.541-0.554)、方差 0.530(95%CI:0.524-0.536)或体积 0.577(95%CI:0.571-0.582)相比,BoVW 模型 0.627(95%CI:0.616-0.637)的性能显著提高。我们得出结论,在存在异质肿瘤的情况下,经典的聚合方法并不理想。我们还表明,BoVW 模型是在存在由异质组织组成的病变时提取一致特征的更好选择。