Department of Radiology, University of Pittsburgh School of Medicine, 4200 Fifth Ave, Pittsburgh, PA, 15260, USA.
Department of Biomedical Informatics, University of Pittsburgh School of Medicine, 5607 Baum Blvd, Pittsburgh, PA, 15206, USA.
BMC Cancer. 2021 Apr 7;21(1):370. doi: 10.1186/s12885-021-08122-x.
The abundance of immune and stromal cells in the tumor microenvironment (TME) is informative of levels of inflammation, angiogenesis, and desmoplasia. Radiomics, an approach of extracting quantitative features from radiological imaging to characterize diseases, have been shown to predict molecular classification, cancer recurrence risk, and many other disease outcomes. However, the ability of radiomics methods to predict the abundance of various cell types in the TME remains unclear. In this study, we employed a radio-genomics approach and machine learning models to predict the infiltration of 10 cell types in breast cancer lesions utilizing radiomic features extracted from breast Dynamic Contrast Enhanced Magnetic Resonance Imaging.
We performed a retrospective study utilizing 73 patients from two independent institutions with imaging and gene expression data provided by The Cancer Imaging Archive (TCIA) and The Cancer Genome Atlas (TCGA), respectively. A set of 199 radiomic features including shape-based, morphological, texture, and kinetic characteristics were extracted from the lesion volumes. To capture one-to-one relationships between radiomic features and cell type abundance, we performed linear regression on each radiomic feature/cell type abundance combination. Each regression model was tested for statistical significance. In addition, multivariate models were built for the cell type infiltration status (i.e. "high" vs "low") prediction. A feature selection process via Recursive Feature Elimination was applied to the radiomic features on the training set. The classification models took the form of a binary logistic extreme gradient boosting framework. Two evaluation methods including leave-one-out cross validation and external independent test, were used for radiomic model learning and testing. The models' performance was measured via area under the receiver operating characteristic curve (AUC).
Univariate relationships were identified between a set of radiomic features and the abundance of fibroblasts. Multivariate models yielded leave-one-out cross validation AUCs ranging from 0.5 to 0.83, and independent test AUCs ranging from 0.5 to 0.68 for the multiple cell type invasion predictions.
On two independent breast cancer cohorts, breast MRI-derived radiomics are associated with the tumor's microenvironment in terms of the abundance of several cell types. Further evaluation with larger cohorts is needed.
肿瘤微环境(TME)中免疫细胞和基质细胞的丰富度反映了炎症、血管生成和纤维形成的水平。放射组学是一种从放射影像学中提取定量特征来描述疾病的方法,已被证明可预测分子分类、癌症复发风险和许多其他疾病结局。然而,放射组学方法预测 TME 中各种细胞类型丰度的能力尚不清楚。在这项研究中,我们采用放射基因组学方法和机器学习模型,利用从乳腺动态对比增强磁共振成像中提取的放射组学特征,预测乳腺癌病变中 10 种细胞类型的浸润。
我们进行了一项回顾性研究,纳入了来自两个独立机构的 73 名患者,其成像和基因表达数据分别来自癌症影像学档案(TCIA)和癌症基因组图谱(TCGA)。从病变体积中提取了一组 199 个放射组学特征,包括基于形状、形态、纹理和动力学特征。为了捕捉放射组学特征与细胞类型丰度之间的一对一关系,我们对每个放射组学特征/细胞类型丰度组合进行线性回归。每个回归模型均进行了统计学意义检验。此外,还构建了用于细胞类型浸润状态(即“高”与“低”)预测的多变量模型。通过递归特征消除对训练集中的放射组学特征进行特征选择过程。分类模型采用二进制逻辑极端梯度提升框架的形式。使用留一交叉验证和外部独立测试两种评估方法进行放射组学模型的学习和测试。通过接收者操作特征曲线下面积(AUC)来衡量模型的性能。
确定了一组放射组学特征与成纤维细胞丰度之间的单变量关系。多变量模型在对多种细胞类型浸润进行预测时,在留一交叉验证 AUC 范围为 0.5 至 0.83,在独立测试 AUC 范围为 0.5 至 0.68。
在两个独立的乳腺癌队列中,乳腺 MRI 衍生的放射组学与肿瘤微环境中几种细胞类型的丰度有关。需要更大的队列进行进一步评估。