Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, Texas.
Department of Pathology, University of Texas Southwestern Medical Center, Dallas, Texas.
Cancer Res. 2022 Aug 3;82(15):2792-2806. doi: 10.1158/0008-5472.CAN-21-2318.
Intratumoral heterogeneity arising from tumor evolution poses significant challenges biologically and clinically. Dissecting this complexity may benefit from deep learning (DL) algorithms, which can infer molecular features from ubiquitous hematoxylin and eosin (H&E)-stained tissue sections. Although DL algorithms have been developed to predict some driver mutations from H&E images, the ability of these DL algorithms to resolve intratumoral mutation heterogeneity at subclonal spatial resolution is unexplored. Here, we apply DL to a paradigm of intratumoral heterogeneity, clear cell renal cell carcinoma (ccRCC), the most common type of kidney cancer. Matched IHC and H&E images were leveraged to develop DL models for predicting intratumoral genetic heterogeneity of the three most frequently mutated ccRCC genes, BAP1, PBRM1, and SETD2. DL models were generated on a large cohort (N = 1,282) and tested on several independent cohorts, including a TCGA cohort (N = 363 patients) and two tissue microarray (TMA) cohorts (N = 118 and 365 patients). These models were also expanded to a patient-derived xenograft (PDX) TMA, affording analysis of homotopic and heterotopic interactions of tumor and stroma. The status of all three genes could be inferred by DL, with BAP1 showing the highest sensitivity and performance within and across tissue samples (AUC = 0.87-0.89 on holdout). BAP1 results were validated on independent human (AUC = 0.77-0.84) and PDX (AUC = 0.80) cohorts. Finally, BAP1 predictions correlated with clinical outputs such as disease-specific survival. Overall, these data show that DL models can resolve intratumoral heterogeneity in cancer with potential diagnostic, prognostic, and biological implications.
This work demonstrates the potential for deep learning analysis of histopathologic images to serve as a fast, low-cost method to assess genetic intratumoral heterogeneity. See related commentary by Song et al., p. 2672.
肿瘤进化引起的肿瘤内异质性在生物学和临床上都带来了重大挑战。从深度学习 (DL) 算法中可以推断出普遍存在的苏木精和曙红 (H&E) 染色组织切片的分子特征,从而可以剖析这种复杂性。尽管已经开发出 DL 算法来从 H&E 图像中预测某些驱动突变,但这些 DL 算法在亚克隆空间分辨率下解析肿瘤内突变异质性的能力尚未得到探索。在这里,我们将 DL 应用于肿瘤内异质性的范例——透明细胞肾细胞癌 (ccRCC),这是最常见的肾癌类型。利用匹配的免疫组化和 H&E 图像,开发了用于预测三种最常突变的 ccRCC 基因 BAP1、PBRM1 和 SETD2 的肿瘤内遗传异质性的 DL 模型。DL 模型是在一个大的队列 (N = 1,282) 上生成的,并在几个独立的队列上进行了测试,包括 TCGA 队列 (N = 363 名患者) 和两个组织微阵列 (TMA) 队列 (N = 118 和 365 名患者)。这些模型也扩展到了患者来源的异种移植 (PDX) TMA,从而可以分析肿瘤和基质的同源和异源相互作用。通过 DL 可以推断出所有三种基因的状态,BAP1 在组织样本内和跨组织样本的灵敏度和性能最高 (在保留样本上的 AUC = 0.87-0.89)。在独立的人类 (AUC = 0.77-0.84) 和 PDX (AUC = 0.80) 队列上验证了 BAP1 结果。最后,BAP1 预测与疾病特异性生存等临床结果相关。总的来说,这些数据表明,DL 模型可以解决癌症中的肿瘤内异质性,具有潜在的诊断、预后和生物学意义。
这项工作表明,对组织病理学图像进行深度学习分析有可能成为一种快速、低成本的方法来评估肿瘤内遗传异质性。见 Song 等人的相关评论,第 2672 页。