Hieromnimon Hanna M, Dolezal James, Doytcheva Kristina, Howard Frederick M, Kochanny Sara, Zhang Zhenyu, Grossman Robert L, Tanager Kevin, Wang Cindy, Kather Jakob Nikolas, Izumchenko Evgeny, Cipriani Nicole A, Fertig Elana J, Pearson Alexander T, Riesenfeld Samantha J
Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL, USA.
Department of Medicine, University of Chicago, Chicago, IL, USA.
Genome Med. 2025 Aug 7;17(1):87. doi: 10.1186/s13073-025-01502-z.
Precision oncology depends on identifying the biological vulnerabilities of a tumor. Molecular assays, like transcriptomics, provide an information-rich view of the tumor that can be leveraged to inform therapeutic selection. However, the costs of such assays can be prohibitive for clinical translation at scale. Histology-based imaging remains a predominant means of diagnosis that is widely accessible. To more broadly leverage limited molecular datasets, models have been trained to use histology to infer the expression of individual genes or pathways, with varying levels of accuracy and explainability.
Our approach detects expression of transcriptional programs from tumor histology and interprets the image features supporting program detection. Specifically, we used RNA-seq data from squamous cell carcinoma (SCC) patients to infer cohesive expression patterns of multiple genes. Then, we used deep learning techniques to train a computational model to predict the activity levels of the transcriptional programs directly from histology images. We exploited that predictive capability to generate synthetic digital models of the cellular histology of each transcriptional program, using generative adversarial networks to isolate image features supporting specific transcriptional predictions and pathologist review to interpret the images.
Applying our histologically integrated latent space analysis to SCCs revealed sets of genes associated with both pathologist-interpretable image features and clinically relevant processes, including immune response, collagen remodeling, and fibrosis, going beyond predictions of individual molecular features.
Our results demonstrate an approach for discovering clinically interpretable histological features that indicate molecular, potentially treatment-informing, biological processes. These features are detectable in widely available histology slides, allowing a standard microscope to deliver complex, patient-specific molecular information.
精准肿瘤学依赖于识别肿瘤的生物学脆弱性。分子检测,如转录组学,提供了肿瘤丰富的信息视图,可用于指导治疗选择。然而,此类检测的成本对于大规模临床转化来说可能过高。基于组织学的成像仍然是一种广泛可用的主要诊断手段。为了更广泛地利用有限的分子数据集,人们训练了模型,利用组织学来推断单个基因或通路的表达,其准确性和可解释性各不相同。
我们的方法从肿瘤组织学中检测转录程序的表达,并解释支持程序检测的图像特征。具体而言,我们使用鳞状细胞癌(SCC)患者的RNA测序数据来推断多个基因的凝聚表达模式。然后,我们使用深度学习技术训练一个计算模型,直接从组织学图像预测转录程序的活性水平。我们利用这种预测能力,使用生成对抗网络来分离支持特定转录预测的图像特征,并通过病理学家审查来解释图像,从而生成每个转录程序的细胞组织学合成数字模型。
将我们的组织学整合潜在空间分析应用于SCC,揭示了与病理学家可解释的图像特征和临床相关过程(包括免疫反应、胶原重塑和纤维化)相关的基因集,超越了对单个分子特征的预测。
我们的结果展示了一种发现临床可解释的组织学特征的方法,这些特征指示分子层面的、可能为治疗提供信息的生物学过程。这些特征可以在广泛可用的组织学切片中检测到,使标准显微镜能够提供复杂的、针对患者的分子信息。