利用生成式深度学习构建转录肿瘤程序的数字组织学模型，用于基于病理学的精准医学。

Building digital histology models of transcriptional tumor programs with generative deep learning for pathology-based precision medicine.

作者信息

Hieromnimon Hanna M, Dolezal James, Doytcheva Kristina, Howard Frederick M, Kochanny Sara, Zhang Zhenyu, Grossman Robert L, Tanager Kevin, Wang Cindy, Kather Jakob Nikolas, Izumchenko Evgeny, Cipriani Nicole A, Fertig Elana J, Pearson Alexander T, Riesenfeld Samantha J

机构信息

Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL, USA.

Department of Medicine, University of Chicago, Chicago, IL, USA.

出版信息

Genome Med. 2025 Aug 7;17(1):87. doi: 10.1186/s13073-025-01502-z.

DOI:10.1186/s13073-025-01502-z

PMID:40775734

Abstract

BACKGROUND

Precision oncology depends on identifying the biological vulnerabilities of a tumor. Molecular assays, like transcriptomics, provide an information-rich view of the tumor that can be leveraged to inform therapeutic selection. However, the costs of such assays can be prohibitive for clinical translation at scale. Histology-based imaging remains a predominant means of diagnosis that is widely accessible. To more broadly leverage limited molecular datasets, models have been trained to use histology to infer the expression of individual genes or pathways, with varying levels of accuracy and explainability.

METHODS

Our approach detects expression of transcriptional programs from tumor histology and interprets the image features supporting program detection. Specifically, we used RNA-seq data from squamous cell carcinoma (SCC) patients to infer cohesive expression patterns of multiple genes. Then, we used deep learning techniques to train a computational model to predict the activity levels of the transcriptional programs directly from histology images. We exploited that predictive capability to generate synthetic digital models of the cellular histology of each transcriptional program, using generative adversarial networks to isolate image features supporting specific transcriptional predictions and pathologist review to interpret the images.

RESULTS

Applying our histologically integrated latent space analysis to SCCs revealed sets of genes associated with both pathologist-interpretable image features and clinically relevant processes, including immune response, collagen remodeling, and fibrosis, going beyond predictions of individual molecular features.

CONCLUSIONS

Our results demonstrate an approach for discovering clinically interpretable histological features that indicate molecular, potentially treatment-informing, biological processes. These features are detectable in widely available histology slides, allowing a standard microscope to deliver complex, patient-specific molecular information.

摘要

背景

精准肿瘤学依赖于识别肿瘤的生物学脆弱性。分子检测，如转录组学，提供了肿瘤丰富的信息视图，可用于指导治疗选择。然而，此类检测的成本对于大规模临床转化来说可能过高。基于组织学的成像仍然是一种广泛可用的主要诊断手段。为了更广泛地利用有限的分子数据集，人们训练了模型，利用组织学来推断单个基因或通路的表达，其准确性和可解释性各不相同。

方法

我们的方法从肿瘤组织学中检测转录程序的表达，并解释支持程序检测的图像特征。具体而言，我们使用鳞状细胞癌（SCC）患者的RNA测序数据来推断多个基因的凝聚表达模式。然后，我们使用深度学习技术训练一个计算模型，直接从组织学图像预测转录程序的活性水平。我们利用这种预测能力，使用生成对抗网络来分离支持特定转录预测的图像特征，并通过病理学家审查来解释图像，从而生成每个转录程序的细胞组织学合成数字模型。