Xiao Xiao, Kong Yan, Li Ronghan, Wang Zuoheng, Lu Hui
State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China; SJTU-Yale Joint Center for Biostatistics and Data Science, National Center for Translational Medicine, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University, Shanghai, China; Department of Biostatistics, Yale School of Public Health, Yale University, New Haven, CT, United States.
State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China; SJTU-Yale Joint Center for Biostatistics and Data Science, National Center for Translational Medicine, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University, Shanghai, China.
Med Image Anal. 2024 Jan;91:103040. doi: 10.1016/j.media.2023.103040. Epub 2023 Nov 20.
Inferring gene expression from histopathological images has long been a fascinating yet challenging task, primarily because of the substantial disparities between the two modalities. Existing strategies that use local or global features of histological images suffer from high model complexity, heavy GPU memory consumption, low interpretability, insufficient encoding of local features, and over-smoothed predictions of gene expression among neighboring sites. In this paper, we develop TCGN (Transformer with Convolution and Graph-Node co-embedding), a method for estimating gene expression from H&E-stained pathological slide images. TCGN combines convolutional layers, transformer encoders, and graph neural networks, and is the first to integrate these blocks into a general and interpretable computer vision backbone. Notably, TCGN requires only a single spot image as input for histopathological image analysis, simplifying the pipeline while maintaining interpretability. We validated TCGN on three publicly available spatial transcriptomics datasets, where it consistently achieved the best performance (median PCC 0.232). TCGN offers superior accuracy with a modest parameter count (86.241 million) and a small memory footprint, allowing it to run smoothly even on personal computers. Moreover, TCGN can be extended to handle bulk RNA-seq data while retaining interpretability. Improving the accuracy of omics prediction from pathological images not only establishes a connection between genotype and phenotype, enabling costly-to-measure biomarkers to be predicted from affordable histopathological images, but also lays the groundwork for future multi-modal data modeling. Our results confirm that TCGN is a powerful tool for inferring gene expression from histopathological images in precision health applications.
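The abstract gives no implementation details, so the following is a minimal, illustrative PyTorch sketch of the kind of convolution + transformer + graph-node co-embedding backbone it describes: a CNN stem patchifies a single spot image, a transformer encoder provides global attention over the patch tokens, and a graph layer propagates information along a local token graph before a regression head predicts per-gene expression. All module names, dimensions, and the 4-neighbour token adjacency are assumptions for illustration, not the authors' TCGN implementation.

import torch
import torch.nn as nn

class SimpleGraphConv(nn.Module):
    """One round of mean-aggregation message passing over patch tokens."""
    def __init__(self, dim):
        super().__init__()
        self.linear = nn.Linear(dim, dim)

    def forward(self, x, adj):
        # x: (B, N, D) node features; adj: (N, N) row-normalized adjacency
        return torch.relu(self.linear(adj @ x))

class ToyTCGN(nn.Module):
    """Illustrative CNN stem -> transformer -> graph layer -> regression head."""
    def __init__(self, n_genes=250, dim=128, patches=14):
        super().__init__()
        # Convolutional stem: 224x224 RGB spot image -> 14x14 grid of tokens
        self.stem = nn.Sequential(
            nn.Conv2d(3, dim, kernel_size=16, stride=16),  # patchify
            nn.BatchNorm2d(dim),
            nn.ReLU(),
        )
        enc_layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.transformer = nn.TransformerEncoder(enc_layer, num_layers=2)
        self.gnn = SimpleGraphConv(dim)
        self.head = nn.Linear(dim, n_genes)
        # Assumed 4-neighbour grid adjacency over the token lattice
        self.register_buffer("adj", self._grid_adjacency(patches))

    @staticmethod
    def _grid_adjacency(p):
        n = p * p
        adj = torch.eye(n)
        for i in range(p):
            for j in range(p):
                u = i * p + j
                for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ni, nj = i + di, j + dj
                    if 0 <= ni < p and 0 <= nj < p:
                        adj[u, ni * p + nj] = 1.0
        return adj / adj.sum(dim=1, keepdim=True)  # row-normalize

    def forward(self, img):
        # img: (B, 3, 224, 224) single spot image, as in the abstract
        feats = self.stem(img)                     # (B, D, 14, 14)
        tokens = feats.flatten(2).transpose(1, 2)  # (B, 196, D)
        tokens = self.transformer(tokens)          # global attention
        tokens = self.gnn(tokens, self.adj)        # local graph propagation
        return self.head(tokens.mean(dim=1))       # (B, n_genes)

model = ToyTCGN()
pred = model(torch.randn(2, 3, 224, 224))
print(pred.shape)  # torch.Size([2, 250])

The design point the sketch tries to make concrete is the complementarity of the three blocks: the convolutional stem encodes local texture, attention captures long-range context within the spot, and the graph layer counteracts purely global mixing by re-weighting each token with its spatial neighbours.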
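For the reported metric, a common convention in spatial-transcriptomics prediction work is to compute the Pearson correlation per gene across spots and then take the median over genes; the snippet below follows that convention with synthetic data. This is an assumed reading of "median PCC 0.232", not the paper's evaluation code.

import numpy as np

def median_pcc(pred, truth):
    """Median over genes of the Pearson correlation between predicted
    and measured expression across spots.
    pred, truth: (n_spots, n_genes) arrays."""
    pccs = [np.corrcoef(pred[:, g], truth[:, g])[0, 1]
            for g in range(truth.shape[1])]
    return float(np.median(pccs))

rng = np.random.default_rng(0)
truth = rng.normal(size=(500, 250))           # measured expression (synthetic)
pred = truth + rng.normal(scale=2.0, size=truth.shape)  # noisy predictions
print(f"median PCC: {median_pcc(pred, truth):.3f}")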