College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China.
Computer Science, Tuskegee University, State of Alabama 36088, United States.
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae607.
Recent advances in spatial transcriptomics technologies have provided multi-modality data integrating gene expression, spatial context, and histological images. Accurately identifying spatial domains and spatially variable genes is crucial for understanding tissue structures and biological functions. However, effectively combining multi-modality data to identify spatial domains and determining SVGs closely related to these spatial domains remains a challenge.
In this study, we propose spatial transcriptomics multi-modality and multi-granularity collaborative learning (spaMMCL). For detecting spatial domains, spaMMCL mitigates the adverse effects of modality bias by masking portions of gene expression data, integrates gene and image features using a shared graph convolutional network, and employs graph self-supervised learning to deal with noise from feature fusion. Simultaneously, based on the identified spatial domains, spaMMCL integrates various strategies to detect potential SVGs at different granularities, enhancing their reliability and biological significance. Experimental results demonstrate that spaMMCL substantially improves the identification of spatial domains and SVGs.
The code and data of spaMMCL are available on Github: Https://github.com/liangxiao-cs/spaMMCL.
最近空间转录组学技术的进步提供了多模态数据,将基因表达、空间背景和组织学图像整合在一起。准确识别空间域和与这些空间域密切相关的空间变异基因对于理解组织结构和生物学功能至关重要。然而,有效地结合多模态数据来识别空间域并确定与这些空间域密切相关的空间变异基因仍然是一个挑战。
在这项研究中,我们提出了空间转录组学多模态和多粒度协作学习(spaMMCL)。对于检测空间域,spaMMCL 通过屏蔽部分基因表达数据来减轻模态偏差的不利影响,使用共享图卷积网络整合基因和图像特征,并采用图自监督学习来处理特征融合产生的噪声。同时,基于识别出的空间域,spaMMCL 集成了各种策略来检测不同粒度的潜在空间变异基因,提高了它们的可靠性和生物学意义。实验结果表明,spaMMCL 大大提高了空间域和空间变异基因的识别能力。
spaMMCL 的代码和数据可在 Github 上获得: Https://github.com/liangxiao-cs/spaMMCL。