Lin Yu, Wang Yan, Liang Yanchun, Yu Yang, Li Jingyi, Ma Qin, He Fei, Xu Dong
School of Artificial Intelligence, Jilin University, Changchun, China.
Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, United States.
Front Genet. 2022 Aug 12;13:912813. doi: 10.3389/fgene.2022.912813. eCollection 2022.
Spatial transcriptomics is an emerging technology widely applied to the analyses of tissue architecture and corresponding biological functions. Substantial computational methods have been developed for analyzing spatial transcriptomics data. These methods generate embeddings from gene expression and spatial locations for spot clustering or tissue architecture segmentation. Although the hyperparameters used to produce an embedding can be tuned for a given training set, a fixed embedding has variable performance from case to case due to data distributions. Therefore, selecting an effective embedding for new data in advance would be useful. For this purpose, we developed an embedding evaluation method named message passing-Moran's I with maximum filtering (MP-MIM), which combines message passing-based embedding transformation with spatial autocorrelation analysis. We applied a graph convolution to aggregate spatial transcriptomics data and employed global Moran's I to measure spatial autocorrelation and select the most effective embedding to infer tissue architecture. Sixteen spatial transcriptomics samples generated from the human brain were used to validate our method. The results show that MP-MIM can accurately identify high-quality embeddings that produce a high correlation between the predicted tissue architecture and the ground truth. Overall, our study provides a novel method to select embeddings for new test data and enhance the usability of deep learning tools for spatial transcriptome analyses.
空间转录组学是一种新兴技术,广泛应用于组织结构及其相应生物学功能的分析。目前已经开发出大量计算方法用于分析空间转录组学数据。这些方法从基因表达和空间位置生成嵌入,用于斑点聚类或组织结构分割。尽管用于生成嵌入的超参数可以针对给定训练集进行调整,但由于数据分布的原因,固定的嵌入在不同情况下性能会有所不同。因此,提前为新数据选择有效的嵌入会很有帮助。为此,我们开发了一种名为基于最大滤波的消息传递 - 莫兰指数(MP - MIM)的嵌入评估方法,该方法将基于消息传递的嵌入变换与空间自相关分析相结合。我们应用图卷积来聚合空间转录组学数据,并使用全局莫兰指数来测量空间自相关,选择最有效的嵌入来推断组织结构。使用从人类大脑生成的16个空间转录组学样本对我们的方法进行验证。结果表明,MP - MIM可以准确识别高质量的嵌入,这些嵌入在预测的组织结构与真实情况之间产生高度相关性。总体而言,我们的研究提供了一种为新测试数据选择嵌入的新方法,并提高了深度学习工具在空间转录组分析中的可用性。