Andersson Alma, Lundeberg Joakim
Department of Gene Technology, Science for Life Laboratory, KTH Royal Institute of Technology, Stockholm 114 28, Sweden.
Bioinformatics. 2021 Sep 9;37(17):2644-2650. doi: 10.1093/bioinformatics/btab164.
Collection of spatial signals in large numbers has become a routine task in multiple omics-fields, but parsing of these rich datasets still pose certain challenges. In whole or near-full transcriptome spatial techniques, spurious expression profiles are intermixed with those exhibiting an organized structure. To distinguish profiles with spatial patterns from the background noise, a metric that enables quantification of spatial structure is desirable. Current methods designed for similar purposes tend to be built around a framework of statistical hypothesis testing, hence we were compelled to explore a fundamentally different strategy.
We propose an unexplored approach to analyze spatial transcriptomics data, simulating diffusion of individual transcripts to extract genes with spatial patterns. The method performed as expected when presented with synthetic data. When applied to real data, it identified genes with distinct spatial profiles, involved in key biological processes or characteristic for certain cell types. Compared to existing methods, ours seemed to be less informed by the genes' expression levels and showed better time performance when run with multiple cores.
Open-source Python package with a command line interface (CLI), freely available at https://github.com/almaan/sepal under an MIT licence. A mirror of the GitHub repository can be found at Zenodo, doi: 10.5281/zenodo.4573237.
Supplementary data are available at Bioinformatics online.
在多个组学领域,大量空间信号的收集已成为一项常规任务,但解析这些丰富的数据集仍面临一定挑战。在全转录组或接近全转录组的空间技术中,虚假的表达谱与那些呈现组织结构的表达谱相互混杂。为了将具有空间模式的谱与背景噪声区分开来,需要一种能够量化空间结构的指标。目前为类似目的设计的方法往往围绕统计假设检验框架构建,因此我们不得不探索一种根本不同的策略。
我们提出了一种未被探索的方法来分析空间转录组学数据,模拟单个转录本的扩散以提取具有空间模式的基因。当处理合成数据时,该方法表现符合预期。应用于真实数据时,它识别出具有独特空间谱的基因,这些基因参与关键生物学过程或特定细胞类型的特征。与现有方法相比,我们的方法似乎受基因表达水平的影响较小,并且在使用多个核心运行时具有更好的时间性能。
具有命令行界面(CLI)的开源Python包,根据MIT许可在https://github.com/almaan/sepal上免费提供。GitHub仓库的镜像可在Zenodo上找到,doi: 10.5281/zenodo.4573237。
补充数据可在《生物信息学》在线获取。