BGI-Shenzhen, Shenzhen 518083, China; BGI-Beijing, Beijing 100101, China.
BGI-Shenzhen, Shenzhen 518083, China.
Genomics Proteomics Bioinformatics. 2023 Feb;21(1):24-47. doi: 10.1016/j.gpb.2022.10.001. Epub 2022 Oct 14.
The development of spatial transcriptomics (ST) technologies has transformed genetic research from a single-cell data level to a two-dimensional spatial coordinate system and facilitated the study of the composition and function of various cell subsets in different environments and organs. The large-scale data generated by these ST technologies, which contain spatial gene expression information, have elicited the need for spatially resolved approaches to meet the requirements of computational and biological data interpretation. These requirements include dealing with the explosive growth of data to determine the cell-level and gene-level expression, correcting the inner batch effect and loss of expression to improve the data quality, conducting efficient interpretation and in-depth knowledge mining both at the single-cell and tissue-wide levels, and conducting multi-omics integration analysis to provide an extensible framework toward the in-depth understanding of biological processes. However, algorithms designed specifically for ST technologies to meet these requirements are still in their infancy. Here, we review computational approaches to these problems in light of corresponding issues and challenges, and present forward-looking insights into algorithm development.
空间转录组学(ST)技术的发展将遗传研究从单细胞数据层面转化为二维空间坐标系,促进了对不同环境和器官中各种细胞亚群的组成和功能的研究。这些 ST 技术产生的大规模数据包含空间基因表达信息,这就需要采用空间分辨方法来满足计算和生物学数据解释的要求。这些要求包括处理数据的爆炸式增长以确定细胞水平和基因水平的表达,纠正内在批次效应和表达损失以提高数据质量,在单细胞和组织水平上进行有效的解释和深入的知识挖掘,以及进行多组学整合分析,为深入了解生物过程提供一个可扩展的框架。然而,专门为 ST 技术设计的满足这些要求的算法仍处于起步阶段。在这里,我们根据相应的问题和挑战,综述了这些问题的计算方法,并对算法的发展提出了前瞻性的见解。