Hackenberg Maren, Canal Guitart Laia, Backofen Rolf, Binder Harald
Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Stefan-Meier-Straße 26, 79106 Freiburg, Germany.
Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Ernst-Zermelo-Straße 1, 79106 Freiburg, Germany.
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf287.
There are various dimensionality reduction techniques for visually inspecting dynamical patterns in time-series single-cell RNA-sequencing (scRNA-seq) data. However, the lack of one-to-one correspondence between cells across time points makes it difficult to uniquely uncover temporal structure in a low-dimensional manifold. The use of different techniques may thus lead to discrepancies in the representation of dynamical patterns. However, The extent of these discrepancies remains unclear. To investigate this, we propose an approach for reasoning about such discrepancies based on synthetic time-series scRNA-seq data generated by variational autoencoders. The synthetic dynamical patterns induced in a low-dimensional manifold reflect biologically plausible temporal patterns, such as dividing cell clusters during a differentiation process. We consider manifolds from different dimensionality reduction techniques, such as principal component analysis, t-distributed stochastic neighbor embedding, uniform manifold approximation, and projection and single-cell variational inference. We illustrate how the proposed approach allows for reasoning about to what extent low-dimensional manifolds, obtained from different techniques, can capture different dynamical patterns. None of these techniques was found to be consistently superior and the results indicate that they may not reliably represent dynamics when used in isolation, underscoring the need to compare multiple perspectives. Thus, the proposed synthetic dynamical pattern approach provides a foundation for guiding future methods development to detect complex patterns in time-series scRNA-seq data.
有多种降维技术可用于直观检查时间序列单细胞RNA测序(scRNA-seq)数据中的动态模式。然而,不同时间点的细胞之间缺乏一一对应关系,这使得难以在低维流形中唯一地揭示时间结构。因此,使用不同的技术可能会导致动态模式表示上的差异。然而,这些差异的程度仍不清楚。为了研究这一点,我们提出了一种基于变分自编码器生成的合成时间序列scRNA-seq数据来推断此类差异的方法。在低维流形中诱导的合成动态模式反映了生物学上合理的时间模式,例如分化过程中正在分裂的细胞簇。我们考虑来自不同降维技术的流形,如主成分分析、t分布随机邻域嵌入、均匀流形近似与投影以及单细胞变分推理。我们说明了所提出的方法如何能够推断从不同技术获得的低维流形在多大程度上可以捕获不同的动态模式。没有发现这些技术中的任何一种始终具有优势,结果表明它们单独使用时可能无法可靠地表示动态,这突出了比较多个视角的必要性。因此,所提出的合成动态模式方法为指导未来方法开发以检测时间序列scRNA-seq数据中的复杂模式提供了基础。