Division of Systems Biology, Nagoya University Graduate School of Medicine, Nagoya, Aichi 466-8550, Japan.
Laboratory of Computational Life Science, National Cancer Center Research Institute, Tokyo, Tokyo 104-0045, Japan.
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae520.
Single-cell RNA sequencing (scRNA-seq) enables comprehensive characterization of the cell state. However, its destructive nature prohibits measuring gene expression changes during dynamic processes such as embryogenesis or cell state divergence due to injury or disease. Although recent studies integrating scRNA-seq with lineage tracing have provided clonal insights between progenitor and mature cells, challenges remain. Because of their experimental nature, observations are sparse, and cells observed in the early state are not the exact progenitors of cells observed at later time points. To overcome these limitations, we developed LineageVAE, a novel computational methodology that utilizes deep learning based on the property that cells sharing barcodes have identical progenitors.
LineageVAE is a deep generative model that transforms scRNA-seq observations with identical lineage barcodes into sequential trajectories toward a common progenitor in a latent cell state space. This method enables the reconstruction of unobservable cell state transitions, historical transcriptomes, and regulatory dynamics at a single-cell resolution. Applied to hematopoiesis and reprogrammed fibroblast datasets, LineageVAE demonstrated its ability to restore backward cell state transitions and infer progenitor heterogeneity and transcription factor activity along differentiation trajectories.
The LineageVAE model was implemented in Python using the PyTorch deep learning library. The code is available on GitHub at https://github.com/LzrRacer/LineageVAE/.
单细胞 RNA 测序(scRNA-seq)能够全面描述细胞状态。然而,由于其破坏性,它禁止在胚胎发生或细胞状态因损伤或疾病而发生分歧等动态过程中测量基因表达变化。尽管最近的研究将 scRNA-seq 与谱系追踪相结合,为祖细胞和成熟细胞之间提供了克隆见解,但仍存在挑战。由于其实验性质,观察结果稀疏,并且在早期状态下观察到的细胞不是在稍后时间点观察到的细胞的确切祖细胞。为了克服这些限制,我们开发了 LineageVAE,这是一种新的计算方法,它利用了基于具有相同条形码的细胞具有相同祖细胞这一特性的深度学习。
LineageVAE 是一种深度生成模型,它将具有相同谱系条形码的 scRNA-seq 观测值转换为在潜在细胞状态空间中向共同祖细胞的连续轨迹。该方法能够重建不可观测的细胞状态转变、历史转录组和单细胞分辨率的调控动态。将 LineageVAE 应用于造血和重编程成纤维细胞数据集,证明了它能够恢复向后的细胞状态转变,并推断出沿分化轨迹的祖细胞异质性和转录因子活性。
LineageVAE 模型是使用 Python 中的 PyTorch 深度学习库实现的。代码可在 GitHub 上获得,网址为 https://github.com/LzrRacer/LineageVAE/。