Liu Jiajia, Ma Jian, Wen Jianguo, Zhou Xiaobo
bioRxiv. 2024 Feb 2:2024.01.31.578213. doi: 10.1101/2024.01.31.578213.
In recent years, the integration of single-cell multi-omics data has provided a more comprehensive understanding of cell functions and internal regulatory mechanisms from a non-single omics perspective, but it still suffers many challenges, such as omics-variance, sparsity, cell heterogeneity and confounding factors. As we know, cell cycle is regarded as a confounder when analyzing other factors in single-cell RNA-seq data, but it's not clear how it will work on the integrated single-cell multi-omics data. Here, we developed a Cell Cycle-Aware Network (CCAN) to remove cell cycle effects from the integrated single-cell multi-omics data while keeping the cell type-specific variations. This is the first computational model to study the cell-cycle effects in the integration of single-cell multi-omics data. Validations on several benchmark datasets show the out-standing performance of CCAN in a variety of downstream analyses and applications, including removing cell cycle effects and batch effects of scRNA-seq datasets from different protocols, integrating paired and unpaired scRNA-seq and scATAC-seq data, accurately transferring cell type labels from scRNA-seq to scATAC-seq data, and characterizing the differentiation process from hematopoietic stem cells to different lineages in the integration of differentiation data.
近年来,单细胞多组学数据的整合从非单一组学的角度为细胞功能和内部调控机制提供了更全面的理解,但仍面临许多挑战,如组学方差、稀疏性、细胞异质性和混杂因素。众所周知,在分析单细胞RNA测序数据中的其他因素时,细胞周期被视为一个混杂因素,但尚不清楚它在整合的单细胞多组学数据中会如何起作用。在此,我们开发了一种细胞周期感知网络(CCAN),以从整合的单细胞多组学数据中去除细胞周期效应,同时保留细胞类型特异性变异。这是第一个研究单细胞多组学数据整合中细胞周期效应的计算模型。在几个基准数据集上的验证表明,CCAN在各种下游分析和应用中表现出色,包括去除来自不同协议的scRNA测序数据集的细胞周期效应和批次效应、整合配对和非配对的scRNA测序与scATAC测序数据、将细胞类型标签从scRNA测序准确转移到scATAC测序数据,以及在分化数据整合中表征从造血干细胞到不同谱系的分化过程。