Cui Lingyu, Li Hongfei, Bian Jilong, Wang Guohua, Liang Yingjian
College of Life Science, Northeast Forestry University, Harbin, 150040, China.
College of Information and Computer Engineering, Northeast Forestry University, Harbin, 150040, China.
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad011.
Identifying gene regulatory networks (GRNs) at the resolution of single cells has long been a great challenge, and the advent of single-cell multi-omics data provides unprecedented opportunities to construct GRNs. Here, we propose a novel strategy to integrate omics datasets of single-cell ribonucleic acid sequencing and single-cell Assay for Transposase-Accessible Chromatin using sequencing, and using an unsupervised learning neural network to divide the samples with high copy number variation scores, which are used to infer the GRN in each gene block. Accuracy validation of proposed strategy shows that approximately 80% of transcription factors are directly associated with cancer, colorectal cancer, malignancy and disease by TRRUST; and most transcription factors are prone to produce multiple transcript variants and lead to tumorigenesis by RegNetwork database, respectively. The source code access are available at: https://github.com/Cuily-v/Colorectal_cancer.
在单细胞分辨率下识别基因调控网络(GRNs)长期以来一直是一项巨大挑战,而单细胞多组学数据的出现为构建基因调控网络提供了前所未有的机遇。在此,我们提出一种新颖策略,将单细胞核糖核酸测序和单细胞转座酶可及染色质测序的组学数据集进行整合,并使用无监督学习神经网络对具有高拷贝数变异分数的样本进行划分,这些样本用于推断每个基因块中的基因调控网络。所提策略的准确性验证表明,通过TRRUST大约80%的转录因子与癌症、结直肠癌、恶性肿瘤和疾病直接相关;并且通过RegNetwork数据库,大多数转录因子分别易于产生多种转录变体并导致肿瘤发生。源代码可在以下网址获取:https://github.com/Cuily-v/Colorectal_cancer 。