Choi Jerome J, Svaren John, Wang Daifeng
bioRxiv. 2024 Jun 21:2024.06.19.599799. doi: 10.1101/2024.06.19.599799.
Oligodendrocytes are the myelinating cells within the central nervous system. Many oligodendrocyte genes have been associated with brain disorders. However, how transcription factors (TFs) cooperate for gene regulation in oligodendrocytes remains largely uncharacterized. To address this, we integrated scRNA-seq and scATAC-seq data to identify the cooperative TFs that co-regulate the target gene (TG) expression in oligodendrocytes. First, we identified co- binding TF pairs whose binding sites overlapped in oligodendrocyte-specific regulatory regions. Second, we trained a deep learning model to predict the expression level of each TG using the expression levels of co-binding TFs. Third, using the trained models, we computed the TF importance and TF-TF interaction scores for predicting TG expression by the Shapley interaction scores. We found that the co-binding TF pairs involving known important TF pairs for oligodendrocyte differentiation, such as SOX10-TCF12, SOX10-MYRF, and SOX10-OLIG2, exhibited significantly higher Shapley scores than others (t-test, p-value < 1e-4). Furthermore, we identified 153 oligodendrocyte-associated eQTLs that reside in oligodendrocyte-specific enhancers or promoters where their eGenes (TGs) are regulated by cooperative TFs, suggesting potential novel regulatory roles from genetic variants. We also experimentally validated some identified TF pairs such as SOX10-OLIG2 and SOX10-NKX2.2 by co-enrichment analysis, using ChIP-seq data from rat peripheral nerve.
少突胶质细胞是中枢神经系统中的髓鞘形成细胞。许多少突胶质细胞基因与脑部疾病有关。然而,转录因子(TFs)如何在少突胶质细胞中协同进行基因调控在很大程度上仍不清楚。为了解决这个问题,我们整合了单细胞RNA测序(scRNA-seq)和单细胞染色质可及性测序(scATAC-seq)数据,以识别共同调节少突胶质细胞中靶基因(TG)表达的协同转录因子。首先,我们鉴定了在少突胶质细胞特异性调控区域中结合位点重叠的共结合转录因子对。其次,我们训练了一个深度学习模型,使用共结合转录因子的表达水平来预测每个靶基因的表达水平。第三,使用训练好的模型,我们通过Shapley相互作用分数计算预测靶基因表达的转录因子重要性和转录因子-转录因子相互作用分数。我们发现,涉及少突胶质细胞分化已知重要转录因子对的共结合转录因子对,如SOX10-TCF12、SOX10-MYRF和SOX10-OLIG2,其Shapley分数显著高于其他对(t检验,p值<1e-4)。此外,我们鉴定了153个少突胶质细胞相关的表达数量性状基因座(eQTLs),它们位于少突胶质细胞特异性增强子或启动子中,其效应基因(靶基因)由协同转录因子调控,这表明遗传变异可能具有新的调控作用。我们还通过共富集分析,利用大鼠外周神经的染色质免疫沉淀测序(ChIP-seq)数据,对一些鉴定出的转录因子对,如SOX10-OLIG2和SOX10-NKX2.2进行了实验验证。