Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC 27599, USA; Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, NC 27599, USA; Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC 27599, USA.
Curriculum in Bioinformatics and Computational Biology, School of Medicine, University of North Carolina, Chapel Hill, NC 27599, USA.
Cell Syst. 2022 Sep 21;13(9):737-751.e4. doi: 10.1016/j.cels.2022.08.004. Epub 2022 Sep 1.
The epigenetic control of gene expression is highly cell-type and context specific. Yet, despite its complexity, gene regulatory logic can be broken down into modular components consisting of a transcription factor (TF) activating or repressing the target gene expression through its binding to a cis-regulatory region. We propose a nonparametric approach, TRIPOD, to detect and characterize the three-way relationships between a TF, its target gene, and the accessibility of the TF's binding site using single-cell RNA and ATAC multiomic data. We apply TRIPOD to interrogate the cell-type-specific regulatory logic in peripheral blood mononuclear cells and contrast our results to detections from enhancer databases, cis-eQTL studies, ChIP-seq experiments, and TF knockdown/knockout studies. We then apply TRIPOD to mouse embryonic brain data and identify regulatory relationships, validated by ChIP-seq and PLAC-seq. Finally, we demonstrate TRIPOD on the SHARE-seq data of differentiating mouse hair follicle cells and identify lineage-specific regulation supported by histone marks and super-enhancer annotations. A record of this paper's transparent peer review process is included in the supplemental information.
基因表达的表观遗传控制具有高度的细胞类型和上下文特异性。然而,尽管其复杂性,基因调控逻辑可以分解为模块化组件,包括转录因子 (TF) 通过与其结合来激活或抑制靶基因表达的顺式调控区。我们提出了一种非参数方法 TRIPOD,用于使用单细胞 RNA 和 ATAC 多组学数据检测和表征 TF、其靶基因和 TF 结合位点可及性之间的三向关系。我们应用 TRIPOD 来探究外周血单核细胞中的细胞类型特异性调控逻辑,并将我们的结果与增强子数据库、顺式 eQTL 研究、ChIP-seq 实验和 TF 敲低/敲除研究的检测结果进行对比。然后,我们将 TRIPOD 应用于小鼠胚胎大脑数据,并通过 ChIP-seq 和 PLAC-seq 验证了调控关系。最后,我们在分化的小鼠毛囊细胞的 SHARE-seq 数据上演示了 TRIPOD,并通过组蛋白标记和超级增强子注释识别了支持谱系特异性调控的关系。本文的透明同行评审过程记录包含在补充信息中。