School of Life Sciences, University of Science and Technology of China, Hefei, Anhui, 230027, China.
Department of Plant Biology and the Genome Center, College of Biological Sciences, University of California, Davis, CA, 95616, USA.
Sci Rep. 2017 Jul 17;7(1):5557. doi: 10.1038/s41598-017-05705-2.
Deciphering gene regulatory networks requires identification of gene expression modules. We describe a novel bottom-up approach to identify gene modules regulated by cis-regulatory motifs from a human gene co-expression network. Target genes of a cis-regulatory motif were identified from the network via the motif's enrichment or biased distribution towards transcription start sites in the promoters of co-expressed genes. A gene sub-network containing the target genes was extracted and used to derive gene modules. The analysis revealed known and novel gene modules regulated by the NF-Y motif. The binding of NF-Y proteins to these modules' gene promoters were verified using ENCODE ChIP-Seq data. The analyses also identified 8,048 Sp1 motif target genes, interestingly many of which were not detected by ENCODE ChIP-Seq. These target genes assemble into house-keeping, tissues-specific developmental, and immune response modules. Integration of Sp1 modules with genomic and epigenomic data indicates epigenetic control of Sp1 targets' expression in a cell/tissue specific manner. Finally, known and novel target genes and modules regulated by the YY1, RFX1, IRF1, and 34 other motifs were also identified. The study described here provides a valuable resource to understand transcriptional regulation of various human developmental, disease, or immunity pathways.
解析基因调控网络需要识别基因表达模块。我们描述了一种新的自下而上的方法,用于从人类基因共表达网络中识别受顺式调控基序调控的基因模块。通过基序在共表达基因启动子中朝向转录起始位点的富集或偏向分布,从网络中识别顺式调控基序的靶基因。提取包含靶基因的基因子网络,并用于推导基因模块。分析揭示了受 NF-Y 基序调控的已知和新的基因模块。使用 ENCODE ChIP-Seq 数据验证了 NF-Y 蛋白与这些模块基因启动子的结合。分析还鉴定了 8048 个 Sp1 基序靶基因,有趣的是,其中许多基因未被 ENCODE ChIP-Seq 检测到。这些靶基因组装成管家、组织特异性发育和免疫反应模块。Sp1 模块与基因组和表观基因组数据的整合表明,Sp1 靶基因的表达受到细胞/组织特异性表观遗传控制。最后,还鉴定了受 YY1、RFX1、IRF1 和其他 34 个基序调控的已知和新的靶基因和模块。本文描述的研究为理解各种人类发育、疾病或免疫途径的转录调控提供了有价值的资源。