Department of Statistics, University of California, Los Angeles, Los Angeles, USA.
BMC Genomics. 2011 Oct 20;12:515. doi: 10.1186/1471-2164-12-515.
To facilitate deciphering underlying transcriptional regulatory circuits in mouse embryonic stem (ES) cells, recent ChIP-seq data provided genome-wide binding locations of several key transcription factors (TFs); meanwhile, existing efforts profiled gene expression in ES cells and in their early differentiated state. It has been shown that the gene expression profiles are correlated with the binding of these TFs. However, it remains unclear whether other TFs, referred to as cofactors, participate the gene regulation by collaborating with the ChIP-seq TFs.
Based on our analyses of the ES gene expression profiles and binding sites of potential cofactors in vicinity of the ChIP-seq TF binding locations, we identified a list of co-binding features that show significantly different characteristics between different gene expression patterns (activated or repressed gene expression in ES cells) at a false discovery rate of 10%. Gene classification with a subset of the identified features achieved up to 20% improvement over classification only based on the ChIP-seq TFs. More than 1/3 of reasoned regulatory roles of cofactor candidates involved in these features are supported by existing literatures. Finally, the predicted target genes of the majority candidates present expected expression change in another independent data set, which serves as a supplementary validation of these candidates.
Our results revealed a list of combinatorial genomic features that are significantly associated with gene expression in ES cells, suggesting potential cofactors of the ChIP-seq TFs for gene regulation.
为了帮助理解小鼠胚胎干细胞(ES 细胞)中的潜在转录调控回路,最近的 ChIP-seq 数据提供了几个关键转录因子(TF)的全基因组结合位置;同时,现有的研究已经对 ES 细胞及其早期分化状态中的基因表达进行了分析。已经表明基因表达谱与这些 TF 的结合相关。然而,其他 TF,称为共因子,是否通过与 ChIP-seq TF 结合来参与基因调控仍不清楚。
基于我们对 ES 细胞基因表达谱和附近潜在共因子结合位点的分析,我们在假发现率为 10%的情况下,确定了一组共结合特征,这些特征在 ES 细胞中不同基因表达模式(激活或抑制基因表达)之间表现出显著不同的特征。使用部分鉴定出的特征进行基因分类,比仅基于 ChIP-seq TF 的分类提高了 20%。在这些特征中,超过 1/3 的共因子候选物的推理调控作用得到了现有文献的支持。最后,大多数候选物的预测靶基因在另一个独立数据集的表达变化符合预期,这为这些候选物提供了补充验证。
我们的研究结果揭示了一组与 ES 细胞中基因表达显著相关的组合基因组特征,提示了 ChIP-seq TF 进行基因调控的潜在共因子。