Kaplan Tommy, Biggin Mark D
Department of Molecular and Cell Biology, California Institute of Quantitative Biosciences, University of California, Berkeley, California, USA; School of Computer Science and Engineering, The Hebrew University, Jerusalem, Israel.
Methods Cell Biol. 2012;110:263-83. doi: 10.1016/B978-0-12-388403-9.00011-4.
Animal transcription factors drive complex spatial and temporal patterns of gene expression during development by binding to a wide array of genomic regions. While the in vivo DNA binding landscape and in vitro DNA binding affinities of many such proteins have been characterized, our understanding of the forces that determine where, when, and the extent to which these transcription factors bind DNA in cells remains primitive. In this chapter, we describe computational thermodynamic models that predict the genome-wide DNA binding landscape of transcription factors in vivo and evaluate the contribution of biophysical determinants, such as protein-protein interactions and chromatin accessibility, on DNA occupancy. We show that predictions based only on DNA sequence and in vitro DNA affinity data achieve a mild correlation (r=0.4) with experimental measurements of in vivo DNA binding. However, by incorporating direct measurements of DNA accessibility in chromatin, it is possible to obtain much higher accuracy (r=0.6-0.9) for various transcription factors across known target genes. Thus, a combination of experimental DNA accessibility data and computational modeling of transcription factor DNA binding may be sufficient to predict the binding landscape of any animal transcription factor with reasonable accuracy.
动物转录因子通过与广泛的基因组区域结合,在发育过程中驱动基因表达的复杂时空模式。虽然许多这类蛋白质的体内DNA结合图谱和体外DNA结合亲和力已得到表征,但我们对决定这些转录因子在细胞中何处、何时以及在多大程度上结合DNA的作用力的理解仍很初级。在本章中,我们描述了计算热力学模型,该模型可预测体内转录因子的全基因组DNA结合图谱,并评估生物物理决定因素(如蛋白质-蛋白质相互作用和染色质可及性)对DNA占据情况的贡献。我们表明,仅基于DNA序列和体外DNA亲和力数据的预测与体内DNA结合的实验测量结果具有中等程度的相关性(r = 0.4)。然而,通过纳入染色质中DNA可及性的直接测量数据,对于已知靶基因上的各种转录因子,可以获得更高的准确性(r = 0.6 - 0.9)。因此,实验性DNA可及性数据与转录因子DNA结合的计算建模相结合,可能足以以合理的准确性预测任何动物转录因子的结合图谱。