Orenstein Yaron, Shamir Ron
Brief Funct Genomics. 2017 May 1;16(3):171-180. doi: 10.1093/bfgp/elw030.
Protein-DNA binding plays a central role in gene regulation and by that in all processes in the living cell. Novel experimental and computational approaches facilitate better understanding of protein-DNA binding preferences via high-throughput measurement of protein binding to a large number of DNA sequences and inference of binding models from them. Here we review the state of the art in measuring protein-DNA binding in vitro, emphasizing the advantages and limitations of different technologies. In addition, we describe models for representing protein-DNA binding preferences and key computational approaches to learn those from high-throughput data. Using large experimental data sets, we test the performance of different models based on different measuring techniques. We conclude with pertinent open problems.
蛋白质与DNA的结合在基因调控中起着核心作用,进而在活细胞的所有过程中都发挥着关键作用。新颖的实验和计算方法有助于通过高通量测量蛋白质与大量DNA序列的结合情况以及从中推断结合模型,从而更好地理解蛋白质与DNA的结合偏好。在此,我们综述了体外测量蛋白质与DNA结合的现有技术水平,着重介绍了不同技术的优缺点。此外,我们还描述了用于表示蛋白质与DNA结合偏好的模型以及从高通量数据中学习这些偏好的关键计算方法。利用大型实验数据集,我们基于不同的测量技术测试了不同模型的性能。最后,我们总结了相关的开放性问题。