Kauffman Chris, Karypis George
Department of Computer Science, University of Minnesota, 117 Pleasant St. SE, Minneapolis, MN 55455, USA.
Pac Symp Biocomput. 2008:477-88.
Understanding the role proteins play in regulating DNA replication is essential to forming a complete picture of how the genome manifests itself. In this work, we examine the feasibility of predicting the residues of a protein essential to binding by analyzing protein-DNA interactions from an information-theoretic perspective. Through the lens of mutual information, we explore which properties of protein sequence and structure are most useful in determining binding residues, with a particular focus on sequence features. We find that the quantity of information carried in most features is small with respect to DNA-contacting residues, the bulk being provided by sequence features along with a select few structural features. Supplemental information for this article is available at http://www.cs.umn.edu/~kauffman/supplements/psb2008.
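As an illustration of the mutual-information view described in the abstract, the following is a minimal sketch of estimating, from empirical counts, how many bits a discrete per-residue feature carries about a binary DNA-contact label. It is not the authors' procedure: the function name `mutual_information`, the choice of amino-acid identity as the feature, and the toy data are all assumptions made for this example.

```python
from collections import Counter
from math import log2


def mutual_information(features, labels):
    """Plug-in estimate of I(X;Y) in bits from paired discrete observations.

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = len(features)
    if n == 0 or n != len(labels):
        raise ValueError("features and labels must be non-empty and equal in length")

    joint = Counter(zip(features, labels))  # counts of (x, y) pairs
    px = Counter(features)                  # marginal counts of x
    py = Counter(labels)                    # marginal counts of y

    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written in terms of counts
        mi += (c / n) * log2(c * n / (px[x] * py[y]))
    return mi


# Toy data (assumed, not from the paper): residue identity per position
# and whether that position contacts DNA (1) or not (0).
residues = ["R", "K", "R", "A", "L", "K", "A", "L"]
contacts = [1, 1, 1, 0, 0, 1, 0, 0]

mi_bits = mutual_information(residues, contacts)
print(f"I(residue; contact) = {mi_bits:.3f} bits")
```

In this toy set the residue identity fully determines the contact label and the two labels are equally frequent, so the estimate equals the label entropy of 1 bit. On real data, plug-in estimates from small samples tend to be biased upward, so a comparison across features would need adequate counts or a bias correction.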