Center of New Technologies, University of Warsaw, Warsaw, Poland.
Department of Physics, University of Warsaw, Warsaw, Poland.
Proteins. 2021 Oct;89(10):1333-1339. doi: 10.1002/prot.26154. Epub 2021 Jun 10.
Protein structure networks (PSNs) have long been used to provide a coarse yet meaningful representation of protein structure, dynamics, and internal communication pathways. An important question is which criteria should be applied when constructing the network so as to include relevant inter-residue contacts while avoiding unnecessary connections. To address this issue, we systematically varied the residue distance cutoff and the probability threshold for contact formation when constructing PSNs from atomistic molecular dynamics simulations, and assessed the amount of mutual information within the resulting representations. We found that the minimum in mutual information is universally reached at a cutoff of 5 Å, irrespective of the contact formation probability threshold applied, in all of the distinct proteins considered. Assuming that optimal PSNs should be characterized by the least redundancy, which corresponds to the minimum in mutual information, this finding suggests an objective criterion for the cutoff distance and supports the customary choice of a cutoff around 5 Å, which to date has typically rested on heuristic criteria.
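The two construction parameters named in the abstract, the distance cutoff and the contact formation probability threshold, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each trajectory frame has already been reduced to a matrix of inter-residue distances in Å (the abstract does not say how residue distance is defined), and the names `contact_probability`, `build_psn`, and `mutual_information` are hypothetical. The mutual information function is a generic plug-in estimate for two discrete series, shown only to make the quantity concrete; the abstract does not specify which estimator was used.

```python
import math
from collections import Counter


def contact_probability(frames, cutoff):
    """Fraction of frames in which each residue pair lies within `cutoff` (Å).

    `frames` is a list of square distance matrices (lists of lists), one per
    trajectory frame. How the inter-residue distance is measured is an
    assumption left to the caller.
    """
    n_frames = len(frames)
    n_res = len(frames[0])
    counts = [[0] * n_res for _ in range(n_res)]
    for dist in frames:
        for i in range(n_res):
            for j in range(n_res):
                if dist[i][j] <= cutoff:
                    counts[i][j] += 1
    return [[c / n_frames for c in row] for row in counts]


def build_psn(frames, cutoff=5.0, p_min=0.5):
    """Return the PSN as a set of edges (i, j), i < j.

    An edge is kept when the pair is in contact in at least a fraction
    `p_min` of the frames. The default values are illustrative only.
    """
    prob = contact_probability(frames, cutoff)
    n_res = len(prob)
    return {
        (i, j)
        for i in range(n_res)
        for j in range(i + 1, n_res)
        if prob[i][j] >= p_min
    }


def mutual_information(x, y):
    """Plug-in mutual information (in bits) between two discrete series."""
    n = len(x)
    joint = Counter(zip(x, y))
    px, py = Counter(x), Counter(y)
    mi = 0.0
    for (a, b), c in joint.items():
        p_ab = c / n
        # p_ab / (p_a * p_b), written with raw counts to limit rounding error
        mi += p_ab * math.log2(p_ab * n * n / (px[a] * py[b]))
    return mi


# Toy trajectory: 3 residues, 4 frames (made-up distances, in Å).
def frame(d01, d02, d12):
    return [[0.0, d01, d02], [d01, 0.0, d12], [d02, d12, 0.0]]


toy_frames = [
    frame(4.0, 10.0, 4.5),
    frame(4.0, 10.0, 6.0),
    frame(4.0, 10.0, 6.0),
    frame(6.0, 10.0, 6.0),
]
edges = build_psn(toy_frames, cutoff=5.0, p_min=0.5)
```

In this toy input, residues 0 and 1 are within 5 Å in three of four frames, so the pair passes a 0.5 threshold, whereas the pair (1, 2) is in contact in only one frame and is dropped. Scanning `cutoff` and `p_min` over a grid and evaluating a redundancy measure on each resulting network would mirror the kind of parameter sweep the abstract describes.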