Center for Computational Biology and Bioinformatics and College of Engineering, Koc University, Istanbul, Turkey.
Proteins. 2010 Aug 1;78(10):2283-94. doi: 10.1002/prot.22741.
We propose a novel approach to analyze and visualize residue contact networks of protein interfaces by graph-based algorithms using a minimum cut tree (mincut tree). Edges in the network are weighted according to an energy function derived from knowledge-based potentials. The mincut tree, which is constructed from the weighted residue network, simplifies and summarizes the complex structure of the contact network by an efficient and informative representation. This representation offers a comprehensible view of critical residues and facilitates the inspection of their organization. We observed, on a nonredundant data set of 38 protein complexes with experimental hotspots that the highest degree node in the mincut tree usually corresponds to an experimental hotspot. Further, hotspots are found in a few paths in the mincut tree. In addition, we examine the organization of hotspots (hot regions) using an iterative clustering algorithm on two different case studies. We find that distinct hot regions are located on specific sites of the mincut tree and some critical residues hold these clusters together. Clustering of the interface residues provides information about the relation of hot regions with each other. Our new approach is useful at the molecular level for both identification of critical paths in the protein interfaces and extraction of hot regions by clustering of the interface residues.
我们提出了一种新的方法,通过基于图的算法和最小割树(mincut tree)来分析和可视化蛋白质界面残基接触网络。网络中的边根据基于知识的势能导出的能量函数进行加权。最小割树是从加权残基网络构建的,通过有效且信息丰富的表示形式简化和总结了接触网络的复杂结构。这种表示形式提供了对关键残基的可理解视图,并方便了对它们组织的检查。我们在具有实验热点的 38 个蛋白质复合物的非冗余数据集上观察到,mincut 树中的最高度节点通常对应于实验热点。此外,热点存在于 mincut 树中的几条路径中。此外,我们使用迭代聚类算法在两个不同的案例研究中检查热点(热区)的组织。我们发现,不同的热点位于 mincut 树的特定位置,一些关键残基将这些簇连接在一起。界面残基的聚类提供了有关热点区域彼此之间关系的信息。我们的新方法在分子水平上对于识别蛋白质界面中的关键路径以及通过界面残基聚类提取热点区域都非常有用。