Department of Biophysics, Molecular Biology & Bioinformatics, University of Calcutta, 92 APC Road, Kolkata-700009, India.
BMC Bioinformatics. 2012 Jun 21;13:142. doi: 10.1186/1471-2105-13-142.
The three-dimensional structure of a protein can be described as a graph where nodes represent residues and the strength of non-covalent interactions between them are edges. These protein contact networks can be separated into long and short-range interactions networks depending on the positions of amino acids in primary structure. Long-range interactions play a distinct role in determining the tertiary structure of a protein while short-range interactions could largely contribute to the secondary structure formations. In addition, physico chemical properties and the linear arrangement of amino acids of the primary structure of a protein determines its three dimensional structure. Here, we present an extensive analysis of protein contact subnetworks based on the London van der Waals interactions of amino acids at different length scales. We further subdivided those networks in hydrophobic, hydrophilic and charged residues networks and have tried to correlate their influence in the overall topology and organization of a protein.
The largest connected component (LCC) of long (LRN)-, short (SRN)- and all-range (ARN) networks within proteins exhibit a transition behaviour when plotted against different interaction strengths of edges among amino acid nodes. While short-range networks having chain like structures exhibit highly cooperative transition; long- and all-range networks, which are more similar to each other, have non-chain like structures and show less cooperativity. Further, the hydrophobic residues subnetworks in long- and all-range networks have similar transition behaviours with all residues all-range networks, but the hydrophilic and charged residues networks don't. While the nature of transitions of LCC's sizes is same in SRNs for thermophiles and mesophiles, there exists a clear difference in LRNs. The presence of larger size of interconnected long-range interactions in thermophiles than mesophiles, even at higher interaction strength between amino acids, give extra stability to the tertiary structure of the thermophiles. All the subnetworks at different length scales (ARNs, LRNs and SRNs) show assortativity mixing property of their participating amino acids. While there exists a significant higher percentage of hydrophobic subclusters over others in ARNs and LRNs; we do not find the assortative mixing behaviour of any the subclusters in SRNs. The clustering coefficient of hydrophobic subclusters in long-range network is the highest among types of subnetworks. There exist highly cliquish hydrophobic nodes followed by charged nodes in LRNs and ARNs; on the other hand, we observe the highest dominance of charged residues cliques in short-range networks. Studies on the perimeter of the cliques also show higher occurrences of hydrophobic and charged residues' cliques.
The simple framework of protein contact networks and their subnetworks based on London van der Waals force is able to capture several known properties of protein structure as well as can unravel several new features. The thermophiles do not only have the higher number of long-range interactions; they also have larger cluster of connected residues at higher interaction strengths among amino acids, than their mesophilic counterparts. It can reestablish the significant role of long-range hydrophobic clusters in protein folding and stabilization; at the same time, it shed light on the higher communication ability of hydrophobic subnetworks over the others. The results give an indication of the controlling role of hydrophobic subclusters in determining protein's folding rate. The occurrences of higher perimeters of hydrophobic and charged cliques imply the role of charged residues as well as hydrophobic residues in stabilizing the distant part of primary structure of a protein through London van der Waals interaction.
蛋白质的三维结构可以描述为一个图,其中节点代表残基,节点之间的非共价相互作用的强度为边。这些蛋白质接触网络可以根据氨基酸在一级结构中的位置分为长程和短程相互作用网络。长程相互作用在确定蛋白质的三级结构方面起着独特的作用,而短程相互作用可能在很大程度上有助于二级结构的形成。此外,蛋白质一级结构的理化性质和氨基酸的线性排列决定了其三维结构。在这里,我们基于氨基酸的伦敦范德华相互作用,对蛋白质接触子网络进行了广泛的分析。我们进一步将这些网络细分为疏水、亲水和带电残基网络,并试图将它们的影响与蛋白质的整体拓扑结构和组织相关联。
蛋白质中长程(LRN)、短程(SRN)和全范围(ARN)网络的最大连通分量(LCC)在不同的氨基酸节点之间的边相互作用强度下呈过渡行为。尽管具有链式结构的短程网络表现出高度协同的转变,但与彼此更相似的长程和全范围网络具有非链式结构,协同性较低。此外,长程和全范围网络中的疏水残基子网络与所有残基全范围网络具有相似的转变行为,但亲水和带电残基网络则不然。虽然在嗜热菌和中温菌中,SRN 中 LCC 大小的转变性质相同,但在 LRN 中存在明显差异。嗜热菌中存在较大的长程相互作用,即使在更高的氨基酸相互作用强度下,也能为其三级结构提供额外的稳定性。不同长度尺度(ARN、LRN 和 SRN)的所有子网都表现出参与氨基酸的聚类混合特性。虽然在 ARN 和 LRN 中,疏水亚簇的存在比例显著高于其他亚簇;但我们没有发现 SRN 中任何亚簇的聚类混合行为。长程网络中疏水亚簇的聚类系数在所有类型的子网中最高。LRN 和 ARN 中存在高度连接的疏水节点,其次是带电节点;另一方面,我们观察到短程网络中带电荷残基聚类的主导地位最高。对聚类的周长的研究也表明,疏水和带电荷残基聚类的出现频率更高。
基于伦敦范德华力的蛋白质接触网络及其子网的简单框架能够捕捉到蛋白质结构的几个已知性质,同时也能够揭示出几个新的特征。嗜热菌不仅具有更多的长程相互作用;它们在氨基酸之间的相互作用强度较高时,也具有更大的连接残基簇,比它们的中温菌对应物多。它可以重新确立长程疏水簇在蛋白质折叠和稳定中的重要作用;同时,它也揭示了疏水子网比其他子网更高的通讯能力。研究结果表明,疏水亚簇在决定蛋白质折叠速率方面起着控制作用。更高的疏水和带电荷聚类周长的出现意味着带电荷残基和疏水残基在通过伦敦范德华相互作用稳定蛋白质一级结构的远距离部分方面的作用。