Isaac Arnold Emerson, Sinha Sitabhra
Bioinformatics Division, School of Bio Sciences and Technology, VIT University, Vellore, India.
J Biosci. 2015 Oct;40(4):683-99. doi: 10.1007/s12038-015-9554-0.
The representation of proteins as networks of interacting amino acids, referred to as protein contact networks (PCN), and their subsequent analyses using graph theoretic tools, can provide novel insights into the key functional roles of specific groups of residues. We have characterized the networks corresponding to the native states of 66 proteins (belonging to different families) in terms of their core-periphery organization. The resulting hierarchical classification of the amino acid constituents of a protein arranges the residues into successive layers - having higher core order - with increasing connection density, ranging from a sparsely linked periphery to a densely intra-connected core (distinct from the earlier concept of protein core defined in terms of the three-dimensional geometry of the native state, which has least solvent accessibility). Our results show that residues in the inner cores are more conserved than those at the periphery. Underlining the functional importance of the network core, we see that the receptor sites for known ligand molecules of most proteins occur in the innermost core. Furthermore, the association of residues with structural pockets and cavities in binding or active sites increases with the core order. From mutation sensitivity analysis, we show that the probability of deleterious or intolerant mutations also increases with the core order. We also show that stabilization centre residues are in the innermost cores, suggesting that the network core is critically important in maintaining the structural stability of the protein. A publicly available Web resource for performing core-periphery analysis of any protein whose native state is known has been made available by us at http://www.imsc.res.in/ ~sitabhra/proteinKcore/index.html.
将蛋白质表示为相互作用氨基酸的网络,即蛋白质接触网络(PCN),并随后使用图论工具对其进行分析,可以为特定残基组的关键功能作用提供新的见解。我们根据核心-外围组织对66种蛋白质(属于不同家族)的天然状态对应的网络进行了表征。由此产生的蛋白质氨基酸成分的层次分类将残基排列成连续的层——具有更高的核心顺序——连接密度不断增加,范围从稀疏连接的外围到紧密内部连接的核心(不同于根据天然状态的三维几何定义的早期蛋白质核心概念,天然状态具有最小的溶剂可及性)。我们的结果表明,内核中的残基比外围的残基更保守。强调网络核心的功能重要性,我们发现大多数蛋白质已知配体分子的受体位点出现在最内层核心。此外,残基与结合或活性位点中的结构口袋和空腔的关联随着核心顺序的增加而增加。从突变敏感性分析中,我们表明有害或不耐受突变的概率也随着核心顺序的增加而增加。我们还表明,稳定中心残基位于最内层核心,这表明网络核心在维持蛋白质的结构稳定性方面至关重要。我们在http://www.imsc.res.in/ ~sitabhra/proteinKcore/index.html上提供了一个公开可用的网络资源,用于对任何已知天然状态的蛋白质进行核心-外围分析。