BIRC, School of Computer Engineering, Nanyang Technological University, Singapore, Singapore.
PLoS One. 2013 Apr 22;8(4):e59737. doi: 10.1371/journal.pone.0059737. Print 2013.
Specific binding between proteins plays a crucial role in molecular functions and biological processes. Protein binding interfaces and their atomic contacts are typically defined by simple criteria, such as distance-based definitions that only use some threshold of spatial distance in previous studies. These definitions neglect the nearby atomic organization of contact atoms, and thus detect predominant contacts which are interrupted by other atoms. It is questionable whether such kinds of interrupted contacts are as important as other contacts in protein binding. To tackle this challenge, we propose a new definition called beta (β) atomic contacts. Our definition, founded on the β-skeletons in computational geometry, requires that there is no other atom in the contact spheres defined by two contact atoms; this sphere is similar to the van der Waals spheres of atoms. The statistical analysis on a large dataset shows that β contacts are only a small fraction of conventional distance-based contacts. To empirically quantify the importance of β contacts, we design βACV, an SVM classifier with β contacts as input, to classify homodimers from crystal packing. We found that our βACV is able to achieve the state-of-the-art classification performance superior to SVM classifiers with distance-based contacts as input. Our βACV also outperforms several existing methods when being evaluated on several datasets in previous works. The promising empirical performance suggests that β contacts can truly identify critical specific contacts in protein binding interfaces. β contacts thus provide a new model for more precise description of atomic organization in protein quaternary structures than distance-based contacts.
蛋白质之间的特异性结合在分子功能和生物过程中起着至关重要的作用。蛋白质结合界面及其原子接触通常通过简单的标准来定义,例如基于距离的定义,在以前的研究中仅使用一些空间距离的阈值。这些定义忽略了接触原子的附近原子组织,因此仅检测到被其他原子中断的主要接触。这样的中断接触是否与蛋白质结合中的其他接触一样重要,这是值得怀疑的。为了解决这个挑战,我们提出了一种新的定义,称为β原子接触。我们的定义基于计算几何中的β骨架,要求在两个接触原子定义的接触球中没有其他原子;这个球类似于原子的范德华球。在大型数据集上的统计分析表明,β接触仅占传统基于距离的接触的一小部分。为了经验性地量化β接触的重要性,我们设计了βACV,这是一个带有β接触作为输入的 SVM 分类器,用于从晶体堆积中分类同源二聚体。我们发现,我们的βACV 能够实现最先进的分类性能,优于带有基于距离的接触作为输入的 SVM 分类器。当在以前的工作中的几个数据集上进行评估时,我们的βACV 也优于几个现有的方法。有希望的经验性能表明,β接触可以真正识别蛋白质结合界面中的关键特定接触。因此,β接触为更精确地描述蛋白质四级结构中的原子组织提供了一种新的模型,优于基于距离的接触。