Bijvoet Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584, Utrecht, CH, The Netherlands.
Commun Biol. 2024 Jan 6;7(1):49. doi: 10.1038/s42003-023-05718-w.
The formation of a stable complex between proteins lies at the core of a wide variety of biological processes and has been the focus of countless experiments. The huge amount of information contained in the protein structural interactome in the Protein Data Bank can now be used to characterise and classify the existing biological interfaces. We here introduce ARCTIC-3D, a fast and user-friendly data mining and clustering software to retrieve data and rationalise the interface information associated with the protein input data. We demonstrate its use by various examples ranging from showing the increased interaction complexity of eukaryotic proteins, 20% of which on average have more than 3 different interfaces compared to only 10% for prokaryotes, to associating different functions to different interfaces. In the context of modelling biomolecular assemblies, we introduce the concept of "recognition entropy", related to the number of possible interfaces of the components of a protein-protein complex, which we demonstrate to correlate with the modelling difficulty in classical docking approaches. The identified interface clusters can also be used to generate various combinations of interface-specific restraints for integrative modelling. The ARCTIC-3D software is freely available at github.com/haddocking/arctic3d and can be accessed as a web-service at wenmr.science.uu.nl/arctic3d.
蛋白质稳定复合物的形成是各种生物过程的核心,也是无数实验的焦点。现在,可以利用蛋白质数据库中的蛋白质结构相互作用组中包含的大量信息来描述和分类现有的生物界面。我们在这里引入了 ARCTIC-3D,这是一种快速且用户友好的数据挖掘和聚类软件,可以检索数据并合理化与蛋白质输入数据相关的接口信息。我们通过各种示例演示了它的用途,包括展示真核蛋白质的相互作用复杂性增加,平均 20%的真核蛋白质具有 3 个以上不同的界面,而原核蛋白质只有 10%;以及将不同的功能与不同的界面相关联。在生物分子组装建模的背景下,我们引入了“识别熵”的概念,它与蛋白质-蛋白质复合物成分的可能界面数量有关,我们证明它与经典对接方法中的建模难度相关。识别出的界面簇还可用于生成各种特定于界面的约束的组合,用于综合建模。ARCTIC-3D 软件可在 github.com/haddocking/arctic3d 上免费获得,并可在 wenmr.science.uu.nl/arctic3d 上作为网络服务访问。