Hamdani Rahma, Cianferoni Damiano, Reche Raul, Delgado Javier, Serrano Luis
Department of Systems and Synthetic Biology, Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona, Spain.
Universitat Pompeu Fabra (UPF), Barcelona, Spain.
Protein Sci. 2025 Aug;34(8):e70220. doi: 10.1002/pro.70220.
Protein-protein interactions (PPI) are fundamental to cellular signaling, forming robust networks that govern critical biological processes such as immune response, cell growth, and signal transduction. Nanobody-based therapies have emerged as a key strategy for modulating PPIs, offering exceptional potential due to their high specificity, stability, and ability to access challenging epitopes on PPI interfaces inside cells. The rational design of nanobodies relies mainly on understanding and predicting their binding regions, particularly the residues that contribute the most to the binding energy (binding hotspots). Existing computational methods do not fully provide a scalable solution for hotspot identification in nanobody design, leaving a critical gap in the rational design of these therapeutics. Here, we present a scalable and structure-aware algorithm for hotspot prediction in nanobody design. The algorithm queries a curated database of triplets of interacting residues obtained from ~20,000 non-redundant PDB structures. We showed that these triplets contain structural and energetic information, being able to assess the stability effect of residue variations in protein structures, Pearson R = 0.63 (MSE = 1.58 kcal/mol). More important than effects on stability is the ability of the algorithm to predict binding hotspots of protein-protein generic complexes and more specifically in complexes containing nanobodies. HotspotPred reached an accuracy of 0.73 for hotspot residue identification in a protein interaction dataset of 1160 Alanine mutants and correctly identified in 63.4% of the cases we predicted at least 2 residues on the binding surface.
蛋白质-蛋白质相互作用(PPI)是细胞信号传导的基础,形成了强大的网络,控制着诸如免疫反应、细胞生长和信号转导等关键生物学过程。基于纳米抗体的疗法已成为调节PPI的关键策略,由于其高特异性、稳定性以及能够识别细胞内PPI界面上具有挑战性的表位,具有巨大的潜力。纳米抗体的合理设计主要依赖于对其结合区域的理解和预测,特别是对结合能贡献最大的残基(结合热点)。现有的计算方法不能完全为纳米抗体设计中的热点识别提供可扩展的解决方案,在这些治疗方法的合理设计中留下了关键差距。在此,我们提出了一种用于纳米抗体设计中热点预测的可扩展且结构感知的算法。该算法查询了一个经过整理的数据库,该数据库包含从约20,000个非冗余PDB结构中获得的相互作用残基三联体。我们表明,这些三联体包含结构和能量信息,能够评估蛋白质结构中残基变异的稳定性效应,皮尔逊相关系数R = 0.63(均方误差MSE = 1.58千卡/摩尔)。比稳定性影响更重要的是该算法预测蛋白质-蛋白质通用复合物,特别是包含纳米抗体的复合物的结合热点的能力。在1160个丙氨酸突变体的蛋白质相互作用数据集中,HotspotPred对热点残基识别的准确率达到0.73,并且在我们预测的63.4%的案例中,正确识别出结合表面上至少2个残基。