Feng Hua, Sun Xuefeng, Li Ning, Xu Qian, Li Qin, Zhang Shenli, Xing Guangxu, Zhang Gaiping, Wang Fangyu
Institute for Animal Health, Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, 116 Huayuan Road, Zhengzhou 450002, China.
Longhu Laboratory, 218 Ping AN Avenue, Zhengzhou 450002, China.
ACS Omega. 2024 Nov 19;9(48):47893-47902. doi: 10.1021/acsomega.4c09718. eCollection 2024 Dec 3.
Because of their high affinity, specificity, and environmental stability, nanobodies (Nbs) have continuously received attention from the field of biological research. However, it is tough work to obtain high-affinity Nbs using experimental methods. In the current study, 12 machine learning algorithms were compared in parallel to explore the potential patterns between Nb-ligand affinity and eight noncovalent interactions. After model comparison and optimization, four optimized models (SVMrB, RotFB, RFB, and C50B) and two stacked models (StackKNN and StackRF) based on nine uncorrelated (correlation coefficient <0.65) optimized models were selected. All the models showed an accuracy of around 0.70 and high specificity. Compared to the other models, RotFB and RFB were not capable of predicting nonaffinitive Nbs with lower precision (<0.44) but showed higher sensitivity at 0.6761 and 0.3521 and good model robustness (F1 score and MCC values). On the contrary, SVMrB, C50B, and StackKNN were able to effectively predict the future nonaffinitive Nbs (specificity >0.92) and reduce the number of true affinitive Nbs (precision >0.5). On the other hand, StackRF showed intermediate model performance. Furthermore, an in-depth feature analysis indicated that hydrogen bonding and aromatic-associated interactions were the key noncovalent interactions in determining Nb-ligand binding affinity. In summary, the current study provides, for the first time, a tool that can effectively predict whether there is an affinity between nanobodies and their intended ligands and explores the key factors that influence their affinity, which could improve the screening and design process of Nbs and accelerate the development of Nb drugs and applications.
由于其高亲和力、特异性和环境稳定性,纳米抗体(Nbs)一直受到生物学研究领域的关注。然而,使用实验方法获得高亲和力的纳米抗体是一项艰巨的工作。在当前的研究中,并行比较了12种机器学习算法,以探索纳米抗体-配体亲和力与8种非共价相互作用之间的潜在模式。经过模型比较和优化,基于9个不相关(相关系数<0.65)的优化模型选择了4个优化模型(SVMrB、RotFB、RFB和C50B)和2个堆叠模型(StackKNN和StackRF)。所有模型的准确率均约为0.70,且具有高特异性。与其他模型相比,RotFB和RFB不能以较低的精度(<0.44)预测非亲和性纳米抗体,但在0.6761和0.3521时表现出较高的敏感性以及良好的模型稳健性(F1分数和MCC值)。相反,SVMrB、C50B和StackKNN能够有效预测未来的非亲和性纳米抗体(特异性>0.92)并减少真正亲和性纳米抗体的数量(精度>0.5)。另一方面,StackRF表现出中等的模型性能。此外,深入的特征分析表明,氢键和芳香族相关相互作用是决定纳米抗体-配体结合亲和力的关键非共价相互作用。总之,当前的研究首次提供了一种能够有效预测纳米抗体与其预期配体之间是否存在亲和力的工具,并探索了影响其亲和力的关键因素,这可以改善纳米抗体的筛选和设计过程,加速纳米抗体药物的开发和应用。