Castillo-Mendieta Kevin, Agüero-Chapin Guillermin, Marquez Edgar, Perez-Castillo Yunierkis, Barigye Stephen J, Pérez-Cárdenas Mariela, Peréz-Giménez Facundo, Marrero-Ponce Yovani
School of Biological Sciences and Engineering, Yachay Tech University, Hda. San José s/n y Proyecto Yachay, Urcuquí 100119, Ecuador.
CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, Terminal de Cruzeiros do Porto de Leixões, University of Porto, Av. General Norton de Matos s/n, 4450-208 Porto, Portugal.
Chem Res Toxicol. 2024 Apr 15;37(4):580-589. doi: 10.1021/acs.chemrestox.3c00408. Epub 2024 Mar 19.
The desirable pharmacological properties and a broad number of therapeutic activities have made peptides promising drugs over small organic molecules and antibody drugs. Nevertheless, toxic effects, such as hemolysis, have hampered the development of such promising drugs. Hence, a reliable computational tool to predict peptide hemolytic toxicity is enormously useful before synthesis and experimental evaluation. Currently, four web servers that predict hemolytic activity using machine learning (ML) algorithms are available; however, they exhibit some limitations, such as the need for a reliable negative set and limited application domain. Hence, we developed a robust model based on a novel theoretical approach that combines network science and a multiquery similarity searching (MQSS) method. A total of 1152 initial models were constructed from 144 scaffolds generated in a previous report. These were evaluated on external data sets, and the best models were fused and improved. Our best MQSS model outperformed all ML-based models and was used to characterize the prevalence of hemolytic toxicity on therapeutic peptides. Based on our model's estimation, the number of hemolytic peptides might be 3.9-fold higher than the reported.
理想的药理学特性和广泛的治疗活性使肽成为比小分子有机药物和抗体药物更有前景的药物。然而,诸如溶血等毒性作用阻碍了这类有前景药物的开发。因此,在合成和实验评估之前,一种可靠的预测肽溶血毒性的计算工具非常有用。目前,有四个使用机器学习(ML)算法预测溶血活性的网络服务器;然而,它们存在一些局限性,例如需要可靠的阴性集和有限的应用领域。因此,我们基于一种结合网络科学和多查询相似性搜索(MQSS)方法的新颖理论方法开发了一个强大的模型。从先前报告中生成的144个支架构建了总共1152个初始模型。这些模型在外部数据集上进行了评估,并且对最佳模型进行了融合和改进。我们最好的MQSS模型优于所有基于ML的模型,并用于表征治疗性肽溶血毒性的普遍性。根据我们模型的估计,溶血肽的数量可能比报道的高3.9倍。