Romero Maylin, Marrero-Ponce Yovani, Rodríguez Hortensia, Agüero-Chapin Guillermin, Antunes Agostinho, Aguilera-Mendoza Longendri, Martinez-Rios Felix
School of Chemical Sciences and Engineering, Yachay Tech University, Hda. San Jose s/n y Proyecto Yachay, Urcuqui 100119, Ecuador.
Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas, Diego de Robles y vía Interoceánica, Pichincha, Quito 170157, Ecuador.
Antibiotics (Basel). 2022 Mar 17;11(3):401. doi: 10.3390/antibiotics11030401.
Peptide-based drugs are promising anticancer candidates due to their biocompatibility and low toxicity. In particular, tumor-homing peptides (THPs) have the ability to bind specifically to cancer cell receptors and tumor vasculature. Despite their potential to develop antitumor drugs, there are few available prediction tools to assist the discovery of new THPs. Two webservers based on machine learning models are currently active, the TumorHPD and the THPep, and more recently the SCMTHP. Herein, a novel method based on network science and similarity searching implemented in the starPep toolbox is presented for THP discovery. The approach leverages from exploring the structural space of THPs with Chemical Space Networks (CSNs) and from applying centrality measures to identify the most relevant and non-redundant THP sequences within the CSN. Such THPs were considered as queries (Qs) for multi-query similarity searches that apply a group fusion (MAX-SIM rule) model. The resulting multi-query similarity searching models (SSMs) were validated with three benchmarking datasets of THPs/non-THPs. The predictions achieved accuracies that ranged from 92.64 to 99.18% and Matthews Correlation Coefficients between 0.894-0.98, outperforming state-of-the-art predictors. The best model was applied to repurpose AMPs from the starPep database as THPs, which were subsequently optimized for the TH activity. Finally, 54 promising THP leads were discovered, and their sequences were analyzed to encounter novel motifs. These results demonstrate the potential of CSNs and multi-query similarity searching for the rapid and accurate identification of THPs.
基于肽的药物因其生物相容性和低毒性而成为有前景的抗癌候选药物。特别是,肿瘤归巢肽(THP)能够特异性结合癌细胞受体和肿瘤脉管系统。尽管它们有开发抗肿瘤药物的潜力,但用于辅助发现新THP的可用预测工具却很少。目前有两个基于机器学习模型的网络服务器在运行,即TumorHPD和THPep,最近还有SCMTHP。本文提出了一种基于网络科学和相似性搜索的新方法,该方法在starPep工具箱中实现,用于发现THP。该方法利用化学空间网络(CSN)探索THP的结构空间,并应用中心性度量来识别CSN中最相关且无冗余的THP序列。这些THP被视为多查询相似性搜索的查询(Q),该搜索应用组融合(MAX-SIM规则)模型。所得的多查询相似性搜索模型(SSM)用三个THP/非THP基准数据集进行了验证。预测准确率在92.64%至99.18%之间,马修斯相关系数在0.894 - 0.98之间,优于现有最佳预测器。最佳模型被用于将starPep数据库中的抗菌肽重新用作THP,随后对其TH活性进行了优化。最后,发现了54个有前景的THP先导物,并对其序列进行分析以发现新基序。这些结果证明了CSN和多查询相似性搜索在快速准确识别THP方面的潜力。