Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, 50200, Thailand.
Pediatric Translational Research Unit, Department of Pediatrics, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand.
Curr Med Chem. 2022;29(5):849-864. doi: 10.2174/0929867328666210810145806.
Cancer is one of the leading causes of death worldwide and the underlying angiogenesis represents one of the hallmarks of cancer. Efforts are already under way for the discovery of anti-angiogenic peptides (AAPs) as a promising therapeutic route, which tackle the formation of new blood vessels. As such, the identification of AAPs constitutes a viable path for understanding their mechanistic properties pertinent for the discovery of new anti-cancer drugs. In spite of the abundance of peptide sequences in public databases, experimental efforts in the identification of anti-angiogenic peptides have progressed very slowly owing to high expenditures and laborious nature. Owing to its inherent ability to make sense of large volumes of data, machine learning (ML) represents a lucrative technique that can be harnessed for peptide-based drug discovery. In this review, we conducted a comprehensive and comparative analysis of ML-based AAP predictors in terms of their employed feature descriptors, ML algorithms, cross-validation methods and prediction performance. Moreover, the common framework of these AAP predictors and their inherent weaknesses are also discussed. Particularly, we explore future perspectives for improving the prediction accuracy and model interpretability, which represent an interesting avenue for overcoming some of the inherent weaknesses of existing AAP predictors. We anticipate that this review would assist researchers in the rapid screening and identification of promising AAPs for clinical use.
癌症是全球主要死因之一,其潜在的血管生成是癌症的标志之一。目前已经在努力发现抗血管生成肽(AAPs)作为一种有前途的治疗途径,以解决新血管形成的问题。因此,鉴定 AAP 是了解其与发现新抗癌药物相关的机制特性的可行途径。尽管公共数据库中存在大量的肽序列,但由于高支出和繁琐的性质,抗血管生成肽的实验鉴定进展非常缓慢。由于其能够理解大量数据的固有能力,机器学习(ML)是一种有前途的技术,可以用于基于肽的药物发现。在这篇综述中,我们从所使用的特征描述符、ML 算法、交叉验证方法和预测性能等方面对基于 ML 的 AAP 预测器进行了全面和比较分析。此外,还讨论了这些 AAP 预测器的常见框架及其内在弱点。特别是,我们探讨了提高预测准确性和模型可解释性的未来展望,这为克服现有 AAP 预测器的一些内在弱点提供了一个有趣的途径。我们预计,这篇综述将有助于研究人员快速筛选和鉴定有临床应用前景的 AAP。