School of Computer Science and Engineering, Central South University, Changsha, 410083, China.
College of Computer and Information Engineering, Nanjing Tech University, Nanjing, 211816, China.
Interdiscip Sci. 2024 Dec;16(4):951-965. doi: 10.1007/s12539-024-00640-z. Epub 2024 Jul 7.
The emergence of antibiotic-resistant microbes raises a pressing demand for novel alternative treatments. One promising alternative is the antimicrobial peptides (AMPs), a class of innate immunity mediators within the therapeutic peptide realm. AMPs offer salient advantages such as high specificity, cost-effective synthesis, and reduced toxicity. Although some computational methodologies have been proposed to identify potential AMPs with the rapid development of artificial intelligence techniques, there is still ample room to improve their performance. This study proposes a predictive framework which ensembles deep learning and statistical learning methods to screen peptides with antimicrobial activity. We integrate multiple LightGBM classifiers and convolution neural networks which leverages various predicted sequential, structural and physicochemical properties from their residue sequences extracted by diverse machine learning paradigms. Comparative experiments exhibit that our method outperforms other state-of-the-art approaches on an independent test dataset, in terms of representative capability measures. Besides, we analyse the discrimination quality under different varieties of attribute information and it reveals that combination of multiple features could improve prediction. In addition, a case study is carried out to illustrate the exemplary favorable identification effect. We establish a web application at http://amp.denglab.org to provide convenient usage of our proposal and make the predictive framework, source code, and datasets publicly accessible at https://github.com/researchprotein/amp .
抗生素耐药微生物的出现提出了对新型替代疗法的迫切需求。一种有前途的替代方法是抗菌肽 (AMPs),它是治疗性肽领域中先天免疫介质的一类。AMPs 具有高度特异性、经济高效的合成和降低毒性等突出优势。尽管随着人工智能技术的快速发展,已经提出了一些计算方法来识别潜在的 AMPs,但仍有很大的改进空间。本研究提出了一种预测框架,该框架集成了深度学习和统计学习方法来筛选具有抗菌活性的肽。我们整合了多个 LightGBM 分类器和卷积神经网络,这些分类器利用了从不同机器学习范例中提取的残基序列的各种预测序列、结构和物理化学特性。对比实验表明,我们的方法在独立测试数据集上的代表性能力度量方面优于其他最先进的方法。此外,我们分析了不同属性信息种类下的区分质量,结果表明多种特征的组合可以提高预测性能。此外,还进行了案例研究以说明示例识别效果。我们在 http://amp.denglab.org 上建立了一个网络应用程序,以方便使用我们的建议,并在 https://github.com/researchprotein/amp 上公开提供预测框架、源代码和数据集。