Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong, Shenzhen, China.
School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, China.
Protein Sci. 2023 Oct;32(10):e4758. doi: 10.1002/pro.4758.
Fungal infections have become a significant global health issue, affecting millions worldwide. Antifungal peptides (AFPs) have emerged as a promising alternative to conventional antifungal drugs due to their low toxicity and low propensity for inducing resistance. In this study, we developed a deep learning-based framework called DeepAFP to efficiently identify AFPs. DeepAFP fully leverages and mines composition information, evolutionary information, and physicochemical properties of peptides by employing combined kernels from multiple branches of convolutional neural network with bi-directional long short-term memory layers. In addition, DeepAFP integrates a transfer learning strategy to obtain efficient representations of peptides for improving model performance. DeepAFP demonstrates strong predictive ability on carefully curated datasets, yielding an accuracy of 93.29% and an F1-score of 93.45% on the DeepAFP-Main dataset. The experimental results show that DeepAFP outperforms existing AFP prediction tools, achieving state-of-the-art performance. Finally, we provide a downloadable AFP prediction tool to meet the demands of large-scale prediction and facilitate the usage of our framework by the public or other researchers. Our framework can accurately identify AFPs in a short time without requiring significant human and material resources, and hence can accelerate the development of AFPs as well as contribute to the treatment of fungal infections. Furthermore, our method can provide new perspectives for other biological sequence analysis tasks.
真菌感染已成为一个重大的全球健康问题,影响着全世界数百万人。抗真菌肽 (AFPs) 由于其低毒性和低耐药性诱导倾向,已成为传统抗真菌药物的一种有前途的替代品。在这项研究中,我们开发了一种基于深度学习的框架,称为 DeepAFP,用于有效地识别 AFP。DeepAFP 通过使用来自多个卷积神经网络分支的组合核和双向长短期记忆层,充分利用和挖掘肽的组成信息、进化信息和物理化学性质。此外,DeepAFP 集成了迁移学习策略,以获得用于提高模型性能的肽的有效表示。DeepAFP 在精心策划的数据集上表现出强大的预测能力,在 DeepAFP-Main 数据集上的准确率为 93.29%,F1 得分为 93.45%。实验结果表明,DeepAFP 优于现有的 AFP 预测工具,实现了最先进的性能。最后,我们提供了一个可下载的 AFP 预测工具,以满足大规模预测的需求,并方便公众或其他研究人员使用我们的框架。我们的框架可以在短时间内准确识别 AFP,而不需要大量的人力和物力资源,因此可以加速 AFP 的开发,并有助于治疗真菌感染。此外,我们的方法可以为其他生物序列分析任务提供新的视角。