Hu Rui-Si, Gu Kui, Ehsan Muhammad, Abbas Raza Sayed Haidar, Wang Chun-Ren
School of Health and Wellness Industry & School of Medicine, Sichuan University of Arts and Science, Dazhou, Sichuan Province, People's Republic of China.
Key Laboratory of Intelligent Medicine and Health Data Science, Sichuan University of Arts and Science, Dazhou, Sichuan Province, People's Republic of China.
PLoS Negl Trop Dis. 2025 Apr 29;19(4):e0012985. doi: 10.1371/journal.pntd.0012985. eCollection 2025 Apr.
The identification of B-cell epitopes (BCEs) is fundamental to advancing epitope-based vaccine design, therapeutic antibody development, and diagnostics, such as in neglected tropical diseases caused by parasitic pathogens. However, the structural complexity of parasite antigens and the high cost of experimental validation present certain challenges. Advances in Artificial Intelligence (AI)-driven protein engineering, particularly through machine learning and deep learning, offer efficient solutions to enhance prediction accuracy and reduce experimental costs.
METHODOLOGY/PRINCIPAL FINDINGS: Here, we present deepBCE-Parasite, a Transformer-based deep learning model designed to predict linear BCEs from peptide sequences. By leveraging a state-of-the-art self-attention mechanism, the model achieved remarkable predictive performance, achieving an accuracy of approximately 81% and an AUC of 0.90 in both 10-fold cross-validation and independent testing. Comparative analyses against 12 handcrafted features and four conventional machine learning algorithms (GNB, SVM, RF, and LGBM) highlighted the superior predictive power of the model. As a case study, deepBCE-Parasite predicted eight BCEs from the leucine aminopeptidase (LAP) protein in Fasciola hepatica proteomic data. Dot-blot immunoassays confirmed the specific binding of seven synthetic peptides to positive sera, validating their IgG reactivity and demonstrating the model's efficacy in BCE prediction.
CONCLUSIONS/SIGNIFICANCE: deepBCE-Parasite demonstrates excellent performance in predicting BCEs across diverse parasitic pathogens, offering a valuable tool for advancing the design of epitope-based vaccines, antibodies, and diagnostic applications in parasitology.
B细胞表位(BCE)的鉴定对于推进基于表位的疫苗设计、治疗性抗体开发以及诊断至关重要,例如在由寄生性病原体引起的被忽视热带病中。然而,寄生虫抗原的结构复杂性以及实验验证的高成本带来了一定挑战。人工智能(AI)驱动的蛋白质工程进展,特别是通过机器学习和深度学习,提供了提高预测准确性和降低实验成本的有效解决方案。
方法/主要发现:在此,我们展示了deepBCE-Parasite,这是一种基于Transformer的深度学习模型,旨在从肽序列预测线性BCE。通过利用先进的自注意力机制,该模型取得了卓越的预测性能,在10折交叉验证和独立测试中均达到了约81%的准确率和0.90的AUC。与12种手工特征和四种传统机器学习算法(GNB、SVM、RF和LGBM)的比较分析突出了该模型的优越预测能力。作为案例研究,deepBCE-Parasite从肝片吸虫蛋白质组数据中的亮氨酸氨肽酶(LAP)蛋白预测了8个BCE。斑点印迹免疫分析证实了7种合成肽与阳性血清的特异性结合,验证了它们的IgG反应性,并证明了该模型在BCE预测中的有效性。
结论/意义:deepBCE-Parasite在预测多种寄生性病原体的BCE方面表现出色,为推进基于表位的疫苗、抗体设计以及寄生虫学诊断应用提供了有价值的工具。