SEMA: Antigen B-cell conformational epitope prediction using deep transfer learning.

Affiliations

Artificial Intelligence Research Institute, Moscow, Russia.

Sber AI Lab, Moscow, Russia.

Publication information

Front Immunol. 2022 Sep 15;13:960985. doi: 10.3389/fimmu.2022.960985. eCollection 2022.

DOI:10.3389/fimmu.2022.960985

PMID:36189325

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9523212/

Abstract

One of the primary tasks in vaccine design and in the development of immunotherapeutic drugs is to predict conformational B-cell epitopes corresponding to the primary antibody binding sites within the antigen tertiary structure. To date, multiple approaches have been developed to address this problem; however, their accuracy is limited for a wide range of antigens. In this paper, we applied transfer learning with pretrained deep learning models to develop a model that predicts conformational B-cell epitopes from the primary antigen sequence and tertiary structure. A pretrained protein language model, ESM-1v, and an inverse folding model, ESM-IF1, were fine-tuned to quantitatively predict antibody-antigen interaction features and to distinguish between epitope and non-epitope residues. The resulting model, called SEMA, achieved a ROC AUC of 0.76 on an independent test set, the best performance among the peer-reviewed tools it was compared with. We show that SEMA can quantitatively rank the immunodominant regions within the SARS-CoV-2 RBD. SEMA is available at https://github.com/AIRI-Institute/SEMAi and through the web interface at http://sema.airi.net.
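The abstract reports performance as ROC AUC over epitope versus non-epitope residues. As a rough illustration of what that metric measures, the sketch below computes ROC AUC from per-residue scores using the rank-based (Mann-Whitney) formulation. It is not taken from the SEMA code base: the function name `roc_auc` and the toy `labels` and `scores` are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC via the Mann-Whitney U statistic, using average ranks for ties.

    labels: 1 for an epitope residue, 0 for a non-epitope residue.
    scores: predicted per-residue score (higher means more epitope-like).
    """
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative residue")

    # Sort residue indices by score, then assign 1-based ranks,
    # giving tied scores the average of the ranks they span.
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1

    # U counts (positive, negative) pairs where the positive is ranked higher;
    # ties contribute one half.
    pos_rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    u = pos_rank_sum - n_pos * (n_pos + 1) / 2
    return u / (n_pos * n_neg)


# Toy example (made-up values, not SEMA output): 8 residues, 3 labelled epitope.
labels = [0, 0, 1, 1, 0, 1, 0, 0]
scores = [0.10, 0.40, 0.35, 0.80, 0.20, 0.70, 0.05, 0.30]
print(f"ROC AUC = {roc_auc(labels, scores):.3f}")
```

A value of 0.5 corresponds to random ranking of residues and 1.0 to a perfect separation of epitope from non-epitope residues, which gives context for the 0.76 reported above.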
