• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EpitopeVec:使用深度蛋白质序列嵌入进行线性表位预测。

EpitopeVec: linear epitope prediction using deep protein sequence embeddings.

机构信息

Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig 38124, Germany.

Braunschweig Integrated Center of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig 38106, Germany.

出版信息

Bioinformatics. 2021 Dec 7;37(23):4517-4525. doi: 10.1093/bioinformatics/btab467.

DOI:10.1093/bioinformatics/btab467
PMID:34180989
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8652027/
Abstract

MOTIVATION

B-cell epitopes (BCEs) play a pivotal role in the development of peptide vaccines, immuno-diagnostic reagents and antibody production, and thus in infectious disease prevention and diagnostics in general. Experimental methods used to determine BCEs are costly and time-consuming. Therefore, it is essential to develop computational methods for the rapid identification of BCEs. Although several computational methods have been developed for this task, generalizability is still a major concern, where cross-testing of the classifiers trained and tested on different datasets has revealed accuracies of 51-53%.

RESULTS

We describe a new method called EpitopeVec, which uses a combination of residue properties, modified antigenicity scales, and protein language model-based representations (protein vectors) as features of peptides for linear BCE predictions. Extensive benchmarking of EpitopeVec and other state-of-the-art methods for linear BCE prediction on several large and small datasets, as well as cross-testing, demonstrated an improvement in the performance of EpitopeVec over other methods in terms of accuracy and area under the curve. As the predictive performance depended on the species origin of the respective antigens (viral, bacterial and eukaryotic), we also trained our method on a large viral dataset to create a dedicated linear viral BCE predictor with improved cross-testing performance.

AVAILABILITY AND IMPLEMENTATION

The software is available at https://github.com/hzi-bifo/epitope-prediction.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

B 细胞表位(BCEs)在肽疫苗、免疫诊断试剂和抗体生产的开发中起着关键作用,因此在传染病的预防和诊断中也起着重要作用。用于确定 BCEs 的实验方法既昂贵又耗时。因此,开发用于快速识别 BCEs 的计算方法至关重要。尽管已经开发了几种用于此任务的计算方法,但可泛化性仍然是一个主要关注点,其中在不同数据集上训练和测试的分类器的交叉测试显示出 51-53%的准确率。

结果

我们描述了一种新的方法,称为 EpitopeVec,它使用残基特性、改良的抗原性量表以及基于蛋白质语言模型的表示(蛋白质向量)的组合作为肽的特征,用于线性 BCE 预测。在几个大型和小型数据集上对 EpitopeVec 和其他用于线性 BCE 预测的最新方法进行了广泛的基准测试和交叉测试,结果表明,EpitopeVec 在准确性和曲线下面积方面的性能优于其他方法。由于预测性能取决于各自抗原的物种来源(病毒、细菌和真核生物),我们还在一个大型病毒数据集上训练了我们的方法,以创建一个具有改进交叉测试性能的专用线性病毒 BCE 预测器。

可用性和实现

该软件可在 https://github.com/hzi-bifo/epitope-prediction 上获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96d5/8652027/1ea97873586b/btab467f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96d5/8652027/6a865f83568e/btab467f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96d5/8652027/6d56b6a5c56c/btab467f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96d5/8652027/1ea97873586b/btab467f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96d5/8652027/6a865f83568e/btab467f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96d5/8652027/6d56b6a5c56c/btab467f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96d5/8652027/1ea97873586b/btab467f3.jpg

相似文献

1
EpitopeVec: linear epitope prediction using deep protein sequence embeddings.EpitopeVec:使用深度蛋白质序列嵌入进行线性表位预测。
Bioinformatics. 2021 Dec 7;37(23):4517-4525. doi: 10.1093/bioinformatics/btab467.
2
iBCE-EL: A New Ensemble Learning Framework for Improved Linear B-Cell Epitope Prediction.iBCE-EL:一种用于改进线性 B 细胞表位预测的新集成学习框架。
Front Immunol. 2018 Jul 27;9:1695. doi: 10.3389/fimmu.2018.01695. eCollection 2018.
3
Prediction of linear B-cell epitopes based on protein sequence features and BERT embeddings.基于蛋白质序列特征和 BERT 嵌入的线性 B 细胞表位预测。
Sci Rep. 2024 Jan 30;14(1):2464. doi: 10.1038/s41598-024-53028-w.
4
LBCE-XGB: A XGBoost Model for Predicting Linear B-Cell Epitopes Based on BERT Embeddings.LBCE-XGB:一种基于BERT嵌入的用于预测线性B细胞表位的XGBoost模型。
Interdiscip Sci. 2023 Jun;15(2):293-305. doi: 10.1007/s12539-023-00549-z. Epub 2023 Jan 16.
5
NetBCE: An Interpretable Deep Neural Network for Accurate Prediction of Linear B-cell Epitopes.NetBCE:一种用于线性 B 细胞表位准确预测的可解释深度神经网络。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):1002-1012. doi: 10.1016/j.gpb.2022.11.009. Epub 2022 Dec 13.
6
EpiDope: a deep neural network for linear B-cell epitope prediction.EpiDope:用于线性 B 细胞表位预测的深度神经网络。
Bioinformatics. 2021 May 1;37(4):448-455. doi: 10.1093/bioinformatics/btaa773.
7
BepiPred-3.0: Improved B-cell epitope prediction using protein language models.BepiPred-3.0:使用蛋白质语言模型改进 B 细胞表位预测。
Protein Sci. 2022 Dec;31(12):e4497. doi: 10.1002/pro.4497.
8
BepFAMN: A Method for Linear B-Cell Epitope Predictions Based on Fuzzy-ARTMAP Artificial Neural Network.BepFAMN:一种基于模糊 ARTMAP 人工神经网络的线性 B 细胞表位预测方法。
Sensors (Basel). 2022 May 26;22(11):4027. doi: 10.3390/s22114027.
9
Multi-perspectives and challenges in identifying B-cell epitopes.鉴定 B 细胞表位的多视角和挑战。
Protein Sci. 2023 Nov;32(11):e4785. doi: 10.1002/pro.4785.
10
PEPITO: improved discontinuous B-cell epitope prediction using multiple distance thresholds and half sphere exposure.佩皮托:使用多个距离阈值和半球暴露改进的不连续B细胞表位预测
Bioinformatics. 2008 Jun 15;24(12):1459-60. doi: 10.1093/bioinformatics/btn199. Epub 2008 Apr 28.

引用本文的文献

1
A deep learning model for predicting systemic lupus erythematosus-associated epitopes.一种用于预测系统性红斑狼疮相关表位的深度学习模型。
BMC Med Inform Decis Mak. 2025 Jul 1;25(1):230. doi: 10.1186/s12911-025-03056-x.
2
Epitope-Based Vaccines: The Next Generation of Promising Vaccines Against Bacterial Infection.基于表位的疫苗:对抗细菌感染的下一代有前景的疫苗。
Vaccines (Basel). 2025 Feb 27;13(3):248. doi: 10.3390/vaccines13030248.
3
Revolutionizing oncology: the role of Artificial Intelligence (AI) as an antibody design, and optimization tools.

本文引用的文献

1
Linear B-Cell Epitope Prediction for In Silico Vaccine Design: A Performance Review of Methods Available via Command-Line Interface.线性 B 细胞表位预测在计算机疫苗设计中的应用:通过命令行界面提供的方法的性能评估。
Int J Mol Sci. 2021 Mar 22;22(6):3210. doi: 10.3390/ijms22063210.
2
EpiDope: a deep neural network for linear B-cell epitope prediction.EpiDope:用于线性 B 细胞表位预测的深度神经网络。
Bioinformatics. 2021 May 1;37(4):448-455. doi: 10.1093/bioinformatics/btaa773.
3
Bioinformatics analysis of epitope-based vaccine design against the novel SARS-CoV-2.
肿瘤学的变革:人工智能(AI)作为抗体设计与优化工具的作用。
Biomark Res. 2025 Mar 29;13(1):52. doi: 10.1186/s40364-025-00764-4.
4
Phylogeny-aware linear B-cell epitope predictor detects targets associated with immune response to orthopoxviruses.基于系统发育的线性 B 细胞表位预测器可检测与正痘病毒免疫反应相关的靶标。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae527.
5
A comprehensive overview of recent advances in generative models for antibodies.抗体生成模型近期进展的全面概述。
Comput Struct Biotechnol J. 2024 Jun 20;23:2648-2660. doi: 10.1016/j.csbj.2024.06.016. eCollection 2024 Dec.
6
Binding of Natural Antibodies Generated after COVID-19 and Vaccination with Individual Peptides Corresponding to the SARS-CoV-2 S-Protein.新型冠状病毒肺炎康复及接种疫苗后产生的天然抗体与对应严重急性呼吸综合征冠状病毒2刺突蛋白的单个肽段的结合情况
Vaccines (Basel). 2024 Apr 17;12(4):426. doi: 10.3390/vaccines12040426.
7
The rise of taxon-specific epitope predictors.税类特异性表位预测器的兴起。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae092.
8
Prediction of linear B-cell epitopes based on protein sequence features and BERT embeddings.基于蛋白质序列特征和 BERT 嵌入的线性 B 细胞表位预测。
Sci Rep. 2024 Jan 30;14(1):2464. doi: 10.1038/s41598-024-53028-w.
9
Brewpitopes: a pipeline to refine B-cell epitope predictions during public health emergencies.酿酒位:公共卫生突发事件期间用于精炼 B 细胞表位预测的流水线。
Front Immunol. 2023 Dec 6;14:1278534. doi: 10.3389/fimmu.2023.1278534. eCollection 2023.
10
A Lean Reverse Vaccinology Pipeline with Publicly Available Bioinformatic Tools.一种具有公共可用生物信息学工具的精益反向疫苗学管道。
Methods Mol Biol. 2023;2673:341-356. doi: 10.1007/978-1-0716-3239-0_24.
基于表位的新型 SARS-CoV-2 疫苗设计的生物信息学分析。
Infect Dis Poverty. 2020 Jul 10;9(1):88. doi: 10.1186/s40249-020-00713-3.
4
Deep learning methods improve linear B-cell epitope prediction.深度学习方法改进线性B细胞表位预测。
BioData Min. 2020 Apr 17;13:1. doi: 10.1186/s13040-020-00211-0. eCollection 2020.
5
Epitope Prediction by Novel Immunoinformatics Approach: A State-of-the-art Review.基于新型免疫信息学方法的表位预测:最新综述
Int J Pept Res Ther. 2020;26(2):1155-1163. doi: 10.1007/s10989-019-09918-z. Epub 2019 Aug 20.
6
A Sequence Homology and Bioinformatic Approach Can Predict Candidate Targets for Immune Responses to SARS-CoV-2.一种序列同源性和生物信息学方法可预测针对 SARS-CoV-2 的免疫反应的候选靶点。
Cell Host Microbe. 2020 Apr 8;27(4):671-680.e2. doi: 10.1016/j.chom.2020.03.002. Epub 2020 Mar 16.
7
Mutation effect estimation on protein-protein interactions using deep contextualized representation learning.使用深度上下文表示学习估计突变对蛋白质-蛋白质相互作用的影响。
NAR Genom Bioinform. 2020 Jun;2(2):lqaa015. doi: 10.1093/nargab/lqaa015. Epub 2020 Mar 5.
8
Peptide-Based Vaccines: Current Progress and Future Challenges.基于肽的疫苗:当前进展和未来挑战。
Chem Rev. 2020 Mar 25;120(6):3210-3229. doi: 10.1021/acs.chemrev.9b00472. Epub 2019 Dec 5.
9
The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.CAFA 挑战赛报告称,通过实验筛选,提高了数百个基因的蛋白质功能预测和新的功能注释。
Genome Biol. 2019 Nov 19;20(1):244. doi: 10.1186/s13059-019-1835-8.
10
Machine learning techniques for protein function prediction.基于机器学习的蛋白质功能预测技术。
Proteins. 2020 Mar;88(3):397-413. doi: 10.1002/prot.25832. Epub 2019 Nov 14.