• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SeRenDIP-CE:基于序列的构象表位界面预测

SeRenDIP-CE: sequence-based interface prediction for conformational epitopes.

作者信息

Hou Qingzhen, Stringer Bas, Waury Katharina, Capel Henriette, Haydarlou Reza, Xue Fuzhong, Abeln Sanne, Heringa Jaap, Feenstra K Anton

机构信息

Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Shandong 250002, China.

National Institute of Health Data Science of China, Shandong University, Shandong 250002, China.

出版信息

Bioinformatics. 2021 Oct 25;37(20):3421-3427. doi: 10.1093/bioinformatics/btab321.

DOI:10.1093/bioinformatics/btab321
PMID:33974039
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8136078/
Abstract

MOTIVATION

Antibodies play an important role in clinical research and biotechnology, with their specificity determined by the interaction with the antigen's epitope region, as a special type of protein-protein interaction (PPI) interface. The ubiquitous availability of sequence data, allows us to predict epitopes from sequence in order to focus time-consuming wet-lab experiments toward the most promising epitope regions. Here, we extend our previously developed sequence-based predictors for homodimer and heterodimer PPI interfaces to predict epitope residues that have the potential to bind an antibody.

RESULTS

We collected and curated a high quality epitope dataset from the SAbDab database. Our generic PPI heterodimer predictor obtained an AUC-ROC of 0.666 when evaluated on the epitope test set. We then trained a random forest model specifically on the epitope dataset, reaching AUC 0.694. Further training on the combined heterodimer and epitope datasets, improves our final predictor to AUC 0.703 on the epitope test set. This is better than the best state-of-the-art sequence-based epitope predictor BepiPred-2.0. On one solved antibody-antigen structure of the COVID19 virus spike receptor binding domain, our predictor reaches AUC 0.778. We added the SeRenDIP-CE Conformational Epitope predictors to our webserver, which is simple to use and only requires a single antigen sequence as input, which will help make the method immediately applicable in a wide range of biomedical and biomolecular research.

AVAILABILITY AND IMPLEMENTATION

Webserver, source code and datasets at www.ibi.vu.nl/programs/serendipwww/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

抗体在临床研究和生物技术中发挥着重要作用,其特异性由与抗原表位区域的相互作用决定,抗原表位区域是一种特殊类型的蛋白质 - 蛋白质相互作用(PPI)界面。序列数据的广泛可得性使我们能够从序列中预测表位,以便将耗时的湿实验室实验聚焦于最有前景的表位区域。在此,我们将先前开发的用于同源二聚体和异源二聚体PPI界面的基于序列的预测器进行扩展,以预测有可能与抗体结合的表位残基。

结果

我们从SAbDab数据库收集并整理了一个高质量的表位数据集。当在表位测试集上进行评估时,我们的通用PPI异源二聚体预测器获得了0.666的AUC-ROC。然后我们专门在表位数据集上训练了一个随机森林模型,达到了0.694的AUC。在组合的异源二聚体和表位数据集上进一步训练后,我们的最终预测器在表位测试集上的AUC提高到了0.703。这比基于序列的最佳现有表位预测器BepiPred-2.0更好。在一种已解析的新冠病毒刺突受体结合域的抗体 - 抗原结构上,我们的预测器达到了0.778的AUC。我们将SeRenDIP-CE构象表位预测器添加到了我们的网络服务器中,该服务器使用简单,仅需输入单个抗原序列,这将有助于使该方法立即应用于广泛的生物医学和生物分子研究。

可用性与实现

网络服务器、源代码和数据集可在www.ibi.vu.nl/programs/serendipwww/获取。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e581/8545293/8a84dddc539e/btab321f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e581/8545293/5f402ee8d797/btab321f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e581/8545293/8a84dddc539e/btab321f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e581/8545293/5f402ee8d797/btab321f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e581/8545293/8a84dddc539e/btab321f2.jpg

相似文献

1
SeRenDIP-CE: sequence-based interface prediction for conformational epitopes.SeRenDIP-CE:基于序列的构象表位界面预测
Bioinformatics. 2021 Oct 25;37(20):3421-3427. doi: 10.1093/bioinformatics/btab321.
2
SeRenDIP: SEquential REmasteriNg to DerIve profiles for fast and accurate predictions of PPI interface positions.SeRenDIP:用于快速准确预测 PPI 界面位置的序列重铸衍生预测 profile。
Bioinformatics. 2019 Nov 1;35(22):4794-4796. doi: 10.1093/bioinformatics/btz428.
3
Seeing the trees through the forest: sequence-based homo- and heteromeric protein-protein interaction sites prediction using random forest.透过森林看树木:使用随机森林基于序列预测同源和异源多聚体蛋白质-蛋白质相互作用位点
Bioinformatics. 2017 May 15;33(10):1479-1487. doi: 10.1093/bioinformatics/btx005.
4
BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes.BepiPred-2.0:利用构象表位改进基于序列的 B 细胞表位预测。
Nucleic Acids Res. 2017 Jul 3;45(W1):W24-W29. doi: 10.1093/nar/gkx346.
5
Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based feature.基于距离特征的随机森林算法从 3D 结构预测构象 B 细胞表位
BMC Bioinformatics. 2011 Aug 17;12:341. doi: 10.1186/1471-2105-12-341.
6
PIPENN: protein interface prediction from sequence with an ensemble of neural nets.PIPENN:利用神经网络集成从序列预测蛋白质界面
Bioinformatics. 2022 Apr 12;38(8):2111-2118. doi: 10.1093/bioinformatics/btac071.
7
Pep-3D-Search: a method for B-cell epitope prediction based on mimotope analysis.Pep-3D-Search:一种基于模拟表位分析的B细胞表位预测方法。
BMC Bioinformatics. 2008 Dec 16;9:538. doi: 10.1186/1471-2105-9-538.
8
Ab-initio conformational epitope structure prediction using genetic algorithm and SVM for vaccine design.基于遗传算法和 SVM 的从头构象表位结构预测在疫苗设计中的应用。
Comput Methods Programs Biomed. 2018 Jan;153:161-170. doi: 10.1016/j.cmpb.2017.10.011. Epub 2017 Oct 12.
9
Single-residue linear and conformational B cell epitopes prediction using random and ESM-2 based projections.基于随机和 ESM-2 投影的单残基线性和构象 B 细胞表位预测。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae084.
10
EPMLR: sequence-based linear B-cell epitope prediction method using multiple linear regression.EPMLR:基于序列的线性B细胞表位预测方法,采用多元线性回归。
BMC Bioinformatics. 2014 Dec 19;15(1):414. doi: 10.1186/s12859-014-0414-y.

引用本文的文献

1
Prediction of antibody-antigen interaction based on backbone aware with invariant point attention.基于具有不变点注意力的骨架感知的抗体-抗原相互作用预测。
BMC Bioinformatics. 2024 Nov 6;25(1):348. doi: 10.1186/s12859-024-05961-w.
2
PMSFF: Improved Protein Binding Residues Prediction through Multi-Scale Sequence-Based Feature Fusion Strategy.PMSFF:通过多尺度序列的基于特征融合策略来提高蛋白质结合残基预测。
Biomolecules. 2024 Sep 27;14(10):1220. doi: 10.3390/biom14101220.
3
A comprehensive overview of recent advances in generative models for antibodies.
抗体生成模型近期进展的全面概述。
Comput Struct Biotechnol J. 2024 Jun 20;23:2648-2660. doi: 10.1016/j.csbj.2024.06.016. eCollection 2024 Dec.
4
DL-PPI: a method on prediction of sequenced protein-protein interaction based on deep learning.DL-PPI:一种基于深度学习的预测序列蛋白质相互作用的方法。
BMC Bioinformatics. 2023 Dec 14;24(1):473. doi: 10.1186/s12859-023-05594-5.
5
Recent Progress in Antibody Epitope Prediction.抗体表位预测的最新进展
Antibodies (Basel). 2023 Aug 8;12(3):52. doi: 10.3390/antib12030052.
6
The CDR3 region as the major driver of TREM-1 interaction with its ligands, an characterization.作为TREM-1与其配体相互作用的主要驱动因素的CDR3区域,一项表征。 (原英文句子表述似乎不太完整准确,翻译出来的中文也稍显生硬,但严格按照要求进行了翻译)
Comput Struct Biotechnol J. 2023 Apr 13;21:2579-2590. doi: 10.1016/j.csbj.2023.04.008. eCollection 2023.
7
In silico and experimental methods for designing a potent anticancer arazyme-herceptin fusion protein in HER2-positive breast cancer.用于设计HER2阳性乳腺癌中一种强效抗癌抗酶-赫赛汀融合蛋白的计算机模拟和实验方法
J Mol Model. 2023 Apr 27;29(5):160. doi: 10.1007/s00894-023-05562-z.
8
Ten quick tips for sequence-based prediction of protein properties using machine learning.使用机器学习进行基于序列的蛋白质性质预测的十个快速技巧。
PLoS Comput Biol. 2022 Dec 1;18(12):e1010669. doi: 10.1371/journal.pcbi.1010669. eCollection 2022 Dec.
9
Bioinformatics tools and data resources for assay development of fluid protein biomarkers.用于液体蛋白质生物标志物检测开发的生物信息学工具和数据资源。
Biomark Res. 2022 Nov 15;10(1):83. doi: 10.1186/s40364-022-00425-w.
10
ProteinGLUE multi-task benchmark suite for self-supervised protein modeling.蛋白质 GLUE 多任务基准套件,用于自监督蛋白质建模。
Sci Rep. 2022 Sep 26;12(1):16047. doi: 10.1038/s41598-022-19608-4.