Department of Radiation Oncology, University of Rochester School of Medicine and Dentistry, Rochester, NY, USA.
Department of Research and Development, Bio-Thera Solutions, Guangzhou, China.
Methods Mol Biol. 2023;2552:239-254. doi: 10.1007/978-1-0716-2609-2_12.
Identifying protein antigenic epitopes that are recognizable by antibodies is a key step in immunologic research. This type of research has broad medical applications, such as new immunodiagnostic reagent discovery, vaccine design, and antibody design. However, due to the countless possibilities of potential epitopes, the experimental search through trial and error would be too costly and time-consuming to be practical. To facilitate this process and improve its efficiency, computational methods were developed to predict both linear epitopes and discontinuous antigenic epitopes. For linear B-cell epitope prediction, many methods were developed, including PREDITOP, PEOPLE, BEPITOPE, BepiPred, COBEpro, ABCpred, AAP, BCPred, BayesB, BEOracle/BROracle, BEST, LBEEP, DRREP, iBCE-EL, SVMTriP, etc. For the more challenging yet important task of discontinuous epitope prediction, methods were also developed, including CEP, DiscoTope, PEPITO, ElliPro, SEPPA, EPITOPIA, PEASE, EpiPred, SEPIa, EPCES, EPSVR, etc. In this chapter, we will discuss computational methods for B-cell epitope predictions of both linear and discontinuous epitopes. SVMTriP and EPCES/EPCSVR, the most successful among the methods for each type of the predictions, will be used as model methods to detail the standard protocols. For linear epitope prediction, SVMTriP was reported to achieve a sensitivity of 80.1% and a precision of 55.2% with a fivefold cross-validation based on a large dataset, yielding an AUC of 0.702. For discontinuous or conformational B-cell epitope prediction, EPCES and EPCSVR were both benchmarked by a curated independent test dataset in which all antigens had no complex structures with the antibody. The identified epitopes by these methods were later independently validated by various biochemical experiments. For these three model methods, webservers and all datasets are publicly available at http://sysbio.unl.edu/SVMTriP , http://sysbio.unl.edu/EPCES/ , and http://sysbio.unl.edu/EPSVR/ .
鉴定可被抗体识别的蛋白质抗原表位是免疫研究的关键步骤。这种研究具有广泛的医学应用,例如新的免疫诊断试剂发现、疫苗设计和抗体设计。然而,由于潜在表位的可能性无数,通过反复试验进行实验搜索将过于昂贵和耗时,不切实际。为了促进这一过程并提高其效率,开发了计算方法来预测线性表位和不连续抗原表位。对于线性 B 细胞表位预测,开发了许多方法,包括 PREDITOP、PEOPLE、BEPITOPE、BepiPred、COBEpro、ABCpred、AAP、BCPred、BayesB、BEOracle/BROracle、BEST、LBEEP、DRREP、iBCE-EL、SVMTriP 等。对于更具挑战性但重要的不连续表位预测任务,也开发了方法,包括 CEP、DiscoTope、PEPITO、ElliPro、SEPPA、EPITOPIA、PEASE、EpiPred、SEPIa、EPCES、EPSVR 等。在本章中,我们将讨论线性和不连续表位的 B 细胞表位预测的计算方法。SVMTriP 和 EPCES/EPCSVR 是每种类型预测中最成功的方法,将被用作模型方法详细介绍标准协议。对于线性表位预测,SVMTriP 在基于大型数据集的五重交叉验证中报告的灵敏度为 80.1%,精度为 55.2%,AUC 为 0.702。对于不连续或构象 B 细胞表位预测,EPCES 和 EPCSVR 均在经过精心整理的独立测试数据集上进行了基准测试,其中所有抗原与抗体均无复杂结构。这些方法识别的表位后来通过各种生化实验进行了独立验证。对于这三种模型方法,Web 服务器和所有数据集均可在以下网址获得:http://sysbio.unl.edu/SVMTriP、http://sysbio.unl.edu/EPCES/ 和 http://sysbio.unl.edu/EPSVR/。