Department of Computer Science, National Chiao Tung University, 1001 University Rd,, Hsinchu, Taiwan.
BMC Bioinformatics. 2014 Nov 18;15(1):378. doi: 10.1186/s12859-014-0378-y.
One of the major challenges in the field of vaccine design is identifying B-cell epitopes in continuously evolving viruses. Various tools have been developed to predict linear or conformational epitopes, each relying on different physicochemical properties and adopting distinct search strategies. We propose a meta-learning approach for epitope prediction based on stacked and cascade generalizations. Through meta learning, we expect a meta learner to be able integrate multiple prediction models, and outperform the single best-performing model. The objective of this study is twofold: (1) to analyze the complementary predictive strengths in different prediction tools, and (2) to introduce a generic computational model to exploit the synergy among various prediction tools. Our primary goal is not to develop any particular classifier for B-cell epitope prediction, but to advocate the feasibility of meta learning to epitope prediction. With the flexibility of meta learning, the researcher can construct various meta classification hierarchies that are applicable to epitope prediction in different protein domains.
We developed the hierarchical meta-learning architectures based on stacked and cascade generalizations. The bottom level of the hierarchy consisted of four conformational and four linear epitope prediction tools that served as the base learners. To perform consistent and unbiased comparisons, we tested the meta-learning method on an independent set of antigen proteins that were not used previously to train the base epitope prediction tools. In addition, we conducted correlation and ablation studies of the base learners in the meta-learning model. Low correlation among the predictions of the base learners suggested that the eight base learners had complementary predictive capabilities. The ablation analysis indicated that the eight base learners differentially interacted and contributed to the final meta model. The results of the independent test demonstrated that the meta-learning approach markedly outperformed the single best-performing epitope predictor.
Computational B-cell epitope prediction tools exhibit several differences that affect their performances when predicting epitopic regions in protein antigens. The proposed meta-learning approach for epitope prediction combines multiple prediction tools by integrating their complementary predictive strengths. Our experimental results demonstrate the superior performance of the combined approach in comparison with single epitope predictors.
在疫苗设计领域,一个主要挑战是识别不断进化的病毒中的 B 细胞表位。已经开发了各种工具来预测线性或构象表位,每个工具都依赖于不同的物理化学性质,并采用不同的搜索策略。我们提出了一种基于堆叠和级联泛化的元学习方法来进行表位预测。通过元学习,我们期望元学习者能够整合多个预测模型,并优于单个表现最佳的模型。本研究的目的有两个:(1)分析不同预测工具的互补预测优势,(2)引入一种通用的计算模型来利用各种预测工具之间的协同作用。我们的主要目标不是开发任何特定的 B 细胞表位预测分类器,而是提倡元学习在表位预测中的可行性。通过元学习的灵活性,研究人员可以构建各种适用于不同蛋白质结构域中表位预测的元分类层次结构。
我们基于堆叠和级联泛化开发了层次元学习架构。层次结构的底层由四个构象和四个线性表位预测工具组成,作为基础学习者。为了进行一致和无偏的比较,我们在一组独立的抗原蛋白上测试了元学习方法,这些蛋白以前没有用于训练基础表位预测工具。此外,我们还对元学习模型中的基础学习者进行了相关性和消融研究。基础学习者的预测之间的低相关性表明,这八个基础学习者具有互补的预测能力。消融分析表明,八个基础学习者以不同的方式相互作用并为最终的元模型做出贡献。独立测试的结果表明,元学习方法明显优于单个表现最佳的表位预测器。
计算 B 细胞表位预测工具在预测蛋白质抗原中的表位区域时表现出多种差异,这些差异会影响它们的性能。本文提出的用于表位预测的元学习方法通过整合多个预测工具的互补预测优势来组合这些预测工具。我们的实验结果表明,与单个表位预测器相比,组合方法具有优越的性能。