Faculty of Pharmacy, Medical University - Sofia, Sofia, Bulgaria.
Methods Mol Biol. 2023;2673:289-303. doi: 10.1007/978-1-0716-3239-0_20.
Prediction of bacterial immunogens is a prerequisite for the process of vaccine development through reverse vaccinology. The application of in silico methods allows significant reduction in time and cost for the discovery of potential vaccine candidates among proteins of a bacterial species. The steps in the prediction algorithm include collection of protein sequence datasets of known bacterial immunogens and non-immunogens, data preprocessing to transform the protein sequences into numerical matrices suitable for use as training and test sets for various machine learning methods, and derivation of predictive models. The performance of the derived models is evaluated by means of classification metrics.In this chapter, we present a protocol for predicting bacterial immunogenicity by applying machine learning methods. The protocol describes the process of model development from data collection and manipulation to training and validation of the derived models.
预测细菌免疫原是通过反向疫苗学开发疫苗的过程的前提。通过计算方法的应用,可以在细菌蛋白质中发现潜在疫苗候选物的过程中显著减少时间和成本。预测算法的步骤包括收集已知细菌免疫原和非免疫原的蛋白质序列数据集、数据预处理将蛋白质序列转换为适合各种机器学习方法的训练和测试集的数值矩阵,以及推导预测模型。通过分类指标来评估所推导模型的性能。在本章中,我们通过应用机器学习方法来介绍预测细菌免疫原性的协议。该协议描述了从数据收集和处理到训练和验证所推导模型的模型开发过程。