Bukhari Syed Nisar Hussain, Ogudo Kingsley A
National Institute of Electronics and Information Technology (NIELIT), Srinagar, J&K, India.
Department of Electrical & Electronics Engineering Faculty of Engineering and the Built Environment, University of Johannesburg, Johannesburg, South Africa.
PeerJ Comput Sci. 2024 Oct 10;10:e2319. doi: 10.7717/peerj-cs.2319. eCollection 2024.
Antigenic peptides (APs), also known as T-cell epitopes (TCEs), represent the immunogenic segment of pathogens capable of inducing an immune response, making them potential candidates for epitope-based vaccine (EBV) design. Traditional wet lab methods for identifying TCEs are expensive, challenging, and time-consuming. Alternatively, computational approaches employing machine learning (ML) techniques offer a faster and more cost-effective solution. In this study, we present a robust XGBoost ML model for predicting TCEs of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus as potential vaccine candidates. The peptide sequences comprising TCEs and non-TCEs retrieved from Immune Epitope Database Repository (IEDB) were subjected to feature extraction process to extract their physicochemical properties for model training. Upon evaluation using a test dataset, the model achieved an impressive accuracy of 97.6%, outperforming other ML classifiers. Employing a five-fold cross-validation a mean accuracy of 97.58% was recorded, indicating consistent and linear performance across all iterations. While the predicted epitopes show promise as vaccine candidates for SARS-CoV-2, further scientific examination through and studies is essential to validate their suitability.
抗原肽(APs),也称为T细胞表位(TCEs),是病原体中能够诱导免疫反应的免疫原性片段,使其成为基于表位的疫苗(EBV)设计的潜在候选物。传统的用于鉴定TCEs的湿实验室方法昂贵、具有挑战性且耗时。或者,采用机器学习(ML)技术的计算方法提供了一种更快且更具成本效益的解决方案。在本研究中,我们提出了一种强大的XGBoost ML模型,用于预测严重急性呼吸综合征冠状病毒2(SARS-CoV-2)病毒的TCEs作为潜在的疫苗候选物。从免疫表位数据库存储库(IEDB)中检索到的包含TCEs和非TCEs的肽序列经过特征提取过程,以提取其物理化学性质用于模型训练。在使用测试数据集进行评估时,该模型达到了令人印象深刻的97.6%的准确率,优于其他ML分类器。采用五折交叉验证记录的平均准确率为97.58%,表明在所有迭代中性能一致且呈线性。虽然预测的表位显示出作为SARS-CoV-2疫苗候选物的前景,但通过[具体实验名称1]和[具体实验名称2]研究进行进一步的科学检验对于验证其适用性至关重要。