Department of Biotechnology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
Department of Biotechnology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
Int Immunopharmacol. 2020 Jan;78:106020. doi: 10.1016/j.intimp.2019.106020. Epub 2019 Nov 24.
This study was aimed to introduce a novel algorithm for determining linear B- and T-cell epitopes from Crimean-Congo haemorrhagic fever virus (CCHFV) antigens. To this end, 387 approved B- and T-cell epitopes, as well as 331 non-epitope peptides from different serotypes of the virus were collected from IEDB database for generating of the train datasets. After that, the physicochemical properties of the epitopes were expressed as the numeric vectors using Chou's pseudo amino acid composition method. The vectors then were used for training of four machine learning algorithms including artificial neural network (ANN), k-nearest neighbors (kNN), support vector machine (SVM) and Random forest (RF). The results confirmed that ANN was the most accurate algorithm for discriminating between the epitopes and non-epitopes with the accuracy of 0.90. Furthermore, for evaluating the performance of the ANN algorithm, an epitope prediction challenge was performed to a random peptide library from envelopment polyprotein of CCHFV. Moreover, the efficiency of the predicted epitopes in term of antigenicity and affinity to MHC-II were compared to the predicted epitope by standard epitope prediction tools based on their VaxiJen 2.0 score and molecular docking outputs. Finally, the ability of the screened epitopes to stimulation of humoral and cellular responses was evaluated by an in silico immune simulation process thought C-Immsim 10.1 server. The results confirmed that this method has more accuracy for epitope-mapping than the standard tools and could considered as an effective algorithm to develop a serotype independent one-click automated epitope based vaccine design tool.
本研究旨在介绍一种从克里米亚-刚果出血热病毒(CCHFV)抗原中确定线性 B 细胞和 T 细胞表位的新算法。为此,从 IEDB 数据库中收集了 387 个已批准的 B 细胞和 T 细胞表位以及 331 个来自不同血清型的非表位肽,用于生成训练数据集。之后,使用 Chou 的伪氨基酸组成方法将表位的理化性质表示为数值向量。然后,将这些向量用于训练四种机器学习算法,包括人工神经网络(ANN)、k-最近邻(kNN)、支持向量机(SVM)和随机森林(RF)。结果证实,ANN 是区分表位和非表位的最准确算法,准确率为 0.90。此外,为了评估 ANN 算法的性能,对 CCHFV 包膜多蛋白的随机肽文库进行了表位预测挑战。此外,通过基于其 VaxiJen 2.0 评分和分子对接输出的标准表位预测工具,比较了预测表位在抗原性和与 MHC-II 的亲和力方面的效率。最后,通过 C-Immsim 10.1 服务器的计算免疫模拟过程评估了筛选出的表位刺激体液和细胞反应的能力。结果证实,该方法在表位映射方面比标准工具具有更高的准确性,可被视为开发一种与血清型无关的一键式自动基于表位疫苗设计工具的有效算法。