Larsen Jens Erik Pontoppidan, Lund Ole, Nielsen Morten
Center for Biological Sequence Analysis, BioCentrum-DTU, Building 208, Technical University of Denmark, DK-2800 Kgs, Lyngby, Denmark.
Immunome Res. 2006 Apr 24;2:2. doi: 10.1186/1745-7580-2-2.
B-cell epitopes are the sites of molecules that are recognized by antibodies of the immune system. Knowledge of B-cell epitopes may be used in the design of vaccines and diagnostics tests. It is therefore of interest to develop improved methods for predicting B-cell epitopes. In this paper, we describe an improved method for predicting linear B-cell epitopes.
In order to do this, three data sets of linear B-cell epitope annotated proteins were constructed. A data set was collected from the literature, another data set was extracted from the AntiJen database and a data sets of epitopes in the proteins of HIV was collected from the Los Alamos HIV database. An unbiased validation of the methods was made by testing on data sets on which they were neither trained nor optimized on. We have measured the performance in a non-parametric way by constructing ROC-curves.
The best single method for predicting linear B-cell epitopes is the hidden Markov model. Combining the hidden Markov model with one of the best propensity scale methods, we obtained the BepiPred method. When tested on the validation data set this method performs significantly better than any of the other methods tested. The server and data sets are publicly available at http://www.cbs.dtu.dk/services/BepiPred.
B细胞表位是免疫系统抗体所识别的分子位点。了解B细胞表位可用于疫苗设计和诊断测试。因此,开发改进的B细胞表位预测方法具有重要意义。在本文中,我们描述了一种改进的线性B细胞表位预测方法。
为此,构建了三个线性B细胞表位注释蛋白的数据集。一个数据集从文献中收集,另一个数据集从AntiJen数据库中提取,HIV蛋白中的表位数据集从洛斯阿拉莫斯HIV数据库中收集。通过在未进行训练或优化的数据集上进行测试,对这些方法进行了无偏验证。我们通过构建ROC曲线以非参数方式测量了性能。
预测线性B细胞表位的最佳单一方法是隐马尔可夫模型。将隐马尔可夫模型与最佳倾向评分方法之一相结合,我们得到了BepiPred方法。在验证数据集上进行测试时,该方法的性能明显优于所测试的任何其他方法。服务器和数据集可在http://www.cbs.dtu.dk/services/BepiPred上公开获取。