Departamento de Bioquímica e Imunologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Avenida Antônio Carlos, 6627, Belo Horizonte, Brazil.
Departamento de Ciências da Computação, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
Sci Rep. 2018 Oct 8;8(1):14904. doi: 10.1038/s41598-018-33298-x.
Epitope identification is essential for developing effective antibodies that can detect and neutralize bioactive proteins. Computational prediction is a valuable and time-saving alternative for experimental identification. Current computational methods for epitope prediction are underused and undervalued due to their high false positive rate. In this work, we targeted common properties of linear B-cell epitopes identified in an individual protein class (metalloendopeptidases) and introduced an alternative method to reduce the false positive rate and increase accuracy, proposing to restrict predictive models to a single specific protein class. For this purpose, curated epitope sequences from metalloendopeptidases were transformed into frame-shifted Kmers (3 to 15 amino acid residues long). These Kmers were decomposed into a matrix of biochemical attributes and used to train a decision tree classifier. The resulting prediction model showed a lower false positive rate and greater area under the curve when compared to state-of-the-art methods. Our predictions were used for synthesizing peptides mimicking the predicted epitopes for immunization of mice. A predicted linear epitope that was previously undetected by an experimental immunoassay was able to induce neutralizing-antibody production in mice. Therefore, we present an improved prediction alternative and show that computationally identified epitopes can go undetected during experimental mapping.
表位鉴定对于开发能够检测和中和生物活性蛋白的有效抗体至关重要。计算预测是实验鉴定的一种有价值且节省时间的替代方法。由于其高假阳性率,当前用于表位预测的计算方法未得到充分利用和重视。在这项工作中,我们针对在单个蛋白质类别(金属内肽酶)中鉴定出的线性 B 细胞表位的共同特性,引入了一种替代方法来降低假阳性率并提高准确性,建议将预测模型限制在单个特定的蛋白质类别。为此,从金属内肽酶中精心挑选的表位序列被转化为移位 Kmer(3 到 15 个氨基酸残基长)。这些 Kmer 被分解成生化属性矩阵,并用于训练决策树分类器。与最先进的方法相比,所得到的预测模型显示出更低的假阳性率和更大的曲线下面积。我们的预测被用于合成模拟预测表位的肽,用于免疫小鼠。一个以前在实验免疫测定中未检测到的预测线性表位能够在小鼠中诱导产生中和抗体。因此,我们提出了一种改进的预测替代方法,并表明计算鉴定的表位在实验映射过程中可能会被忽略。