Mamitsuka H
C&C Media Research Laboratories, NEC Corporation, Kawasaki, Kanagawa, Japan.
Proteins. 1998 Dec 1;33(4):460-74. doi: 10.1002/(sici)1097-0134(19981201)33:4<460::aid-prot2>3.0.co;2-m.
The binding of a major histocompatibility complex (MHC) molecule to a peptide originating in an antigen is essential to recognizing antigens in immune systems, and it has proved to be important to use computers to predict the peptides that will bind to an MHC molecule. The purpose of this paper is twofold: First, we propose to apply supervised learning of hidden Markov models (HMMs) to this problem, which can surpass existing methods for the problem of predicting MHC-binding peptides. Second, we generate peptides that have high probabilities to bind to a certain MHC molecule, based on our proposed method using peptides binding to MHC molecules as a set of training data. From our experiments, in a type of cross-validation test, the discrimination accuracy of our supervised learning method is usually approximately 2-15% better than those of other methods, including backpropagation neural networks, which have been regarded as the most effective approach to this problem. Furthermore, using an HMM trained for HLA-A2, we present new peptide sequences that are provided with high binding probabilities by the HMM and that are thus expected to bind to HLA-A2 proteins. Peptide sequences not shown in this paper but with rather high binding probabilities can be obtained from the author.
主要组织相容性复合体(MHC)分子与源自抗原的肽的结合对于免疫系统识别抗原至关重要,并且事实证明使用计算机预测与MHC分子结合的肽很重要。本文的目的有两个:第一,我们提议将隐马尔可夫模型(HMM)的监督学习应用于这个问题,这可以超越现有的预测MHC结合肽问题的方法。第二,基于我们提出的方法,使用与MHC分子结合的肽作为一组训练数据,我们生成具有高概率与特定MHC分子结合的肽。从我们的实验来看,在一种交叉验证测试中,我们的监督学习方法的辨别准确率通常比其他方法(包括被认为是解决这个问题最有效方法的反向传播神经网络)高出约2 - 15%。此外,使用针对HLA - A2训练的HMM,我们展示了由HMM提供高结合概率从而有望与HLA - A2蛋白结合的新肽序列。本文未展示但具有相当高结合概率的肽序列可从作者处获得。