Liu Wen, Meng Xiangshan, Xu Qiqi, Flower Darren R, Li Tongbin
Department of Neuroscience, University of Minnesota, Minneapolis, MN 55455, USA.
BMC Bioinformatics. 2006 Mar 31;7:182. doi: 10.1186/1471-2105-7-182.
The binding between peptide epitopes and major histocompatibility complex proteins (MHCs) is an important event in the cellular immune response. Accurate prediction of the binding between short peptides and the MHC molecules has long been a principal challenge for immunoinformatics. Recently, the modeling of MHC-peptide binding has come to emphasize quantitative predictions: instead of categorizing peptides as "binders" or "non-binders" or as "strong binders" and "weak binders", recent methods seek to make predictions about precise binding affinities.
We developed a quantitative support vector machine regression (SVR) approach, called SVRMHC, to model peptide-MHC binding affinities. As a non-linear method, SVRMHC was able to generate models that out-performed existing linear models, such as the "additive method". By adopting a new "11-factor encoding" scheme, SVRMHC takes into account similarities in the physicochemical properties of the amino acids constituting the input peptides. When applied to MHC-peptide binding data for three mouse class I MHC alleles, the SVRMHC models produced more accurate predictions than those produced previously. Furthermore, comparisons based on Receiver Operating Characteristic (ROC) analysis indicated that SVRMHC was able to out-perform several prominent methods in identifying strongly binding peptides.
As a method with demonstrated performance in the quantitative modeling of MHC-peptide binding and in identifying strong binders, SVRMHC is a promising immunoinformatics tool with not inconsiderable future potential.
肽表位与主要组织相容性复合体蛋白(MHC)之间的结合是细胞免疫反应中的一个重要事件。长期以来,准确预测短肽与MHC分子之间的结合一直是免疫信息学的主要挑战。最近,MHC-肽结合的建模开始强调定量预测:最近的方法不再将肽分类为“结合物”或“非结合物”,也不再分类为“强结合物”和“弱结合物”,而是试图对精确的结合亲和力进行预测。
我们开发了一种定量支持向量机回归(SVR)方法,称为SVRMHC,用于对肽-MHC结合亲和力进行建模。作为一种非线性方法,SVRMHC能够生成优于现有线性模型(如“加法方法”)的模型。通过采用新的“11因子编码”方案,SVRMHC考虑了构成输入肽的氨基酸的物理化学性质的相似性。当应用于三种小鼠I类MHC等位基因的MHC-肽结合数据时,SVRMHC模型产生的预测比以前更准确。此外,基于接受者操作特征(ROC)分析的比较表明,SVRMHC在识别强结合肽方面能够优于几种突出的方法。
作为一种在MHC-肽结合的定量建模和识别强结合物方面具有已证明性能的方法,SVRMHC是一种有前途的免疫信息学工具,具有相当大的未来潜力。