Center for Biological Sequence Analysis, Department for Systems Biology, Technical University of Denmark, Lyngby, Denmark.
PLoS One. 2010 Feb 17;5(2):e9272. doi: 10.1371/journal.pone.0009272.
Successful predictions of peptide MHC binding typically require a large set of binding data for the specific MHC molecule that is examined. Structure based prediction methods promise to circumvent this requirement by evaluating the physical contacts a peptide can make with an MHC molecule based on the highly conserved 3D structure of peptide:MHC complexes. While several such methods have been described before, most are not publicly available and have not been independently tested for their performance. We here implemented and evaluated three prediction methods for MHC class II molecules: statistical potentials derived from the analysis of known protein structures; energetic evaluation of different peptide snapshots in a molecular dynamics simulation; and direct analysis of contacts made in known 3D structures of peptide:MHC complexes. These methods are ab initio in that they require structural data of the MHC molecule examined, but no specific peptide:MHC binding data. Moreover, these methods retain the ability to make predictions in a sufficiently short time scale to be useful in a real world application, such as screening a whole proteome for candidate binding peptides. A rigorous evaluation of each methods prediction performance showed that these are significantly better than random, but still substantially lower than the best performing sequence based class II prediction methods available. While the approaches presented here were developed independently, we have chosen to present our results together in order to support the notion that generating structure based predictions of peptide:MHC binding without using binding data is unlikely to give satisfactory results.
成功预测肽-MHC 结合通常需要大量针对所研究的特定 MHC 分子的结合数据。基于结构的预测方法有望通过基于肽-MHC 复合物的高度保守的 3D 结构来评估肽与 MHC 分子可以形成的物理接触来规避这一要求。虽然之前已经描述了几种这样的方法,但大多数方法不是公开的,并且没有对其性能进行独立测试。我们在这里实现并评估了三种用于 MHC 类 II 分子的预测方法:基于已知蛋白质结构分析得出的统计势;分子动力学模拟中不同肽快照的能量评估;以及对已知肽-MHC 复合物 3D 结构中形成的接触的直接分析。这些方法是从头开始的,因为它们需要所研究的 MHC 分子的结构数据,但不需要特定的肽-MHC 结合数据。此外,这些方法保留了在足够短的时间尺度内进行预测的能力,在实际应用中非常有用,例如筛选整个蛋白质组以寻找候选结合肽。对每种方法预测性能的严格评估表明,这些方法明显优于随机方法,但仍远低于现有性能最佳的基于序列的 II 类预测方法。虽然这里提出的方法是独立开发的,但我们选择将我们的结果一起呈现,以支持这样一种观点,即不使用结合数据生成基于结构的肽-MHC 结合预测不太可能产生令人满意的结果。