Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark.
PLoS One. 2010 Dec 30;5(12):e15877. doi: 10.1371/journal.pone.0015877.
Major Histocompatibility class II (MHC-II) molecules sample peptides from the extracellular space allowing the immune system to detect the presence of foreign microbes from this compartment. Prediction of MHC class II ligands is complicated by the open binding cleft of the MHC class II molecule, allowing binding of peptides extending out of the binding groove. Furthermore, only a few HLA-DR alleles have been characterized with a sufficient number of peptides (100-200 peptides per allele) to derive accurate description of their binding motif. Little work has been performed characterizing structural properties of MHC class II ligands. Here, we perform one such large-scale analysis. A large set of SYFPEITHI MHC class II ligands covering more than 20 different HLA-DR molecules was analyzed in terms of their secondary structure and surface exposure characteristics in the context of the native structure of the corresponding source protein. We demonstrated that MHC class II ligands are significantly more exposed and have significantly more coil content than other peptides in the same protein with similar predicted binding affinity. We next exploited this observation to derive an improved prediction method for MHC class II ligands by integrating prediction of MHC- peptide binding with prediction of surface exposure and protein secondary structure. This combined prediction method was shown to significantly outperform the state-of-the-art MHC class II peptide binding prediction method when used to identify MHC class II ligands. We also tried to integrate N- and O-glycosylation in our prediction methods but this additional information was found not to improve prediction performance. In summary, these findings strongly suggest that local structural properties influence antigen processing and/or the accessibility of peptides to the MHC class II molecule.
主要组织相容性复合体 II (MHC-II) 分子从细胞外空间中采样肽段,使免疫系统能够从该隔室检测外来微生物的存在。MHC Ⅱ类配体的预测受到 MHC Ⅱ类分子开放结合腔的复杂性的影响,允许结合延伸出结合槽的肽段。此外,只有少数 HLA-DR 等位基因具有足够数量的肽段(每个等位基因 100-200 个肽段)来准确描述其结合基序。对 MHC Ⅱ类配体的结构特性进行了很少的研究。在这里,我们进行了这样的大规模分析。在天然结构的背景下,对涵盖 20 多种不同 HLA-DR 分子的大量 SYFPEITHI MHC Ⅱ类配体进行了二级结构和表面暴露特性的分析。我们证明,与同一蛋白质中具有相似预测结合亲和力的其他肽段相比,MHC Ⅱ类配体的暴露程度显著更高,且具有更多的无规卷曲含量。接下来,我们利用这一观察结果,通过整合 MHC-肽结合预测、表面暴露预测和蛋白质二级结构预测,开发了一种改进的 MHC Ⅱ类配体预测方法。当用于识别 MHC Ⅱ类配体时,该联合预测方法的性能明显优于最先进的 MHC Ⅱ类肽结合预测方法。我们还尝试在我们的预测方法中整合 N-和 O-糖基化,但发现这些额外信息并不能提高预测性能。总之,这些发现强烈表明局部结构特性会影响抗原加工和/或肽段与 MHC Ⅱ类分子的可及性。