Jurtz Vanessa, Paul Sinu, Andreatta Massimo, Marcatili Paolo, Peters Bjoern, Nielsen Morten
Department of Bio and Health Informatics, Technical University of Denmark, DK-2800 Lyngby, Denmark.
Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, CA 92037; and.
J Immunol. 2017 Nov 1;199(9):3360-3368. doi: 10.4049/jimmunol.1700893. Epub 2017 Oct 4.
Cytotoxic T cells are of central importance in the immune system's response to disease. They recognize defective cells by binding to peptides presented on the cell surface by MHC class I molecules. Peptide binding to MHC molecules is the single most selective step in the Ag-presentation pathway. Therefore, in the quest for T cell epitopes, the prediction of peptide binding to MHC molecules has attracted widespread attention. In the past, predictors of peptide-MHC interactions have primarily been trained on binding affinity data. Recently, an increasing number of MHC-presented peptides identified by mass spectrometry have been reported containing information about peptide-processing steps in the presentation pathway and the length distribution of naturally presented peptides. In this article, we present NetMHCpan-4.0, a method trained on binding affinity and eluted ligand data leveraging the information from both data types. Large-scale benchmarking of the method demonstrates an increase in predictive performance compared with state-of-the-art methods when it comes to identification of naturally processed ligands, cancer neoantigens, and T cell epitopes.
细胞毒性T细胞在免疫系统对疾病的反应中至关重要。它们通过与MHC I类分子呈递在细胞表面的肽段结合来识别缺陷细胞。肽段与MHC分子的结合是抗原呈递途径中最具选择性的单一步骤。因此,在寻找T细胞表位的过程中,预测肽段与MHC分子的结合受到了广泛关注。过去,肽段与MHC相互作用的预测器主要是基于结合亲和力数据进行训练的。最近,越来越多通过质谱鉴定的MHC呈递肽段被报道,这些肽段包含了呈递途径中肽段加工步骤以及天然呈递肽段长度分布的信息。在本文中,我们介绍了NetMHCpan-4.0,这是一种基于结合亲和力和洗脱配体数据进行训练的方法,利用了这两种数据类型的信息。该方法的大规模基准测试表明,在识别天然加工的配体、癌症新抗原和T细胞表位方面,与现有最先进的方法相比,其预测性能有所提高。