Department of Health technology, Technical University of Denmark, DK-2800 Lyngby, Denmark.
The Jenner Institute, Nuffield Department of Medicine, University of Oxford, OX37DQ, UK.
Expert Rev Proteomics. 2022 Feb;19(2):77-88. doi: 10.1080/14789450.2022.2064278. Epub 2022 Apr 21.
The comprehensive collection of peptides presented by major histocompatibility complex (MHC) molecules on the cell surface is collectively known as the immunopeptidome. The analysis and interpretation of such data sets holds great promise for furthering our understanding of basic immunology and adaptive immune activation and regulation, and for direct rational discovery of T cell antigens and the design of T-cell-based therapeutics and vaccines. These applications are, however, challenged by the complex nature of immunopeptidome data.
Here, we describe the benefits and shortcomings of applying liquid chromatography-tandem mass spectrometry (MS) to obtain large-scale immunopeptidome data sets and illustrate how the accurate analysis and optimal interpretation of such data is reliant on the availability of refined and highly optimized machine learning approaches.
Further, we demonstrate how the accuracy of immunoinformatics prediction methods within the field of MHC antigen presentation has benefited greatly from the availability of MS-immunopeptidomics data, and exemplify how optimal antigen discovery is best performed in a synergistic combination of MS experiments and such in silico models trained on large-scale immunopeptidomics data.
主要组织相容性复合体 (MHC) 分子在细胞表面呈现的肽的综合集合被统称为免疫肽组。分析和解释此类数据集有望进一步加深我们对基础免疫学和适应性免疫激活和调节的理解,并直接合理地发现 T 细胞抗原,设计基于 T 细胞的治疗方法和疫苗。然而,这些应用受到免疫肽组数据的复杂性质的挑战。
在这里,我们描述了应用液相色谱-串联质谱 (MS) 获得大规模免疫肽组数据集的优势和缺点,并说明了如何准确分析和优化此类数据的解释依赖于精细和高度优化的机器学习方法的可用性。
此外,我们展示了 MHC 抗原呈递领域中免疫信息学预测方法的准确性如何极大地受益于 MS 免疫肽组学数据的可用性,并举例说明了如何通过 MS 实验和基于大规模免疫肽组学数据训练的此类计算模型的协同组合来最佳地进行最佳抗原发现。