Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Technical University of Denmark, Kongens Lyngby, Denmark.
Immunity. 2023 Jul 11;56(7):1681-1698.e13. doi: 10.1016/j.immuni.2023.05.009. Epub 2023 Jun 9.
CD4+ T cell responses are exquisitely antigen specific and directed toward peptide epitopes displayed by human leukocyte antigen class II (HLA-II) on antigen-presenting cells. Underrepresentation of diverse alleles in ligand databases and an incomplete understanding of factors affecting antigen presentation in vivo have limited progress in defining principles of peptide immunogenicity. Here, we employed monoallelic immunopeptidomics to identify 358,024 HLA-II binders, with a particular focus on HLA-DQ and HLA-DP. We uncovered peptide-binding patterns across a spectrum of binding affinities and enrichment of structural antigen features. These aspects underpinned the development of context-aware predictor of T cell antigens (CAPTAn), a deep learning model that predicts peptide antigens based on their affinity to HLA-II and full sequence of their source proteins. CAPTAn was instrumental in discovering prevalent T cell epitopes from bacteria in the human microbiome and a pan-variant epitope from SARS-CoV-2. Together CAPTAn and associated datasets present a resource for antigen discovery and the unraveling genetic associations of HLA alleles with immunopathologies.
CD4+ T 细胞反应具有高度的抗原特异性,针对的是抗原呈递细胞上人类白细胞抗原 II 类 (HLA-II) 展示的肽表位。配体数据库中不同等位基因的代表性不足以及对体内抗原呈递影响因素的不完全了解,限制了对肽免疫原性原则的定义。在这里,我们采用单等位基因免疫肽组学来鉴定 358,024 种 HLA-II 结合物,特别关注 HLA-DQ 和 HLA-DP。我们发现了一系列结合亲和力和结构抗原特征富集的肽结合模式。这些方面为基于 HLA-II 亲和力和源蛋白全长序列预测肽抗原的深度学习模型——T 细胞抗原的上下文感知预测器 (CAPTAn) 的开发提供了支持。CAPTAn 有助于从人类微生物组中的细菌中发现常见的 T 细胞表位和 SARS-CoV-2 的泛变体表位。CAPTAn 和相关数据集共同提供了抗原发现的资源,并揭示了 HLA 等位基因与免疫病理学的遗传关联。