Division of Infectious Diseases and Applied Immunology, The Institute of Medical Sciences Research Hospital, The University of Tokyo, Tokyo, Japan.
Front Immunol. 2019 Apr 16;10:827. doi: 10.3389/fimmu.2019.00827. eCollection 2019.
Immunodominant T cell epitopes preferentially targeted in multiple individuals are the critical element of successful vaccines and targeted immunotherapies. However, the underlying principles of this "convergence" of adaptive immunity among different individuals remain poorly understood. To quantitatively describe epitope immunogenicity, here we propose a supervised machine learning framework generating probabilistic estimates of immunogenicity, termed "immunogenicity scores," based on the numerical features computed through sequence-based simulation approximating the molecular scanning process of peptides presented onto major histocompatibility complex (MHC) by the human T cell receptor (TCR) repertoire. Notably, overlapping sets of intermolecular interaction parameters were commonly utilized in MHC-I and MHC-II prediction. Moreover, a similar simulation of individual TCR-peptide interaction using the same set of interaction parameters yielded correlates of TCR affinity. Pathogen-derived epitopes and tumor-associated epitopes with positive T cell reactivity generally had higher immunogenicity scores than non-immunogenic counterparts, whereas thymically expressed self-epitopes were assigned relatively low scores regardless of their immunogenicity annotation. Immunogenicity score dynamics among single amino acid mutants delineated the landscape of position- and residue-specific mutational impacts. Simulation of position-specific immunogenicity score dynamics detected residues with high escape potential in multiple epitopes, consistent with known escape mutations in the literature. This study indicates that targeting of epitopes by human adaptive immunity is to some extent directed by defined thermodynamic principles. The proposed framework also has a practical implication in that it may enable to more efficiently prioritize epitope candidates highly prone to T cell recognition in multiple individuals, warranting prospective validation across different cohorts.
免疫优势 T 细胞表位是成功疫苗和靶向免疫疗法的关键要素,这些表位优先被多个个体中的适应性免疫靶向。然而,不同个体之间适应性免疫“收敛”的基本原理仍知之甚少。为了定量描述表位免疫原性,我们提出了一个监督机器学习框架,基于通过基于序列的模拟计算得出的数值特征,生成免疫原性的概率估计,称为“免疫原性评分”,该模拟过程模拟了人类 T 细胞受体(TCR)库对主要组织相容性复合物(MHC)呈递的肽的分子扫描过程。值得注意的是,MHC-I 和 MHC-II 预测中普遍使用了重叠的分子间相互作用参数集。此外,使用相同的相互作用参数集模拟单个 TCR-肽相互作用,可产生 TCR 亲和力的相关性。具有阳性 T 细胞反应的病原体衍生表位和肿瘤相关表位通常具有更高的免疫原性评分,而无论其免疫原性注释如何,胸腺表达的自身表位的评分相对较低。单个氨基酸突变体之间的免疫原性评分动态描绘了位置和残基特异性突变影响的景观。位置特异性免疫原性评分动态的模拟检测到了多个表位中具有高逃逸潜力的残基,与文献中已知的逃逸突变一致。这项研究表明,人类适应性免疫对表位的靶向在某种程度上是由定义明确的热力学原理指导的。该框架还具有实际意义,因为它可以更有效地优先选择在多个个体中容易被 T 细胞识别的表位候选物,这需要在不同队列中进行前瞻性验证。