Eccleston R Charlotte, Wan Shunzhou, Dalchau Neil, Coveney Peter V
Centre for Computational Science, Department of Chemistry, University College London, London, United Kingdom.
Microsoft Research, Cambridge, United Kingdom.
Front Immunol. 2017 Jul 10;8:797. doi: 10.3389/fimmu.2017.00797. eCollection 2017.
T lymphocytes are stimulated when they recognize short peptides bound to class I proteins of the major histocompatibility complex (MHC) protein, as peptide-MHC complexes. Due to the diversity in T-cell receptor (TCR) molecules together with both the peptides and MHC proteins they bind to, it has been difficult to design vaccines and treatments based on these interactions. Machine learning has made some progress in trying to predict the immunogenicity of peptide sequences in the context of specific MHC class I alleles but, as such approaches cannot integrate temporal information and lack explanatory power, their scope will always be limited. Here, we advocate a mechanistic description of antigen presentation and TCR activation which is explanatory, predictive, and quantitative, drawing on modeling approaches that collectively span several length and time scales, being capable of furnishing reliable biological descriptions that are difficult for experimentalists to provide. It is a form of multiscale systems biology. We propose the use of chemical rate equations to describe the time evolution of the foreign and host proteins to explain how the original proteins end up being presented on the cell surface as peptide fragments, while we invoke molecular dynamics to describe the key binding processes on the molecular level, including those of peptide-MHC complexes with TCRs which lie at the heart of the immune response. On each level, complementary methods based on machine learning are available, and we discuss the relationship between these divergent approaches. The pursuit of predictive mechanistic modeling approaches requires experimentalists to adapt their work so as to acquire, store, and expose data that can be used to verify and validate such models.
当T淋巴细胞识别与主要组织相容性复合体(MHC)I类蛋白结合的短肽(即肽-MHC复合体)时,它们就会被激活。由于T细胞受体(TCR)分子以及它们所结合的肽和MHC蛋白具有多样性,基于这些相互作用来设计疫苗和治疗方法一直很困难。机器学习在尝试预测特定MHC I类等位基因背景下肽序列的免疫原性方面取得了一些进展,但是,由于此类方法无法整合时间信息且缺乏解释力,其应用范围始终有限。在此,我们提倡对抗原呈递和TCR激活进行一种解释性、预测性和定量性的机制描述,借鉴跨越多个长度和时间尺度的建模方法,能够提供实验人员难以提供的可靠生物学描述。这是一种多尺度系统生物学形式。我们建议使用化学速率方程来描述外源蛋白和宿主蛋白的时间演化,以解释原始蛋白最终如何以肽片段的形式呈现在细胞表面,同时我们调用分子动力学来描述分子水平上的关键结合过程,包括肽-MHC复合体与TCR的结合过程,而这正是免疫反应的核心。在每个层面上,都有基于机器学习的互补方法,我们将讨论这些不同方法之间的关系。追求预测性机制建模方法要求实验人员调整他们的工作,以便获取、存储和公开可用于验证和确认此类模型的数据。