Ludwig Institute for Cancer Research, University of Lausanne, Agora Center Bugnon 25A, 1005 Lausanne, Switzerland; Department of Oncology, Centre hospitalier universitaire vaudois (CHUV), Rue du Bugnon 46, 1005 Lausanne, Switzerland; Agora Cancer Research Centre, 1011 Lausanne, Switzerland; SIB Swiss Institute of Bioinformatics, Quartier Sorge, Bâtiment Amphipôle, 1015 Lausanne, Switzerland.
Ludwig Institute for Cancer Research, University of Lausanne, Agora Center Bugnon 25A, 1005 Lausanne, Switzerland; Department of Oncology, Centre hospitalier universitaire vaudois (CHUV), Rue du Bugnon 46, 1005 Lausanne, Switzerland; Agora Cancer Research Centre, 1011 Lausanne, Switzerland.
Immunity. 2023 Nov 14;56(11):2650-2663.e6. doi: 10.1016/j.immuni.2023.09.002. Epub 2023 Oct 9.
The accurate selection of neoantigens that bind to class I human leukocyte antigen (HLA) and are recognized by autologous T cells is a crucial step in many cancer immunotherapy pipelines. We reprocessed whole-exome sequencing and RNA sequencing (RNA-seq) data from 120 cancer patients from two external large-scale neoantigen immunogenicity screening assays combined with an in-house dataset of 11 patients and identified 46,017 somatic single-nucleotide variant mutations and 1,781,445 neo-peptides, of which 212 mutations and 178 neo-peptides were immunogenic. Beyond features commonly used for neoantigen prioritization, factors such as the location of neo-peptides within protein HLA presentation hotspots, binding promiscuity, and the role of the mutated gene in oncogenicity were predictive for immunogenicity. The classifiers accurately predicted neoantigen immunogenicity across datasets and improved their ranking by up to 30%. Besides insights into machine learning methods for neoantigen ranking, we have provided homogenized datasets valuable for developing and benchmarking companion algorithms for neoantigen-based immunotherapies.
准确选择与人类白细胞抗原(HLA)结合并被自体 T 细胞识别的新抗原是许多癌症免疫疗法的关键步骤。我们重新处理了来自两个外部大规模新抗原免疫原性筛选测定的 120 名癌症患者的全外显子组测序和 RNA 测序(RNA-seq)数据,结合我们的 11 名患者的内部数据集,鉴定出 46,017 个体细胞单核苷酸变异和 1,781,445 个新肽,其中 212 个突变和 178 个新肽具有免疫原性。除了常用于新抗原优先级排序的特征外,新肽在蛋白质 HLA 呈递热点内的位置、结合多态性以及突变基因在致癌性中的作用等因素也可预测免疫原性。分类器在不同数据集上准确地预测了新抗原的免疫原性,并将其排名提高了高达 30%。除了对新抗原排序的机器学习方法的见解外,我们还提供了均质化的数据集,这些数据集对于开发和基准测试基于新抗原的免疫疗法的配套算法非常有价值。