• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统地对肽-MHC 结合预测因子进行基准测试:从合成到天然加工的表位。

Systematically benchmarking peptide-MHC binding predictors: From synthetic to naturally processed epitopes.

机构信息

Global Research IT, Merck & Co., Inc., Boston, MA, United States of America.

出版信息

PLoS Comput Biol. 2018 Nov 8;14(11):e1006457. doi: 10.1371/journal.pcbi.1006457. eCollection 2018 Nov.

DOI:10.1371/journal.pcbi.1006457
PMID:30408041
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6224037/
Abstract

A number of machine learning-based predictors have been developed for identifying immunogenic T-cell epitopes based on major histocompatibility complex (MHC) class I and II binding affinities. Rationally selecting the most appropriate tool has been complicated by the evolving training data and machine learning methods. Despite the recent advances made in generating high-quality MHC-eluted, naturally processed ligandome, the reliability of new predictors on these epitopes has yet to be evaluated. This study reports the latest benchmarking on an extensive set of MHC-binding predictors by using newly available, untested data of both synthetic and naturally processed epitopes. 32 human leukocyte antigen (HLA) class I and 24 HLA class II alleles are included in the blind test set. Artificial neural network (ANN)-based approaches demonstrated better performance than regression-based machine learning and structural modeling. Among the 18 predictors benchmarked, ANN-based mhcflurry and nn_align perform the best for MHC class I 9-mer and class II 15-mer predictions, respectively, on binding/non-binding classification (Area Under Curves = 0.911). NetMHCpan4 also demonstrated comparable predictive power. Our customization of mhcflurry to a pan-HLA predictor has achieved similar accuracy to NetMHCpan. The overall accuracy of these methods are comparable between 9-mer and 10-mer testing data. However, the top methods deliver low correlations between the predicted versus the experimental affinities for strong MHC binders. When used on naturally processed MHC-ligands, tools that have been trained on elution data (NetMHCpan4 and MixMHCpred) shows better accuracy than pure binding affinity predictor. The variability of false prediction rate is considerable among HLA types and datasets. Finally, structure-based predictor of Rosetta FlexPepDock is less optimal compared to the machine learning approaches. With our benchmarking of MHC-binding and MHC-elution predictors using a comprehensive metrics, a unbiased view for establishing best practice of T-cell epitope predictions is presented, facilitating future development of methods in immunogenomics.

摘要

许多基于机器学习的预测因子已经被开发出来,用于根据主要组织相容性复合体 (MHC) Ⅰ类和Ⅱ类结合亲和力来鉴定免疫原性 T 细胞表位。由于不断发展的训练数据和机器学习方法,合理选择最合适的工具变得复杂。尽管最近在生成高质量 MHC 洗脱的天然加工配体组方面取得了进展,但新预测因子在这些表位上的可靠性仍有待评估。本研究报告了通过使用新的未经测试的合成和天然加工表位数据对广泛的 MHC 结合预测因子进行的最新基准测试。32 个人白细胞抗原 (HLA) I 类和 24 HLA II 类等位基因包含在盲测集中。基于人工神经网络 (ANN) 的方法表现出优于基于回归的机器学习和结构建模的更好性能。在基准测试的 18 个预测因子中,基于 ANN 的 mhcflurry 和 nn_align 分别在 MHC I 9-mer 和 II 15- mer 预测的结合/非结合分类中表现最佳(曲线下面积 = 0.911)。NetMHCpan4 也表现出相当的预测能力。我们将 mhcflurry 定制为泛 HLA 预测因子,与 NetMHCpan 达到了类似的准确性。这些方法在 9- mer 和 10- mer 测试数据之间的整体准确性相当。然而,对于强 MHC 结合物,这些方法的预测与实验亲和力之间的相关性较低。当用于天然加工的 MHC 配体时,基于洗脱数据训练的工具(NetMHCpan4 和 MixMHCpred)比纯结合亲和力预测因子具有更高的准确性。假预测率的变化在 HLA 类型和数据集之间相当大。最后,与机器学习方法相比,基于结构的 Rosetta FlexPepDock 预测器的效果较差。通过使用全面的指标对 MHC 结合和 MHC 洗脱预测因子进行基准测试,为建立 T 细胞表位预测的最佳实践提供了一个公正的视角,为免疫基因组学中方法的未来发展提供了便利。

相似文献

1
Systematically benchmarking peptide-MHC binding predictors: From synthetic to naturally processed epitopes.系统地对肽-MHC 结合预测因子进行基准测试:从合成到天然加工的表位。
PLoS Comput Biol. 2018 Nov 8;14(11):e1006457. doi: 10.1371/journal.pcbi.1006457. eCollection 2018 Nov.
2
Benchmarking predictions of MHC class I restricted T cell epitopes in a comprehensively studied model system.在一个经过全面研究的模型系统中对 MHC Ⅰ类限制性 T 细胞表位的预测进行基准测试。
PLoS Comput Biol. 2020 May 26;16(5):e1007757. doi: 10.1371/journal.pcbi.1007757. eCollection 2020 May.
3
Determination of a Predictive Cleavage Motif for Eluted Major Histocompatibility Complex Class II Ligands.鉴定洗脱的主要组织相容性复合体 II 类配体的预测性切割基序。
Front Immunol. 2018 Aug 6;9:1795. doi: 10.3389/fimmu.2018.01795. eCollection 2018.
4
Development and validation of an epitope prediction tool for swine (PigMatrix) based on the pocket profile method.基于口袋轮廓法的猪表位预测工具(PigMatrix)的开发与验证
BMC Bioinformatics. 2015 Sep 15;16:290. doi: 10.1186/s12859-015-0724-8.
5
Sequence conservation analysis and in silico human leukocyte antigen-peptide binding predictions for the Mtb72F and M72 tuberculosis candidate vaccine antigens.结核分枝杆菌72F(Mtb72F)和M72结核候选疫苗抗原的序列保守性分析及计算机模拟人白细胞抗原-肽结合预测
BMC Immunol. 2015 Oct 22;16:63. doi: 10.1186/s12865-015-0119-7.
6
Deep convolutional neural networks for pan-specific peptide-MHC class I binding prediction.用于 pan 特异性肽-MHC 类 I 结合预测的深度卷积神经网络。
BMC Bioinformatics. 2017 Dec 28;18(1):585. doi: 10.1186/s12859-017-1997-x.
7
NNAlign_MA; MHC Peptidome Deconvolution for Accurate MHC Binding Motif Characterization and Improved T-cell Epitope Predictions.NNAlign_MA;用于精确 MHC 结合基序特征描述和改进 T 细胞表位预测的 MHC 肽组分解。
Mol Cell Proteomics. 2019 Dec;18(12):2459-2477. doi: 10.1074/mcp.TIR119.001658. Epub 2019 Oct 2.
8
A comprehensive review and performance evaluation of bioinformatics tools for HLA class I peptide-binding prediction.HLA 类 I 肽结合预测的生物信息学工具的综合评价与性能评估。
Brief Bioinform. 2020 Jul 15;21(4):1119-1135. doi: 10.1093/bib/bbz051.
9
Toward the prediction of class I and II mouse major histocompatibility complex-peptide-binding affinity: in silico bioinformatic step-by-step guide using quantitative structure-activity relationships.迈向I类和II类小鼠主要组织相容性复合体-肽结合亲和力的预测:使用定量构效关系的计算机生物信息学逐步指南
Methods Mol Biol. 2007;409:227-45. doi: 10.1007/978-1-60327-118-9_16.
10
Predicting promiscuous antigenic T cell epitopes of Mycobacterium tuberculosis mymA operon proteins binding to MHC Class I and Class II molecules.预测结核分枝杆菌mymA操纵子蛋白与MHC I类和II类分子结合的混杂抗原性T细胞表位。
Infect Genet Evol. 2016 Oct;44:182-189. doi: 10.1016/j.meegid.2016.07.004. Epub 2016 Jul 5.

引用本文的文献

1
Computational methods and data resources for predicting tumor neoantigens.预测肿瘤新抗原的计算方法和数据资源
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf302.
2
Machine learning application to predict binding affinity between peptide containing non-canonical amino acids and HLA-A0201.用于预测含非标准氨基酸的肽与HLA - A0201之间结合亲和力的机器学习应用
PLoS One. 2025 Jun 27;20(6):e0314833. doi: 10.1371/journal.pone.0314833. eCollection 2025.
3
In Silico-Designed TGFβRI/TGFβRII Receptor Complex Peptide Inhibitors Exhibit Biological Activity In Vitro.

本文引用的文献

1
MHCflurry: Open-Source Class I MHC Binding Affinity Prediction.MHCflurry:开源的 I 类 MHC 结合亲和力预测。
Cell Syst. 2018 Jul 25;7(1):129-132.e4. doi: 10.1016/j.cels.2018.05.014. Epub 2018 Jun 27.
2
An automated benchmarking platform for MHC class II binding prediction methods.用于 MHC Ⅱ类结合预测方法的自动化基准测试平台。
Bioinformatics. 2018 May 1;34(9):1522-1528. doi: 10.1093/bioinformatics/btx820.
3
NetMHCpan-4.0: Improved Peptide-MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data.
计算机设计的转化生长因子β受体I/转化生长因子β受体II受体复合物肽抑制剂在体外具有生物活性。
J Cell Mol Med. 2025 Apr;29(8):e70548. doi: 10.1111/jcmm.70548.
4
Immunoinformatic strategy for developing multi-epitope subunit vaccine against Helicobacter pylori.开发抗幽门螺杆菌多表位亚单位疫苗的免疫信息学策略
PLoS One. 2025 Feb 7;20(2):e0318750. doi: 10.1371/journal.pone.0318750. eCollection 2025.
5
Machine learning application to predict binding affinity between peptide containing non-canonical amino acids and HLA0201.机器学习应用于预测含非标准氨基酸的肽与HLA0201之间的结合亲和力。
bioRxiv. 2024 Nov 21:2024.11.19.624425. doi: 10.1101/2024.11.19.624425.
6
Developing Vaccines in Pancreatic Adenocarcinoma: Trials and Tribulations.开发胰腺腺癌疫苗:试验与磨难。
Curr Oncol. 2024 Aug 23;31(9):4855-4884. doi: 10.3390/curroncol31090361.
7
Energy landscapes of peptide-MHC binding.多肽-MHC 结合的能量景观。
PLoS Comput Biol. 2024 Sep 3;20(9):e1012380. doi: 10.1371/journal.pcbi.1012380. eCollection 2024 Sep.
8
GIHP: Graph convolutional neural network based interpretable pan-specific HLA-peptide binding affinity prediction.GIHP:基于图卷积神经网络的可解释泛特异性HLA-肽结合亲和力预测
Front Genet. 2024 Jul 10;15:1405032. doi: 10.3389/fgene.2024.1405032. eCollection 2024.
9
MHCII-peptide presentation: an assessment of the state-of-the-art prediction methods.MHCII-肽呈递:对最新预测方法的评估
Front Immunol. 2024 Mar 12;15:1293706. doi: 10.3389/fimmu.2024.1293706. eCollection 2024.
10
Advances in Therapeutic Cancer Vaccines, Their Obstacles, and Prospects Toward Tumor Immunotherapy.治疗性癌症疫苗的进展、障碍及其在肿瘤免疫治疗中的前景
Mol Biotechnol. 2025 Apr;67(4):1336-1366. doi: 10.1007/s12033-024-01144-3. Epub 2024 Apr 16.
NetMHCpan-4.0:整合洗脱配体和肽结合亲和力数据的改进的肽与主要组织相容性复合体I类相互作用预测
J Immunol. 2017 Nov 1;199(9):3360-3368. doi: 10.4049/jimmunol.1700893. Epub 2017 Oct 4.
4
Deciphering HLA-I motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity.解析HLA肽组中的HLA-I基序可改善新抗原预测并识别调节HLA特异性的变构现象。
PLoS Comput Biol. 2017 Aug 23;13(8):e1005725. doi: 10.1371/journal.pcbi.1005725. eCollection 2017 Aug.
5
Mass Spectrometry Profiling of HLA-Associated Peptidomes in Mono-allelic Cells Enables More Accurate Epitope Prediction.单等位基因细胞中HLA相关肽组的质谱分析可实现更准确的表位预测。
Immunity. 2017 Feb 21;46(2):315-326. doi: 10.1016/j.immuni.2017.02.007.
6
Applications of Immunogenomics to Cancer.免疫基因组学在癌症中的应用。
Cell. 2017 Feb 9;168(4):600-612. doi: 10.1016/j.cell.2017.01.014.
7
The problem with neoantigen prediction.新抗原预测的问题。
Nat Biotechnol. 2017 Feb 8;35(2):97. doi: 10.1038/nbt.3800.
8
Direct identification of clinically relevant neoepitopes presented on native human melanoma tissue by mass spectrometry.通过质谱直接鉴定天然人黑色素瘤组织上呈现的临床相关新表位。
Nat Commun. 2016 Nov 21;7:13404. doi: 10.1038/ncomms13404.
9
A large fraction of HLA class I ligands are proteasome-generated spliced peptides.大量的 HLA Ⅰ类配体是蛋白酶体生成的剪接肽。
Science. 2016 Oct 21;354(6310):354-358. doi: 10.1126/science.aaf4384. Epub 2016 Oct 20.
10
Mass spectrometric analysis of the HLA class I peptidome of melanoma cell lines as a promising tool for the identification of putative tumor-associated HLA epitopes.黑色素瘤细胞系HLA I类肽组的质谱分析作为鉴定假定肿瘤相关HLA表位的一种有前景的工具。
Cancer Immunol Immunother. 2016 Nov;65(11):1377-1393. doi: 10.1007/s00262-016-1897-3. Epub 2016 Sep 6.