epiTCR：一种高灵敏度的 TCR-肽结合预测因子。

epiTCR: a highly sensitive predictor for TCR-peptide binding.

机构信息

Medical Genetics Institute, Ho Chi Minh City, Vietnam.

NexCalibur Therapeutics, Wilmington, DE, United States.

出版信息

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad284.

DOI:10.1093/bioinformatics/btad284

PMID:37094220

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10159657/

Abstract

MOTIVATION

Predicting the binding between T-cell receptor (TCR) and peptide presented by human leucocyte antigen molecule is a highly challenging task and a key bottleneck in the development of immunotherapy. Existing prediction tools, despite exhibiting good performance on the datasets they were built with, suffer from low true positive rates when used to predict epitopes capable of eliciting T-cell responses in patients. Therefore, an improved tool for TCR-peptide prediction built upon a large dataset combining existing publicly available data is still needed.

RESULTS

We collected data from five public databases (IEDB, TBAdb, VDJdb, McPAS-TCR, and 10X) to form a dataset of >3 million TCR-peptide pairs, 3.27% of which were binding interactions. We proposed epiTCR, a Random Forest-based method dedicated to predicting the TCR-peptide interactions. epiTCR used simple input of TCR CDR3β sequences and antigen sequences, which are encoded by flattened BLOSUM62. epiTCR performed with area under the curve (0.98) and higher sensitivity (0.94) than other existing tools (NetTCR, Imrex, ATM-TCR, and pMTnet), while maintaining comparable prediction specificity (0.9). We identified seven epitopes that contributed to 98.67% of false positives predicted by epiTCR and exerted similar effects on other tools. We also demonstrated a considerable influence of peptide sequences on prediction, highlighting the need for more diverse peptides in a more balanced dataset. In conclusion, epiTCR is among the most well-performing tools, thanks to the use of combined data from public sources and its use will contribute to the quest in identifying neoantigens for precision cancer immunotherapy.

AVAILABILITY AND IMPLEMENTATION

epiTCR is available on GitHub (https://github.com/ddiem-ri-4D/epiTCR).

摘要

动机

预测 T 细胞受体 (TCR) 与人类白细胞抗原分子呈递的肽之间的结合是一项极具挑战性的任务，也是免疫疗法发展的关键瓶颈。现有的预测工具尽管在其构建的数据集中表现出良好的性能，但在用于预测能够在患者中引发 T 细胞反应的表位时，其真阳性率较低。因此，仍然需要一个基于包含现有公开可用数据的大型数据集构建的改进的 TCR-肽预测工具。

结果

我们从五个公共数据库（IEDB、TBAdb、VDJdb、McPAS-TCR 和 10X）收集数据，形成了一个包含超过 300 万个 TCR-肽对的数据集，其中 3.27%是结合相互作用。我们提出了 epiTCR，这是一种基于随机森林的方法，专门用于预测 TCR-肽相互作用。epiTCR 使用 TCR CDR3β 序列和抗原序列的简单输入，这些序列由展平的 BLOSUM62 编码。epiTCR 的曲线下面积（0.98）和更高的敏感性（0.94）优于其他现有工具（NetTCR、Imrex、ATM-TCR 和 pMTnet），同时保持相当的预测特异性（0.9）。我们确定了七个表位，这些表位对 epiTCR 预测的 98.67%假阳性贡献最大，并对其他工具产生了类似的影响。我们还证明了肽序列对预测的重要影响，这突出表明需要在更平衡的数据集中使用更多样化的肽。总之，epiTCR 是表现最好的工具之一，这要归功于使用来自公共资源的组合数据及其使用将有助于识别用于精准癌症免疫疗法的新抗原。

可用性和实施

epiTCR 可在 GitHub（https://github.com/ddiem-ri-4D/epiTCR）上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12e0/10159657/fcaea897b66f/btad284f1.jpg

相似文献

epiTCR: a highly sensitive predictor for TCR-peptide binding.

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad284.

Prediction of Specific TCR-Peptide Binding From Large Dictionaries of TCR-Peptide Pairs.

Front Immunol. 2020 Aug 25;11:1803. doi: 10.3389/fimmu.2020.01803. eCollection 2020.

AttnTAP: A Dual-input Framework Incorporating the Attention Mechanism for Accurately Predicting TCR-peptide Binding.

Front Genet. 2022 Aug 22;13:942491. doi: 10.3389/fgene.2022.942491. eCollection 2022.

Identification of the cognate peptide-MHC target of T cell receptors using molecular modeling and force field scoring.

Mol Immunol. 2018 Feb;94:91-97. doi: 10.1016/j.molimm.2017.12.019. Epub 2017 Dec 27.

DLpTCR: an ensemble deep learning framework for predicting immunogenic peptide recognized by T cell receptor.

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab335.

TITAN: T-cell receptor specificity prediction with bimodal attention networks.

Bioinformatics. 2021 Jul 12;37(Suppl_1):i237-i244. doi: 10.1093/bioinformatics/btab294.

BERTrand-peptide:TCR binding prediction using Bidirectional Encoder Representations from Transformers augmented with random TCR pairing.

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad468.

NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions.

Front Immunol. 2022 Dec 6;13:1055151. doi: 10.3389/fimmu.2022.1055151. eCollection 2022.

TCR-Pred: A new web-application for prediction of epitope and MHC specificity for CDR3 TCR sequences using molecular fragment descriptors.

Immunology. 2023 Aug;169(4):447-453. doi: 10.1111/imm.13641. Epub 2023 Mar 16.

VDJdb: a curated database of T-cell receptor sequences with known antigen specificity.

Nucleic Acids Res. 2018 Jan 4;46(D1):D419-D427. doi: 10.1093/nar/gkx760.

引用本文的文献

TCR-pMHC Binding Specificity Prediction From Structure Using Graph Neural Networks.

IEEE Trans Comput Biol Bioinform. 2025 Jan-Feb;22(1):171-179. doi: 10.1109/TCBBIO.2024.3504235.

TCR-epiDiff: solving dual challenges of TCR generation and binding prediction.

Bioinformatics. 2025 Jul 1;41(Supplement_1):i125-i132. doi: 10.1093/bioinformatics/btaf202.

nuTCRacker: Predicting the Recognition of HLA-I-Peptide Complexes by αβTCRs for Unseen Peptides.

Eur J Immunol. 2025 Jul;55(7):e51607. doi: 10.1002/eji.202451607.

Benchmarking of T cell receptor-epitope predictors with ePytope-TCR.

Cell Genom. 2025 Jun 27:100946. doi: 10.1016/j.xgen.2025.100946.

Computational methods and data resources for predicting tumor neoantigens.

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf302.

A systematic review of T cell epitopes defined from the proteome of human immunodeficiency virus.

Virus Res. 2025 Jun 23;358:199602. doi: 10.1016/j.virusres.2025.199602.

Phage display enables machine learning discovery of cancer antigen-specific TCRs.

Sci Adv. 2025 Jun 13;11(24):eads5589. doi: 10.1126/sciadv.ads5589. Epub 2025 Jun 11.

LightCTL: lightweight contrastive TCR-pMHC specificity learning with context-aware prompt.

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf246.

TRAP: a contrastive learning-enhanced framework for robust TCR-pMHC binding prediction with improved generalizability.

Chem Sci. 2025 Apr 29. doi: 10.1039/d4sc08141b.

OnmiMHC: a machine learning solution for UCEC tumor vaccine development through enhanced peptide-MHC binding prediction.

Front Immunol. 2025 Feb 28;16:1550252. doi: 10.3389/fimmu.2025.1550252. eCollection 2025.

本文引用的文献

TSNAdb v2.0: The Updated Version of Tumor-specific Neoantigen Database.

Genomics Proteomics Bioinformatics. 2023 Apr;21(2):259-266. doi: 10.1016/j.gpb.2022.09.012. Epub 2022 Oct 6.

Rapid Assessment of T-Cell Receptor Specificity of the Immune Repertoire.

Nat Comput Sci. 2021 May;1(5):362-373. doi: 10.1038/s43588-021-00076-1. Epub 2021 May 24.

Deep learning-based prediction of the T cell receptor-antigen binding specificity.

Nat Mach Intell. 2021 Oct;3(10):864-875. doi: 10.1038/s42256-021-00383-2. Epub 2021 Sep 23.

ATM-TCR: TCR-Epitope Binding Affinity Prediction Using a Multi-Head Self-Attention Model.

Front Immunol. 2022 Jul 6;13:893247. doi: 10.3389/fimmu.2022.893247. eCollection 2022.

dbPepNeo2.0: A Database for Human Tumor Neoantigen Peptides From Mass Spectrometry and TCR Recognition.

Front Immunol. 2022 Apr 13;13:855976. doi: 10.3389/fimmu.2022.855976. eCollection 2022.

A machine learning model for ranking candidate HLA class I neoantigens based on known neoepitopes from multiple human tumor types.

Nat Cancer. 2021 May;2(5):563-574. doi: 10.1038/s43018-021-00197-6. Epub 2021 May 3.

NetTCR-2.0 enables accurate prediction of TCR-peptide binding by using paired TCRα and β sequence data.

Commun Biol. 2021 Sep 10;4(1):1060. doi: 10.1038/s42003-021-02610-3.

TITAN: T-cell receptor specificity prediction with bimodal attention networks.

Bioinformatics. 2021 Jul 12;37(Suppl_1):i237-i244. doi: 10.1093/bioinformatics/btab294.

A framework for highly multiplexed dextramer mapping and prediction of T cell receptor sequences to antigen specificity.

Sci Adv. 2021 May 14;7(20). doi: 10.1126/sciadv.abf5835. Print 2021 May.

Contribution of T Cell Receptor Alpha and Beta CDR3, MHC Typing, V and J Genes to Peptide Binding Prediction.

Front Immunol. 2021 Apr 26;12:664514. doi: 10.3389/fimmu.2021.664514. eCollection 2021.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

epiTCR：一种高灵敏度的 TCR-肽结合预测因子。

epiTCR: a highly sensitive predictor for TCR-peptide binding.

机构信息

Medical Genetics Institute, Ho Chi Minh City, Vietnam.

NexCalibur Therapeutics, Wilmington, DE, United States.