• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多阳性对比学习的交叉注意力模型用于T细胞受体-抗原结合预测

Multi-positive contrastive learning-based cross-attention model for T cell receptor-antigen binding prediction.

作者信息

Shuai Yi, Shen Pengcheng, Zhang Xianrui

机构信息

Peng Cheng Laboratory, Shenzhen, 518066, China.

State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan RD. Minhang District, Shanghai, 200240, China.

出版信息

Comput Methods Programs Biomed. 2025 May 10;268:108797. doi: 10.1016/j.cmpb.2025.108797.

DOI:10.1016/j.cmpb.2025.108797
PMID:40378554
Abstract

BACKGROUND AND OBJECTIVE

T cells play a vital role in the immune system by recognizing and eliminating infected or cancerous cells, thus driving adaptive immune responses. Their activation is triggered by the binding of T cell receptors (TCRs) to epitopes presented on Major Histocompatibility Complex (MHC) molecules. However, experimentally identifying antigens that could be recognizable by T cells and possess immunogenic properties is resource-intensive, with most candidates proving non-immunogenic, underscoring the need for computational tools to predict peptide-MHC (pMHC) and TCR binding. Despite extensive efforts, accurately predicting TCR-antigen binding pairs remains challenging due to the vast diversity of TCRs.

METHODS

In this study, we propose a Contrastive Cross-attention model for TCR (ConTCR) and pMHC binding prediction. Firstly, the pMHC and TCR sequences are transformed into high-level embedding by pretrained encoders as feature representations. Then, we employ the multi-modal cross-attention to combine the features between pMHC sequences and TCR sequences. Next, based on the contrastive learning strategy, we pretrained the backbone of ConTCR to boost the model's feature extraction ability for pMHC and TCR sequences. Finally, the model is fine-tuned for classification between positive and negative samples.

RESULTS

Based on this advanced strategy, our proposed model could effectively capture the critical information on TCR-pMHC interactions, and the model is visualized by the attention score heatmap for interpretability. ConTCR demonstrates strong generalization in predicting binding specificity for unseen epitopes and diverse TCR repertoires. On independent non-zero-shot test sets, the model achieved AUC-ROC scores of 0.849 and 0.950; on zero-shot test sets, it obtained AUC-ROC scores of 0.830 and 0.938.

CONCLUSION

Our framework offers a promising solution for improving pMHC-TCR binding prediction and model interpretability. By leveraging the ConTCR model and pMHC-TCR features, we achieve more precise precision than recently advanced models. Overall, ConTCR is a robust tool for predicting pMHC-TCR binding and holds significant promise to advance TCR-based immunotherapies as a valuable artificial intelligence tool. The codes and data used in this study are available at this website.

摘要

背景与目的

T细胞通过识别和清除受感染或癌变的细胞在免疫系统中发挥至关重要的作用,从而驱动适应性免疫反应。它们的激活是由T细胞受体(TCR)与主要组织相容性复合体(MHC)分子上呈递的表位结合所触发的。然而,通过实验鉴定可被T细胞识别并具有免疫原性的抗原需要耗费大量资源,大多数候选抗原被证明无免疫原性,这凸显了使用计算工具预测肽-MHC(pMHC)和TCR结合的必要性。尽管付出了巨大努力,但由于TCR的巨大多样性,准确预测TCR-抗原结合对仍然具有挑战性。

方法

在本研究中,我们提出了一种用于TCR(ConTCR)和pMHC结合预测的对比交叉注意力模型。首先,通过预训练的编码器将pMHC和TCR序列转换为高级嵌入作为特征表示。然后,我们采用多模态交叉注意力来结合pMHC序列和TCR序列之间的特征。接下来,基于对比学习策略,我们对ConTCR的主干进行预训练,以提高模型对pMHC和TCR序列的特征提取能力。最后,对模型进行微调以区分正样本和负样本。

结果

基于这一先进策略,我们提出的模型能够有效地捕捉TCR-pMHC相互作用的关键信息,并通过注意力分数热图对模型进行可视化以提高可解释性。ConTCR在预测未见表位和多样TCR库的结合特异性方面表现出很强的泛化能力。在独立的非零样本测试集上,该模型的AUC-ROC分数分别为0.849和0.950;在零样本测试集上,其AUC-ROC分数分别为0.830和0.938。

结论

我们的框架为改进pMHC-TCR结合预测和模型可解释性提供了一个有前景的解决方案。通过利用ConTCR模型和pMHC-TCR特征,我们实现了比最近的先进模型更高的精度。总体而言,ConTCR是一种预测pMHC-TCR结合的强大工具,作为一种有价值的人工智能工具,在推进基于TCR的免疫疗法方面具有重大前景。本研究中使用的代码和数据可在本网站获取。

相似文献

1
Multi-positive contrastive learning-based cross-attention model for T cell receptor-antigen binding prediction.基于多阳性对比学习的交叉注意力模型用于T细胞受体-抗原结合预测
Comput Methods Programs Biomed. 2025 May 10;268:108797. doi: 10.1016/j.cmpb.2025.108797.
2
LightCTL: lightweight contrastive TCR-pMHC specificity learning with context-aware prompt.LightCTL:基于上下文感知提示的轻量级对比性TCR-pMHC特异性学习
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf246.
3
Attention-aware differential learning for predicting peptide-MHC class I binding and T cell receptor recognition.用于预测肽-MHC I类结合和T细胞受体识别的注意力感知差异学习
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf038.
4
A structural-based machine learning method to classify binding affinities between TCR and peptide-MHC complexes.一种基于结构的机器学习方法,用于分类 TCR 与肽-MHC 复合物之间的结合亲和力。
Mol Immunol. 2021 Nov;139:76-86. doi: 10.1016/j.molimm.2021.07.020. Epub 2021 Aug 26.
5
Structure-Directed Pan-Specific T-Cell Receptor-Peptide-Major Histocompatibility Complex Interaction Prediction.基于结构的泛特异性T细胞受体-肽-主要组织相容性复合体相互作用预测
J Chem Inf Model. 2025 May 12;65(9):4674-4686. doi: 10.1021/acs.jcim.5c00055. Epub 2025 Apr 29.
6
MPID-T: database for sequence-structure-function information on T-cell receptor/peptide/MHC interactions.MPID-T:T细胞受体/肽/MHC相互作用的序列-结构-功能信息数据库。
Appl Bioinformatics. 2006;5(2):111-4. doi: 10.2165/00822942-200605020-00005.
7
Quantifying conformational changes in the TCR:pMHC-I binding interface.量化TCR:pMHC-I结合界面中的构象变化。
Front Immunol. 2024 Dec 2;15:1491656. doi: 10.3389/fimmu.2024.1491656. eCollection 2024.
8
Attention-aware contrastive learning for predicting T cell receptor-antigen binding specificity.注意感知对比学习预测 T 细胞受体-抗原结合特异性。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac378.
9
T-cell receptor triggering is critically dependent on the dimensions of its peptide-MHC ligand.T细胞受体的触发严重依赖于其肽-MHC配体的尺寸。
Nature. 2005 Jul 28;436(7050):578-82. doi: 10.1038/nature03843.
10
A Computational Strategy for the Rapid Identification and Ranking of Patient-Specific T Cell Receptors Bound to Neoantigens.一种用于快速鉴定和排序与新抗原结合的患者特异性T细胞受体的计算策略。
Macromol Rapid Commun. 2024 Dec;45(24):e2400225. doi: 10.1002/marc.202400225. Epub 2024 Jun 12.