• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用Pythia-PPI对突变后蛋白质-蛋白质结合亲和力变化进行可靠预测。

Reliable prediction of protein-protein binding affinity changes upon mutations with Pythia-PPI.

作者信息

Tao Fangting, Sun Jinyuan, Gao Pengyue, Gao George Fu, Wu Bian

机构信息

Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230026, China.

Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China.

出版信息

Natl Sci Rev. 2025 Jun 10;12(6):nwaf231. doi: 10.1093/nsr/nwaf231. eCollection 2025 Jun.

DOI:10.1093/nsr/nwaf231
PMID:40575548
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12199698/
Abstract

Protein-protein interactions (PPIs) are essential for numerous biological functions and predicting binding affinity changes caused by mutations is crucial for understanding the impact of genetic variation and advancing protein engineering. Although machine-learning-based methods show promise in improving prediction accuracy, limited experimental data remain a significant bottleneck. In this study, we employed multitask learning and self-distillation to overcome the data limitation and improve the accuracy of protein-protein binding affinity prediction. By incorporating a mutation stability prediction task, our model achieved state-of-the-art accuracy on the SKEMPI dataset and was subsequently used to predict binding affinity changes for millions of mutations, generating an expanded dataset for self-distillation. Compared with prevalent methods, Pythia-PPI increased the Pearson's correlation between predictions and experimental data from 0.6447 to 0.7850 on the SKEMPI dataset and from 0.3654 to 0.6050 on the viral-receptor dataset. Experimental validation further confirmed its ability to identify high-affinity mutations on the CB6 antibody in complex with the severe acute respiratory syndrome coronavirus 2 prototype receptor binding domain, with the best single-point mutant among the top 10 predictions showing a 2-fold increase in binding affinity. These findings demonstrate that Pythia-PPI is a valuable tool for analysing the fitness landscape of PPIs. A web server for Pythia-PPI is available at https://pythiappi.wulab.xyz for easy access.

摘要

蛋白质-蛋白质相互作用(PPIs)对于众多生物学功能至关重要,预测由突变引起的结合亲和力变化对于理解遗传变异的影响和推进蛋白质工程至关重要。尽管基于机器学习的方法在提高预测准确性方面显示出前景,但有限的实验数据仍然是一个重大瓶颈。在本研究中,我们采用多任务学习和自蒸馏来克服数据限制并提高蛋白质-蛋白质结合亲和力预测的准确性。通过纳入突变稳定性预测任务,我们的模型在SKEMPI数据集上达到了当前最优的准确性,随后被用于预测数百万个突变的结合亲和力变化,生成了一个用于自蒸馏的扩展数据集。与流行方法相比,Pythia-PPI在SKEMPI数据集上使预测与实验数据之间的皮尔逊相关性从0.6447提高到0.7850,在病毒-受体数据集上从0.3654提高到0.6050。实验验证进一步证实了其在识别与严重急性呼吸综合征冠状病毒2原型受体结合域复合的CB6抗体上的高亲和力突变的能力,前10个预测中最佳的单点突变显示结合亲和力增加了2倍。这些发现表明Pythia-PPI是分析PPIs适应性景观的有价值工具。可通过https://pythiappi.wulab.xyz访问Pythia-PPI的网络服务器,以便于使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/d6f46ad6f215/nwaf231fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/edb689c5e9a4/nwaf231fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/d7e6b92b709a/nwaf231fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/7f0985e0c4ac/nwaf231fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/6196a513734a/nwaf231fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/3b8b0bb22632/nwaf231fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/d6f46ad6f215/nwaf231fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/edb689c5e9a4/nwaf231fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/d7e6b92b709a/nwaf231fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/7f0985e0c4ac/nwaf231fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/6196a513734a/nwaf231fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/3b8b0bb22632/nwaf231fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b72/12199698/d6f46ad6f215/nwaf231fig6.jpg

相似文献

1
Reliable prediction of protein-protein binding affinity changes upon mutations with Pythia-PPI.使用Pythia-PPI对突变后蛋白质-蛋白质结合亲和力变化进行可靠预测。
Natl Sci Rev. 2025 Jun 10;12(6):nwaf231. doi: 10.1093/nsr/nwaf231. eCollection 2025 Jun.
2
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
3
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
4
Proton pump inhibitors for the prevention of non-steroidal anti-inflammatory drug-induced ulcers and dyspepsia.质子泵抑制剂用于预防非甾体抗炎药引起的溃疡和消化不良。
Cochrane Database Syst Rev. 2025 May 8;5(5):CD014585. doi: 10.1002/14651858.CD014585.pub2.
5
Can We Accurately Predict Adult Height in Pediatric Patients Who Undergo Treatment for Sarcoma?我们能否准确预测接受肉瘤治疗的儿科患者的成人身高?
Clin Orthop Relat Res. 2025 Feb 6;483(7):1338-46. doi: 10.1097/CORR.0000000000003409.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
7
Physical interventions to interrupt or reduce the spread of respiratory viruses.物理干预措施以阻断或减少呼吸道病毒的传播。
Cochrane Database Syst Rev. 2023 Jan 30;1(1):CD006207. doi: 10.1002/14651858.CD006207.pub6.
8
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
9
Computer and mobile technology interventions for self-management in chronic obstructive pulmonary disease.用于慢性阻塞性肺疾病自我管理的计算机和移动技术干预措施。
Cochrane Database Syst Rev. 2017 May 23;5(5):CD011425. doi: 10.1002/14651858.CD011425.pub2.
10
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.

本文引用的文献

1
Structure-based self-supervised learning enables ultrafast protein stability prediction upon mutation.基于结构的自监督学习能够在突变时实现超快速蛋白质稳定性预测。
Innovation (Camb). 2025 Jan 6;6(1):100750. doi: 10.1016/j.xinn.2024.100750.
2
Simulating 500 million years of evolution with a language model.用语言模型模拟5亿年的进化历程。
Science. 2025 Feb 21;387(6736):850-858. doi: 10.1126/science.ads0018. Epub 2025 Jan 16.
3
Protein interactions in human pathogens revealed through deep learning.通过深度学习揭示人类病原体中的蛋白质相互作用。
Nat Microbiol. 2024 Oct;9(10):2642-2652. doi: 10.1038/s41564-024-01791-x. Epub 2024 Sep 18.
4
AI models collapse when trained on recursively generated data.当在递归生成的数据上训练 AI 模型时,模型会崩溃。
Nature. 2024 Jul;631(8022):755-759. doi: 10.1038/s41586-024-07566-y. Epub 2024 Jul 24.
5
Stability Oracle: a structure-based graph-transformer framework for identifying stabilizing mutations.稳定性预测器:一种基于结构的图变换框架,用于识别稳定化突变。
Nat Commun. 2024 Jul 23;15(1):6170. doi: 10.1038/s41467-024-49780-2.
6
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.利用更大的数据集进行迁移学习,以提高蛋白质稳定性变化预测的准确性。
Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2314853121. doi: 10.1073/pnas.2314853121. Epub 2024 Jan 29.
7
Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark.异构网络表示学习:一个包含综述与基准测试的统一框架
IEEE Trans Knowl Data Eng. 2022 Oct;34(10):4854-4873. doi: 10.1109/tkde.2020.3045924. Epub 2020 Dec 21.
8
Masked inverse folding with sequence transfer for protein representation learning.用于蛋白质表示学习的带序列转移的掩码反向折叠
Protein Eng Des Sel. 2023 Jan 21;36. doi: 10.1093/protein/gzad015.
9
Mega-scale experimental analysis of protein folding stability in biology and design.大规模实验分析生物学和设计中的蛋白质折叠稳定性。
Nature. 2023 Aug;620(7973):434-444. doi: 10.1038/s41586-023-06328-6. Epub 2023 Jul 19.
10
Robust deep learning-based protein sequence design using ProteinMPNN.使用 ProteinMPNN 进行健壮的基于深度学习的蛋白质序列设计。
Science. 2022 Oct 7;378(6615):49-56. doi: 10.1126/science.add2187. Epub 2022 Sep 15.