• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepAllo:使用具有多任务学习的蛋白质语言模型(pLM)进行变构位点预测。

DeepAllo: allosteric site prediction using protein language model (pLM) with multitask learning.

作者信息

Khokhar Moaaz, Keskin Ozlem, Gursoy Attila

机构信息

Department of Computer Engineering, Koç University, 34450 Istanbul, Turkey.

KUIS AI Center, Koç University, 34450 Istanbul, Turkey.

出版信息

Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf294.

DOI:10.1093/bioinformatics/btaf294
PMID:40372465
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12145174/
Abstract

MOTIVATION

Allostery, the process by which binding at one site perturbs a distant site, is being rendered as a key focus in the field of drug development with its substantial impact on protein function. The identification of allosteric pockets (sites) is a challenging task and several techniques have been developed, including Machine Learning to predict allosteric pockets that utilize both static and pocket features.

RESULTS

Our work, DeepAllo, is the first study that combines fine-tuned protein language model (pLM) with FPocket features and shows an increase in prediction performance of allosteric sites over previous studies. The pLM model was fine-tuned on AlloSteric Database (ASD) in Multitask Learning setting and was further used as a feature extractor to train XGBoost and AutoML models. The best model predicts allosteric pockets with 89.66% F1 score and 90.5% of allosteric pockets in the top 3 positions, outperforming previous results. A case study has been performed on proteins with known allosteric pockets, which shows the proof of our approach. Moreover, an effort was made to explain the pLM by visualizing its attention mechanism among allosteric and non-allosteric residues.

AVAILABILITY AND IMPLEMENTATION

The source code is available on GitHub (https://github.com/MoaazK/deepallo) and archived on Zenodo (DOI: 10.5281/zenodo.15255379). The trained model is hosted on Hugging Face (DOI: 10.57967/hf/5198). The dataset used for training and evaluation is archived on Zenodo (DOI: 10.5281/zenodo.15255437).

摘要

动机

变构是指一个位点的结合会干扰远处位点的过程,由于其对蛋白质功能有重大影响,已成为药物开发领域的一个关键焦点。变构口袋(位点)的识别是一项具有挑战性的任务,已经开发了几种技术,包括利用静态和口袋特征来预测变构口袋的机器学习技术。

结果

我们的工作DeepAllo是第一项将微调后的蛋白质语言模型(pLM)与FPocket特征相结合的研究,并且显示出变构位点的预测性能比以前的研究有所提高。pLM模型在多任务学习设置下于变构数据库(ASD)上进行了微调,并进一步用作特征提取器来训练XGBoost和自动机器学习模型。最佳模型预测变构口袋的F1分数为89.66%,并且90.5%的变构口袋在前三位,优于先前的结果。对具有已知变构口袋的蛋白质进行了案例研究,证明了我们方法的有效性。此外,还通过可视化其在变构和非变构残基之间的注意力机制来努力解释pLM。

可用性和实现方式

源代码可在GitHub(https://github.com/MoaazK/deepallo)上获取,并已存档于Zenodo(DOI:10.5281/zenodo.15255379)。训练好的模型托管在Hugging Face(DOI:10.57967/hf/5198)上。用于训练和评估的数据集已存档于Zenodo(DOI:10.5281/zenodo.15255437)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/97054c25f811/btaf294f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/9d034b64cc44/btaf294f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/eeb797b107eb/btaf294f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/3fac0aebb716/btaf294f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/a1f91b66f6ad/btaf294f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/57a87ed970a1/btaf294f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/97054c25f811/btaf294f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/9d034b64cc44/btaf294f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/eeb797b107eb/btaf294f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/3fac0aebb716/btaf294f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/a1f91b66f6ad/btaf294f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/57a87ed970a1/btaf294f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec75/12145174/97054c25f811/btaf294f6.jpg

相似文献

1
DeepAllo: allosteric site prediction using protein language model (pLM) with multitask learning.DeepAllo:使用具有多任务学习的蛋白质语言模型(pLM)进行变构位点预测。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf294.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
8
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
9
Mental Health First Aid as a tool for improving mental health and well-being.心理健康急救作为改善心理健康和福祉的工具。
Cochrane Database Syst Rev. 2023 Aug 22;8(8):CD013127. doi: 10.1002/14651858.CD013127.pub2.
10
ToxinPred 3.0: An improved method for predicting the toxicity of peptides.ToxinPred 3.0:一种改进的多肽毒性预测方法。
Comput Biol Med. 2024 Sep;179:108926. doi: 10.1016/j.compbiomed.2024.108926. Epub 2024 Jul 21.

引用本文的文献

1
STINGAllo: a web server for high-throughput prediction of allosteric site-forming residues using internal protein nanoenvironment descriptors.STINGAllo:一个利用蛋白质内部纳米环境描述符进行变构位点形成残基高通量预测的网络服务器。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf424.
2
Allosteric Coupling in Full-Length Lyn Kinase Revealed by Molecular Dynamics and Network Analysis.分子动力学和网络分析揭示全长Lyn激酶中的变构偶联
Int J Mol Sci. 2025 Jun 18;26(12):5835. doi: 10.3390/ijms26125835.

本文引用的文献

1
ASD2023: towards the integrating landscapes of allosteric knowledgebase.ASD2023:迈向变构知识库的综合景观。
Nucleic Acids Res. 2024 Jan 5;52(D1):D376-D383. doi: 10.1093/nar/gkad915.
2
PASSerRank: Prediction of allosteric sites with learning to rank.PASSerRank:基于学习排序的别构位点预测。
J Comput Chem. 2023 Oct 30;44(28):2223-2229. doi: 10.1002/jcc.27193. Epub 2023 Aug 10.
3
PASSer: fast and accurate prediction of protein allosteric sites.PASSer:快速准确预测蛋白质变构位点。
Nucleic Acids Res. 2023 Jul 5;51(W1):W427-W431. doi: 10.1093/nar/gkad303.
4
PASSer2.0: Accurate Prediction of Protein Allosteric Sites Through Automated Machine Learning.PASSer2.0:通过自动化机器学习准确预测蛋白质变构位点
Front Mol Biosci. 2022 Jul 11;9:879251. doi: 10.3389/fmolb.2022.879251. eCollection 2022.
5
Wandering beyond small molecules: peptides as allosteric protein modulators.游走于小分子之外:肽类作为变构蛋白调节剂。
Trends Pharmacol Sci. 2022 May;43(5):406-423. doi: 10.1016/j.tips.2021.10.011. Epub 2021 Nov 29.
6
PASSer: Prediction of Allosteric Sites Server.PASSer:变构位点预测服务器。
Mach Learn Sci Technol. 2021 Sep;2(3). doi: 10.1088/2632-2153/abe6d6. Epub 2021 May 13.
7
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning.ProtTrans:通过自监督学习理解生命语言。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7112-7127. doi: 10.1109/TPAMI.2021.3095381. Epub 2022 Sep 14.
8
Prediction of Orthosteric and Allosteric Regulations on Cannabinoid Receptors Using Supervised Machine Learning Classifiers.使用监督机器学习分类器预测大麻素受体的变构调节。
Mol Pharm. 2019 Jun 3;16(6):2605-2615. doi: 10.1021/acs.molpharmaceut.9b00182. Epub 2019 May 3.
9
NetSurfP-2.0: Improved prediction of protein structural features by integrated deep learning.NetSurfP-2.0:通过集成深度学习改进蛋白质结构特征预测。
Proteins. 2019 Jun;87(6):520-527. doi: 10.1002/prot.25674. Epub 2019 Mar 9.
10
Clustering huge protein sequence sets in linear time.线性时间内的大规模蛋白质序列集聚类。
Nat Commun. 2018 Jun 29;9(1):2542. doi: 10.1038/s41467-018-04964-5.