• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用PocketGen高效生成蛋白质口袋

Efficient Generation of Protein Pockets with PocketGen.

作者信息

Zhang Zaixi, Shen Wan Xiang, Liu Qi, Zitnik Marinka

机构信息

State Key Laboratory of Cognitive Intelligence, University of Science and Technology of China, Hefei, Anhui, China.

Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, China.

出版信息

bioRxiv. 2024 Sep 23:2024.02.25.581968. doi: 10.1101/2024.02.25.581968.

DOI:10.1101/2024.02.25.581968
PMID:38464121
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10925136/
Abstract

Designing protein-binding proteins is critical for drug discovery. However, the AI-based design of such proteins is challenging due to the complexity of ligand-protein interactions, the flexibility of ligand molecules and amino acid side chains, and sequence-structure dependencies. We introduce PocketGen, a deep generative model that simultaneously produces both the residue sequence and atomic structure of the protein regions where ligand interactions occur. PocketGen ensures consistency between sequence and structure by using a graph transformer for structural encoding and a sequence refinement module based on a protein language model. The bilevel graph transformer captures interactions at multiple scales, including atom, residue, and ligand levels. To enhance sequence refinement, PocketGen integrates a structural adapter into the protein language model, ensuring that structure-based predictions align with sequence-based predictions. PocketGen can generate high-fidelity protein pockets with superior binding affinity and structural validity. It operates ten times faster than physics-based methods and achieves a 95% success rate, defined as the percentage of generated pockets with higher binding affinity than reference pockets. Additionally, it attains an amino acid recovery rate exceeding 64%.

摘要

设计蛋白质结合蛋白对于药物发现至关重要。然而,由于配体-蛋白质相互作用的复杂性、配体分子和氨基酸侧链的灵活性以及序列-结构依赖性,基于人工智能设计此类蛋白质具有挑战性。我们引入了PocketGen,这是一种深度生成模型,它能同时生成发生配体相互作用的蛋白质区域的残基序列和原子结构。PocketGen通过使用用于结构编码的图变换器和基于蛋白质语言模型的序列优化模块来确保序列和结构之间的一致性。双层图变换器捕获多个尺度的相互作用,包括原子、残基和配体水平。为了增强序列优化,PocketGen将结构适配器集成到蛋白质语言模型中,确保基于结构的预测与基于序列的预测一致。PocketGen可以生成具有优异结合亲和力和结构有效性的高保真蛋白质口袋。它的运行速度比基于物理的方法快十倍,成功率达到95%,成功率定义为生成的口袋中结合亲和力高于参考口袋的百分比。此外,它的氨基酸回收率超过64%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/fa7f461f4b11/nihpp-2024.02.25.581968v4-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/45dd61755da9/nihpp-2024.02.25.581968v4-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/24e973e4668a/nihpp-2024.02.25.581968v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/0d0daa9827a1/nihpp-2024.02.25.581968v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/6db06ee84d5f/nihpp-2024.02.25.581968v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/f559bba305fe/nihpp-2024.02.25.581968v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/fa7f461f4b11/nihpp-2024.02.25.581968v4-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/45dd61755da9/nihpp-2024.02.25.581968v4-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/24e973e4668a/nihpp-2024.02.25.581968v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/0d0daa9827a1/nihpp-2024.02.25.581968v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/6db06ee84d5f/nihpp-2024.02.25.581968v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/f559bba305fe/nihpp-2024.02.25.581968v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e06/11421053/fa7f461f4b11/nihpp-2024.02.25.581968v4-f0005.jpg

相似文献

1
Efficient Generation of Protein Pockets with PocketGen.使用PocketGen高效生成蛋白质口袋
bioRxiv. 2024 Sep 23:2024.02.25.581968. doi: 10.1101/2024.02.25.581968.
2
DeepTGIN: a novel hybrid multimodal approach using transformers and graph isomorphism networks for protein-ligand binding affinity prediction.DeepTGIN:一种使用Transformer和图同构网络进行蛋白质-配体结合亲和力预测的新型混合多模态方法。
J Cheminform. 2024 Dec 29;16(1):147. doi: 10.1186/s13321-024-00938-6.
3
The algebraic extended atom-type graph-based model for precise ligand-receptor binding affinity prediction.基于代数扩展原子类型图的精确配体-受体结合亲和力预测模型。
J Cheminform. 2025 Jan 22;17(1):10. doi: 10.1186/s13321-025-00955-z.
4
End-to-End Protein Normal Mode Frequency Predictions Using Language and Graph Models and Application to Sonification.使用语言和图形模型进行端到端蛋白质正常模式频率预测及其在可听化中的应用。
ACS Nano. 2022 Dec 27;16(12):20656-20670. doi: 10.1021/acsnano.2c07681. Epub 2022 Nov 23.
5
LS-align: an atom-level, flexible ligand structural alignment algorithm for high-throughput virtual screening.LS-align:一种适用于高通量虚拟筛选的原子级、灵活的配体结构对齐算法。
Bioinformatics. 2018 Jul 1;34(13):2209-2218. doi: 10.1093/bioinformatics/bty081.
6
Identification of ligand-binding pockets in proteins using residue preference methods.使用残基偏好方法识别蛋白质中的配体结合口袋。
Protein Pept Lett. 2009;16(8):984-90. doi: 10.2174/092986609788923284.
7
Deep generative model for drug design from protein target sequence.基于蛋白质靶点序列的药物设计深度生成模型。
J Cheminform. 2023 Mar 28;15(1):38. doi: 10.1186/s13321-023-00702-2.
8
GNNSeq: A Sequence-Based Graph Neural Network for Predicting Protein-Ligand Binding Affinity.GNNSeq:一种基于序列的图神经网络,用于预测蛋白质-配体结合亲和力。
Pharmaceuticals (Basel). 2025 Feb 26;18(3):329. doi: 10.3390/ph18030329.
9
Atomic protein structure refinement using all-atom graph representations and SE(3)-equivariant graph transformer.使用全原子图表示和 SE(3)-等变图转换器进行原子蛋白质结构精修。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad298.
10
Comparison of structure- and ligand-based scoring functions for deep generative models: a GPCR case study.深度生成模型中基于结构和配体的评分函数比较:以G蛋白偶联受体为例的研究
J Cheminform. 2021 May 13;13(1):39. doi: 10.1186/s13321-021-00516-0.

本文引用的文献

1
Atomic context-conditioned protein sequence design using LigandMPNN.使用配体消息传递神经网络进行原子上下文条件蛋白质序列设计。
Nat Methods. 2025 Apr;22(4):717-723. doi: 10.1038/s41592-025-02626-1. Epub 2025 Mar 28.
2
Structure-based drug design with equivariant diffusion models.基于结构的药物设计与等变扩散模型
Nat Comput Sci. 2024 Dec;4(12):899-909. doi: 10.1038/s43588-024-00737-x. Epub 2024 Dec 9.
3
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
4
De novo design of drug-binding proteins with predictable binding energy and specificity.从头设计具有可预测结合能和特异性的药物结合蛋白。
Science. 2024 Apr 5;384(6691):106-112. doi: 10.1126/science.adl5364. Epub 2024 Apr 4.
5
Generalized biomolecular modeling and design with RoseTTAFold All-Atom.基于 RoseTTAFold All-Atom 的广义生物分子建模与设计。
Science. 2024 Apr 19;384(6693):eadl2528. doi: 10.1126/science.adl2528.
6
Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering.机器学习辅助酶工程面临的机遇与挑战
ACS Cent Sci. 2024 Feb 5;10(2):226-241. doi: 10.1021/acscentsci.3c01275. eCollection 2024 Feb 28.
7
PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences.PoseBusters:基于人工智能的对接方法无法生成符合物理原理的构象,也无法推广到新序列。
Chem Sci. 2023 Dec 13;15(9):3130-3139. doi: 10.1039/d3sc04185a. eCollection 2024 Feb 28.
8
An equivariant generative framework for molecular graph-structure Co-design.用于分子图结构协同设计的等变生成框架。
Chem Sci. 2023 Jul 19;14(31):8380-8392. doi: 10.1039/d3sc02538a. eCollection 2023 Aug 9.
9
De novo design of protein structure and function with RFdiffusion.利用 RFdiffusion 从头设计蛋白质结构和功能。
Nature. 2023 Aug;620(7976):1089-1100. doi: 10.1038/s41586-023-06415-8. Epub 2023 Jul 11.
10
Uni-GBSA: an open-source and web-based automatic workflow to perform MM/GB(PB)SA calculations for virtual screening.Uni-GBSA:一种开源的、基于网络的自动工作流程,用于执行虚拟筛选的 MM/GB(PB)SA 计算。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad218.