Suppr
超能文献

预测蛋白质中配体结合位点的多种构象表明，AlphaFold2 可能记得太多了。

Predicting multiple conformations of ligand binding sites in proteins suggests that AlphaFold2 may remember too much.

机构信息

Department of Biomedical Engineering, Boston University, Boston, MA 02215.

Department of Chemistry, Boston University, Boston, MA 02215.

出版信息

Proc Natl Acad Sci U S A. 2024 Nov 26;121(48):e2412719121. doi: 10.1073/pnas.2412719121. Epub 2024 Nov 20.

DOI:10.1073/pnas.2412719121

PMID:39565312

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11621821/

Abstract

The goal of this paper is predicting the conformational distributions of ligand binding sites using the AlphaFold2 (AF2) protein structure prediction program with stochastic subsampling of the multiple sequence alignment (MSA). We explored the opening of cryptic ligand binding sites in 16 proteins, where the closed and open conformations define the expected extreme points of the conformational variation. Due to the many structures of these proteins in the Protein Data Bank (PDB), we were able to study whether the distribution of X-ray structures affects the distribution of AF2 models. We have found that AF2 generates both a cluster of open and a cluster of closed models for proteins that have comparable numbers of open and closed structures in the PDB and not too many other conformations. This was observed even with default MSA parameters, thus without further subsampling. In contrast, with the exception of a single protein, AF2 did not yield multiple clusters of conformations for proteins that had imbalanced numbers of open and closed structures in the PDB, or had substantial numbers of other structures. Subsampling improved the results only for a single protein, but very shallow MSA led to incorrect structures. The ability of generating both open and closed conformations for six out of the 16 proteins agrees with the success rates of similar studies reported in the literature. However, we showed that this partial success is due to AF2 "remembering" the conformational distributions in the PDB and that the approach fails to predict rarely seen conformations.

摘要

本文旨在使用 AlphaFold2（AF2）蛋白质结构预测程序，通过对多重序列比对（MSA）进行随机抽样，预测配体结合位点的构象分布。我们探索了 16 种蛋白质中隐蔽配体结合位点的开启，其中封闭和开放构象定义了构象变化的预期极值。由于这些蛋白质在蛋白质数据库（PDB）中有许多结构，我们能够研究 X 射线结构的分布是否影响 AF2 模型的分布。我们发现，对于在 PDB 中具有可比数量的开放和封闭结构且没有太多其他构象的蛋白质，AF2 为其生成了开放模型簇和封闭模型簇。即使使用默认的 MSA 参数，也可以观察到这种情况，因此无需进一步抽样。相比之下，除了一种蛋白质外，对于在 PDB 中具有开放和封闭结构数量不平衡或具有大量其他结构的蛋白质，AF2 并未产生多个构象簇。抽样仅改善了一种蛋白质的结果，但非常浅的 MSA 导致了不正确的结构。对于 16 种蛋白质中的 6 种蛋白质，生成开放和封闭构象的能力与文献中报道的类似研究的成功率一致。然而，我们表明，这种部分成功是由于 AF2“记住”了 PDB 中的构象分布，并且该方法无法预测罕见出现的构象。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/514f/11621821/56dc361a0a74/pnas.2412719121fig01.jpg

相似文献

Predicting multiple conformations of ligand binding sites in proteins suggests that AlphaFold2 may remember too much.

Proc Natl Acad Sci U S A. 2024 Nov 26;121(48):e2412719121. doi: 10.1073/pnas.2412719121. Epub 2024 Nov 20.

A comprehensive exploration of the druggable conformational space of protein kinases using AI-predicted structures.

PLoS Comput Biol. 2024 Jul 24;20(7):e1012302. doi: 10.1371/journal.pcbi.1012302. eCollection 2024 Jul.

Exploring the Druggable Conformational Space of Protein Kinases Using AI-Generated Structures.

bioRxiv. 2023 Sep 2:2023.08.31.555779. doi: 10.1101/2023.08.31.555779.

AlphaFold2-Based Characterization of Apo and Holo Protein Structures and Conformational Ensembles Using Randomized Alanine Sequence Scanning Adaptation: Capturing Shared Signature Dynamics and Ligand-Induced Conformational Changes.

Int J Mol Sci. 2024 Dec 2;25(23):12968. doi: 10.3390/ijms252312968.

Interpretable Atomistic Prediction and Functional Analysis of Conformational Ensembles and Allosteric States in Protein Kinases Using AlphaFold2 Adaptation with Randomized Sequence Scanning and Local Frustration Profiling.

bioRxiv. 2024 Feb 20:2024.02.15.580591. doi: 10.1101/2024.02.15.580591.

Sampling alternative conformational states of transporters and receptors with AlphaFold2.

Elife. 2022 Mar 3;11:e75751. doi: 10.7554/eLife.75751.

Conservation of Hot Spots and Ligand Binding Sites in Protein Models by AlphaFold2.

J Chem Inf Model. 2024 Feb 12;64(3):960-973. doi: 10.1021/acs.jcim.3c01761. Epub 2024 Jan 22.

Challenge for Deep Learning: Protein Structure Prediction of Ligand-Induced Conformational Changes at Allosteric and Orthosteric Sites.

J Chem Inf Model. 2024 Nov 25;64(22):8481-8494. doi: 10.1021/acs.jcim.4c01475. Epub 2024 Nov 1.

Structure prediction of alternative protein conformations.

Nat Commun. 2024 Aug 26;15(1):7328. doi: 10.1038/s41467-024-51507-2.

AlphaFold2's training set powers its predictions of some fold-switched conformations.

Protein Sci. 2025 Apr;34(4):e70105. doi: 10.1002/pro.70105.

引用本文的文献

Modeling CAPRI Targets of Round 55 by Combining AlphaFold and Docking.

Proteins. 2025 Jun 6. doi: 10.1002/prot.26853.

Emerging frontiers in protein structure prediction following the AlphaFold revolution.

J R Soc Interface. 2025 Apr;22(225):20240886. doi: 10.1098/rsif.2024.0886. Epub 2025 Apr 16.

Leveraging Sequence Purification for Accurate Prediction of Multiple Conformational States with AlphaFold2.

Res Sq. 2025 Mar 4:rs.3.rs-6087969. doi: 10.21203/rs.3.rs-6087969/v1.

Hidden Structural States of Proteins Revealed by Conformer Selection with AlphaFold-NMR.

Res Sq. 2025 Feb 19:rs.3.rs-5994356. doi: 10.21203/rs.3.rs-5994356/v1.

Modeling Alternative Conformational States of Pseudo-Symmetric Solute Carrier Transporters using Methods from Deep Learning.

bioRxiv. 2024 Dec 16:2024.07.15.603529. doi: 10.1101/2024.07.15.603529.

Hidden Structural States of Proteins Revealed by Conformer Selection with AlphaFold-NMR.

bioRxiv. 2025 Feb 26:2024.06.26.600902. doi: 10.1101/2024.06.26.600902.

本文引用的文献

A comparison of antibody-antigen complex sequence-to-structure prediction methods and their systematic biases.

Protein Sci. 2024 Sep;33(9):e5127. doi: 10.1002/pro.5127.

Can Protein Structure Prediction Methods Capture Alternative Conformations of Membrane Transporters?

J Chem Inf Model. 2024 Apr 22;64(8):3524-3536. doi: 10.1021/acs.jcim.3c01936. Epub 2024 Apr 2.

High-throughput prediction of protein conformational distributions with subsampled AlphaFold2.

Nat Commun. 2024 Mar 27;15(1):2464. doi: 10.1038/s41467-024-46715-9.

Predicting multiple conformations via sequence clustering and AlphaFold2.

Nature. 2024 Jan;625(7996):832-839. doi: 10.1038/s41586-023-06832-9. Epub 2023 Nov 13.

AFsample: improving multimer prediction with AlphaFold using massive sampling.

Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad573.

Modeling conformational states of proteins with AlphaFold.

Curr Opin Struct Biol. 2023 Aug;81:102645. doi: 10.1016/j.sbi.2023.102645. Epub 2023 Jun 29.

Machine Learning Generation of Dynamic Protein Conformational Ensembles.

Molecules. 2023 May 12;28(10):4047. doi: 10.3390/molecules28104047.

Accelerating Cryptic Pocket Discovery Using AlphaFold.

J Chem Theory Comput. 2023 Jul 25;19(14):4355-4363. doi: 10.1021/acs.jctc.2c01189. Epub 2023 Mar 22.

Direct generation of protein conformational ensembles via machine learning.

Nat Commun. 2023 Feb 11;14(1):774. doi: 10.1038/s41467-023-36443-x.

Improving peptide-protein docking with AlphaFold-Multimer using forced sampling.

Front Bioinform. 2022 Sep 26;2:959160. doi: 10.3389/fbinf.2022.959160. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

预测蛋白质中配体结合位点的多种构象表明，AlphaFold2 可能记得太多了。

Predicting multiple conformations of ligand binding sites in proteins suggests that AlphaFold2 may remember too much.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译