• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SFCalculator:连接深度生成模型与晶体学

SFCalculator: connecting deep generative models and crystallography.

作者信息

Li Minhuan, Dalton Kevin, Hekstra Doeke

机构信息

John A. Paulson School of Engineering & Applied Sciences, Harvard University.

Department of Molecular & Cellular Biology, Harvard University.

出版信息

bioRxiv. 2025 Jan 19:2025.01.12.632630. doi: 10.1101/2025.01.12.632630.

DOI:10.1101/2025.01.12.632630
PMID:39868231
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11760793/
Abstract

Proteins drive biochemical transformations by transitioning through distinct conformational states. Understanding these states is essential for modulating protein function. Although X-ray crystallography has enabled revolutionary advances in protein structure prediction by machine learning, this connection was made at the level of atomic models, not the underlying data. This lack of connection to crystallographic data limits the potential for further advances in both the accuracy of protein structure prediction and the application of machine learning to experimental structure determination. Here, we present SFCalculator, a differentiable pipeline that generates crystallographic observables from atomistic molecular structures with bulk solvent correction, bridging crystallographic data and neural network-based molecular modeling. We validate SFCalculator against conventional methods and demonstrate its utility by establishing three important proof-of-concept applications. First, SFCalculator enables accurate placement of molecular models relative to crystal lattices (known as phasing). Second, SFCalculator enables the search of the latent space of generative models for conformations that fit crystallographic data and are, therefore, also implicitly constrained by the information encoded by the model. Finally, SFCalculator enables the use of crystallographic data during training of generative models, enabling these models to generate an ensemble of conformations consistent with crystallographic data. SFCalculator, therefore, enables a new generation of analytical paradigms integrating crystallographic data and machine learning.

摘要

蛋白质通过转变为不同的构象状态来驱动生化转化。理解这些状态对于调节蛋白质功能至关重要。尽管X射线晶体学通过机器学习在蛋白质结构预测方面取得了革命性进展,但这种联系是在原子模型层面建立的,而非基础数据层面。这种与晶体学数据缺乏联系的情况限制了蛋白质结构预测准确性以及机器学习在实验结构测定中应用的进一步发展潜力。在此,我们展示了SFCalculator,这是一种可微管道,可通过具有体溶剂校正的原子分子结构生成晶体学可观测量,架起了晶体学数据与基于神经网络的分子建模之间的桥梁。我们将SFCalculator与传统方法进行了验证,并通过建立三个重要的概念验证应用展示了其效用。首先,SFCalculator能够相对于晶格准确放置分子模型(称为相位确定)。其次,SFCalculator能够在生成模型的潜在空间中搜索适合晶体学数据的构象,因此也受到模型编码信息的隐含约束。最后,SFCalculator能够在生成模型训练期间使用晶体学数据,使这些模型能够生成与晶体学数据一致的构象集合。因此,SFCalculator实现了整合晶体学数据和机器学习的新一代分析范式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/585051d8fc0b/nihpp-2025.01.12.632630v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/095aa8a695a1/nihpp-2025.01.12.632630v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/85d717a352fb/nihpp-2025.01.12.632630v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/5feefaa4e99a/nihpp-2025.01.12.632630v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/cdb1b76fcefc/nihpp-2025.01.12.632630v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/585051d8fc0b/nihpp-2025.01.12.632630v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/095aa8a695a1/nihpp-2025.01.12.632630v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/85d717a352fb/nihpp-2025.01.12.632630v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/5feefaa4e99a/nihpp-2025.01.12.632630v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/cdb1b76fcefc/nihpp-2025.01.12.632630v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a994/11760793/585051d8fc0b/nihpp-2025.01.12.632630v2-f0007.jpg

相似文献

1
SFCalculator: connecting deep generative models and crystallography.SFCalculator:连接深度生成模型与晶体学
bioRxiv. 2025 Jan 19:2025.01.12.632630. doi: 10.1101/2025.01.12.632630.
2
CrysFormer: Protein structure determination via Patterson maps, deep learning, and partial structure attention.晶体former:通过帕特森图、深度学习和部分结构注意力进行蛋白质结构测定。
Struct Dyn. 2024 Aug 14;11(4):044701. doi: 10.1063/4.0000252. eCollection 2024 Jul.
3
Machine Learning Generation of Dynamic Protein Conformational Ensembles.机器学习生成动态蛋白质构象集合。
Molecules. 2023 May 12;28(10):4047. doi: 10.3390/molecules28104047.
4
Exploring the dynamic information content of a protein NMR structure: comparison of a molecular dynamics simulation with the NMR and X-ray structures of Escherichia coli ribonuclease HI.探索蛋白质核磁共振结构的动态信息内容:大肠杆菌核糖核酸酶HI的分子动力学模拟与核磁共振及X射线结构的比较。
Proteins. 1999 Jul 1;36(1):87-110. doi: 10.1002/(sici)1097-0134(19990701)36:1<87::aid-prot8>3.0.co;2-r.
5
Transferable deep generative modeling of intrinsically disordered protein conformations.内在无序蛋白质构象的可转移深度生成建模
bioRxiv. 2024 Feb 8:2024.02.08.579522. doi: 10.1101/2024.02.08.579522.
6
Transferable deep generative modeling of intrinsically disordered protein conformations.可转移的深度生成模型对固有无序蛋白质构象的建模。
PLoS Comput Biol. 2024 May 23;20(5):e1012144. doi: 10.1371/journal.pcbi.1012144. eCollection 2024 May.
7
A deep learning solution for crystallographic structure determination.深度学习在晶体结构测定中的应用
IUCrJ. 2023 Jul 1;10(Pt 4):487-496. doi: 10.1107/S2052252523004293.
8
AlphaFold as a Prior: Experimental Structure Determination Conditioned on a Pretrained Neural Network.以AlphaFold为先验:基于预训练神经网络的实验结构测定
bioRxiv. 2025 Mar 11:2025.02.18.638828. doi: 10.1101/2025.02.18.638828.
9
Using cryo-electron microscopy maps for X-ray structure determination.利用冷冻电子显微镜图谱进行X射线结构测定。
IUCrJ. 2018 May 11;5(Pt 4):382-389. doi: 10.1107/S2052252518005857. eCollection 2018 Jul 1.
10
Ensemble MD simulations restrained via crystallographic data: accurate structure leads to accurate dynamics.通过晶体学数据约束的集成分子动力学模拟:精确的结构带来精确的动力学。
Protein Sci. 2014 Apr;23(4):488-507. doi: 10.1002/pro.2433.

本文引用的文献

1
Direct visualization of electric-field-stimulated ion conduction in a potassium channel.钾通道中电场刺激离子传导的直接可视化
Cell. 2025 Jan 9;188(1):77-88.e15. doi: 10.1016/j.cell.2024.12.006.
2
Scalable protein design using optimization in a relaxed sequence space.利用松弛序列空间中的优化进行可扩展的蛋白质设计。
Science. 2024 Oct 25;386(6720):439-445. doi: 10.1126/science.adq1741. Epub 2024 Oct 24.
3
The success rate of processed predicted models in molecular replacement: implications for experimental phasing in the AlphaFold era.
处理后预测模型在分子置换中的成功率:对 AlphaFold 时代实验相位的影响。
Acta Crystallogr D Struct Biol. 2024 Nov 1;80(Pt 11):766-779. doi: 10.1107/S2059798324009380. Epub 2024 Oct 3.
4
Quantum refinement in real and reciprocal space using the Phenix and ORCA software.使用Phenix和ORCA软件在实空间和倒易空间中进行量子精修。
IUCrJ. 2024 Nov 1;11(Pt 6):921-937. doi: 10.1107/S2052252524008406.
5
BioCARS: Synchrotron facility for probing structural dynamics of biological macromolecules.生物结构分析与研究协作系统:用于探测生物大分子结构动力学的同步加速器设施。
Struct Dyn. 2024 Jan 31;11(1):014301. doi: 10.1063/4.0000238. eCollection 2024 Jan.
6
OpenMM 8: Molecular Dynamics Simulation with Machine Learning Potentials.OpenMM 8:基于机器学习势的分子动力学模拟。
J Phys Chem B. 2024 Jan 11;128(1):109-116. doi: 10.1021/acs.jpcb.3c06662. Epub 2023 Dec 28.
7
De novo design of protein structure and function with RFdiffusion.利用 RFdiffusion 从头设计蛋白质结构和功能。
Nature. 2023 Aug;620(7976):1089-1100. doi: 10.1038/s41586-023-06415-8. Epub 2023 Jul 11.
8
Structural evidence for intermediates during O formation in photosystem II.结构证据表明在光系统 II 中 O 形成过程中的中间体。
Nature. 2023 May;617(7961):629-636. doi: 10.1038/s41586-023-06038-z. Epub 2023 May 3.
9
Ultrafast structural changes direct the first molecular events of vision.超快结构变化指导视觉的第一个分子事件。
Nature. 2023 Mar;615(7954):939-944. doi: 10.1038/s41586-023-05863-6. Epub 2023 Mar 22.
10
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.