• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于去噪扩散网络的从头蛋白质设计,无需预先训练的结构预测模型。

De novo protein design with a denoising diffusion network independent of pretrained structure prediction models.

机构信息

Department of Rheumatology and Immunology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, Center for Advanced Interdisciplinary Science and Biomedicine of IHM, University of Science and Technology of China, Hefei, China.

MOE Key Laboratory for Membraneless Organelles and Cellular Dynamics, School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China.

出版信息

Nat Methods. 2024 Nov;21(11):2107-2116. doi: 10.1038/s41592-024-02437-w. Epub 2024 Oct 9.

DOI:10.1038/s41592-024-02437-w
PMID:39384986
Abstract

The recent success of RFdiffusion, a method for protein structure design with a denoising diffusion probabilistic model, has relied on fine-tuning the RoseTTAFold structure prediction network for protein backbone denoising. Here, we introduce SCUBA-diffusion (SCUBA-D), a protein backbone denoising diffusion probabilistic model freshly trained by considering co-diffusion of sequence representation to enhance model regularization and adversarial losses to minimize data-out-of-distribution errors. While matching the performance of the pretrained RoseTTAFold-based RFdiffusion in generating experimentally realizable protein structures, SCUBA-D readily generates protein structures with not-yet-observed overall folds that are different from those predictable with RoseTTAFold. The accuracy of SCUBA-D was confirmed by the X-ray structures of 16 designed proteins and a protein complex, and by experiments validating designed heme-binding proteins and Ras-binding proteins. Our work shows that deep generative models of images or texts can be fruitfully extended to complex physical objects like protein structures by addressing outstanding issues such as the data-out-of-distribution errors.

摘要

最近,RFdiffusion 方法在蛋白质结构设计方面取得了成功,该方法使用去噪扩散概率模型对蛋白质结构进行预测。RFdiffusion 方法成功的关键在于对 RoseTTAFold 结构预测网络进行微调,以实现对蛋白质主链的去噪。在这项研究中,我们引入了 SCUBA-diffusion(SCUBA-D),这是一种全新的蛋白质主链去噪扩散概率模型,它通过考虑序列表示的共同扩散来增强模型正则化,并通过对抗损失来最小化数据分布外误差。虽然 SCUBA-D 在生成可实验实现的蛋白质结构方面的性能与基于预训练的 RoseTTAFold 的 RFdiffusion 相当,但 SCUBA-D 可以轻松生成尚未观察到的具有全新整体折叠的蛋白质结构,这些结构是无法用 RoseTTAFold 预测的。通过对 16 个设计蛋白质和一个蛋白质复合物的 X 射线结构进行验证,以及对设计的血红素结合蛋白和 Ras 结合蛋白进行实验验证,证实了 SCUBA-D 的准确性。我们的工作表明,通过解决数据分布外误差等突出问题,深度生成模型可以成功地扩展到像蛋白质结构这样的复杂物理对象。

相似文献

1
De novo protein design with a denoising diffusion network independent of pretrained structure prediction models.基于去噪扩散网络的从头蛋白质设计,无需预先训练的结构预测模型。
Nat Methods. 2024 Nov;21(11):2107-2116. doi: 10.1038/s41592-024-02437-w. Epub 2024 Oct 9.
2
De novo design of protein structure and function with RFdiffusion.利用 RFdiffusion 从头设计蛋白质结构和功能。
Nature. 2023 Aug;620(7976):1089-1100. doi: 10.1038/s41586-023-06415-8. Epub 2023 Jul 11.
3
Generalized biomolecular modeling and design with RoseTTAFold All-Atom.基于 RoseTTAFold All-Atom 的广义生物分子建模与设计。
Science. 2024 Apr 19;384(6693):eadl2528. doi: 10.1126/science.adl2528.
4
Protein structure generation via folding diffusion.通过折叠扩散生成蛋白质结构
Nat Commun. 2024 Feb 5;15(1):1059. doi: 10.1038/s41467-024-45051-2.
5
Multistate and functional protein design using RoseTTAFold sequence space diffusion.使用RoseTTAFold序列空间扩散进行多状态和功能性蛋白质设计。
Nat Biotechnol. 2024 Sep 25. doi: 10.1038/s41587-024-02395-w.
6
Protein Design Using Structure-Prediction Networks: AlphaFold and RoseTTAFold as Protein Structure Foundation Models.使用结构预测网络进行蛋白质设计:AlphaFold 和 RoseTTAFold 作为蛋白质结构基础模型。
Cold Spring Harb Perspect Biol. 2024 Jul 1;16(7):a041472. doi: 10.1101/cshperspect.a041472.
7
Proteins of well-defined structures can be designed without backbone readjustment by a statistical model.具有明确结构的蛋白质可以通过统计模型进行设计,而无需对主链进行重新调整。
J Struct Biol. 2016 Dec;196(3):350-357. doi: 10.1016/j.jsb.2016.08.002. Epub 2016 Aug 11.
8
Protein structure determination using metagenome sequence data.利用宏基因组序列数据进行蛋白质结构测定。
Science. 2017 Jan 20;355(6322):294-298. doi: 10.1126/science.aah4043.
9
The 6th Computational Structural Bioinformatics Workshop.第六届计算结构生物信息学研讨会
BMC Struct Biol. 2013;13 Suppl 1(Suppl 1):I1. doi: 10.1186/1472-6807-13-S1-I1. Epub 2013 Nov 8.
10
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

引用本文的文献

1
ProDualNet: dual-target protein sequence design method based on protein language model and structure model.ProDualNet:基于蛋白质语言模型和结构模型的双靶点蛋白质序列设计方法。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf391.
2
Modification and applications of glucose oxidase: optimization strategies and high-throughput screening technologies.葡萄糖氧化酶的修饰与应用:优化策略及高通量筛选技术
World J Microbiol Biotechnol. 2025 Jul 12;41(7):266. doi: 10.1007/s11274-025-04475-8.
3
De Novo Design of Large Polypeptides Using a Lightweight Diffusion Model Integrating LSTM and Attention Mechanism Under Per-Residue Secondary Structure Constraints.

本文引用的文献

1
Rotamer-free protein sequence design based on deep learning and self-consistency.基于深度学习和自一致性的无旋转异构体蛋白质序列设计
Nat Comput Sci. 2022 Jul;2(7):451-462. doi: 10.1038/s43588-022-00273-6. Epub 2022 Jul 21.
2
Score-based generative modeling for de novo protein design.基于得分的从头蛋白质设计生成模型。
Nat Comput Sci. 2023 May;3(5):382-392. doi: 10.1038/s43588-023-00440-3. Epub 2023 May 4.
3
Illuminating protein space with a programmable generative model.用可编程生成模型照亮蛋白质空间。
基于每个残基二级结构约束条件,利用整合长短期记忆网络(LSTM)和注意力机制的轻量级扩散模型进行大型多肽的从头设计。
Molecules. 2025 Feb 28;30(5):1116. doi: 10.3390/molecules30051116.
4
Characterization of PRDM9 Multifunctionality in Yak Testes Through Protein Interaction Mapping.通过蛋白质相互作用图谱对牦牛睾丸中PRDM9多功能性的表征
Int J Mol Sci. 2025 Feb 8;26(4):1420. doi: 10.3390/ijms26041420.
5
Machine learning in molecular biophysics: Protein allostery, multi-level free energy simulations, and lipid phase transitions.分子生物物理学中的机器学习:蛋白质别构、多级自由能模拟和脂质相变
Biophys Rev (Melville). 2025 Feb 12;6(1):011305. doi: 10.1063/5.0248589. eCollection 2025 Mar.
6
FoldMark: Safeguarding Protein Structure Generative Models with Distributional and Evolutionary Watermarking.FoldMark:通过分布和进化水印保护蛋白质结构生成模型
bioRxiv. 2025 Jun 4:2024.10.23.619960. doi: 10.1101/2024.10.23.619960.
Nature. 2023 Nov;623(7989):1070-1078. doi: 10.1038/s41586-023-06728-8. Epub 2023 Nov 15.
4
De novo design of protein structure and function with RFdiffusion.利用 RFdiffusion 从头设计蛋白质结构和功能。
Nature. 2023 Aug;620(7976):1089-1100. doi: 10.1038/s41586-023-06415-8. Epub 2023 Jul 11.
5
De novo design of protein interactions with learned surface fingerprints.从头设计具有学习到的表面指纹的蛋白质相互作用。
Nature. 2023 May;617(7959):176-184. doi: 10.1038/s41586-023-05993-x. Epub 2023 Apr 26.
6
De novo design of luciferases using deep learning.利用深度学习进行荧光素酶的从头设计。
Nature. 2023 Feb;614(7949):774-780. doi: 10.1038/s41586-023-05696-3. Epub 2023 Feb 22.
7
Scaffolding protein functional sites using deep learning.利用深度学习构建支架蛋白功能位点。
Science. 2022 Jul 22;377(6604):387-394. doi: 10.1126/science.abn2100. Epub 2022 Jul 21.
8
Ig-VAE: Generative modeling of protein structure by direct 3D coordinate generation.Ig-VAE:通过直接 3D 坐标生成对蛋白质结构进行生成式建模。
PLoS Comput Biol. 2022 Jun 27;18(6):e1010271. doi: 10.1371/journal.pcbi.1010271. eCollection 2022 Jun.
9
A backbone-centred energy function of neural networks for protein design.基于神经网络的蛋白质设计中心骨干能量函数。
Nature. 2022 Feb;602(7897):523-528. doi: 10.1038/s41586-021-04383-5. Epub 2022 Feb 9.
10
De novo protein design by deep network hallucination.基于深度网络幻觉的从头设计蛋白质。
Nature. 2021 Dec;600(7889):547-552. doi: 10.1038/s41586-021-04184-w. Epub 2021 Dec 1.