• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepFold:通过优化损失函数、改进模板特征和重新优化能量函数来增强蛋白质结构预测。

DeepFold: enhancing protein structure prediction through optimized loss functions, improved template features, and re-optimized energy function.

机构信息

Department of Computer Science, Hanyang University, Seoul 04763, Korea.

Center for Advanced Computation, Korea Institute for Advanced Study, Seoul 02455, Korea.

出版信息

Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad712.

DOI:10.1093/bioinformatics/btad712
PMID:37995286
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10699847/
Abstract

MOTIVATION

Predicting protein structures with high accuracy is a critical challenge for the broad community of life sciences and industry. Despite progress made by deep neural networks like AlphaFold2, there is a need for further improvements in the quality of detailed structures, such as side-chains, along with protein backbone structures.

RESULTS

Building upon the successes of AlphaFold2, the modifications we made include changing the losses of side-chain torsion angles and frame aligned point error, adding loss functions for side chain confidence and secondary structure prediction, and replacing template feature generation with a new alignment method based on conditional random fields. We also performed re-optimization by conformational space annealing using a molecular mechanics energy function which integrates the potential energies obtained from distogram and side-chain prediction. In the CASP15 blind test for single protein and domain modeling (109 domains), DeepFold ranked fourth among 132 groups with improvements in the details of the structure in terms of backbone, side-chain, and Molprobity. In terms of protein backbone accuracy, DeepFold achieved a median GDT-TS score of 88.64 compared with 85.88 of AlphaFold2. For TBM-easy/hard targets, DeepFold ranked at the top based on Z-scores for GDT-TS. This shows its practical value to the structural biology community, which demands highly accurate structures. In addition, a thorough analysis of 55 domains from 39 targets with publicly available structures indicates that DeepFold shows superior side-chain accuracy and Molprobity scores among the top-performing groups.

AVAILABILITY AND IMPLEMENTATION

DeepFold tools are open-source software available at https://github.com/newtonjoo/deepfold.

摘要

动机

以高精度预测蛋白质结构是生命科学和工业界广大领域面临的一项关键挑战。尽管像 AlphaFold2 这样的深度神经网络取得了进展,但仍需要进一步提高侧链等详细结构以及蛋白质骨架结构的质量。

结果

在 AlphaFold2 成功的基础上,我们进行了以下修改:改变侧链扭转角和框架对齐点误差的损失,添加侧链置信度和二级结构预测的损失函数,并使用基于条件随机场的新对齐方法替代模板特征生成。我们还通过使用基于分布和侧链预测的势函数的分子力学能量函数进行构象空间退火的重新优化。在 CASP15 对单个蛋白质和域建模的盲测(109 个域)中,DeepFold 在 132 个组中排名第四,在骨架、侧链和 Molprobity 方面的结构细节方面有所改进。在蛋白质骨架准确性方面,DeepFold 的中位数 GDT-TS 得分达到 88.64,而 AlphaFold2 为 85.88。对于 TBM-easy/hard 目标,基于 GDT-TS 的 Z 分数,DeepFold 排名第一。这表明它对结构生物学社区具有实际价值,结构生物学社区需要高度准确的结构。此外,对来自 39 个具有公开结构目标的 55 个域的彻底分析表明,DeepFold 在表现最佳的组中显示出卓越的侧链准确性和 Molprobity 分数。

可用性和实现

DeepFold 工具是可在 https://github.com/newtonjoo/deepfold 上获得的开源软件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/9e934203d8f0/btad712f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/97262a00997d/btad712f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/c008eb888eb9/btad712f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/a317bf613296/btad712f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/0eebfebf5b9a/btad712f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/4eceda066226/btad712f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/9e934203d8f0/btad712f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/97262a00997d/btad712f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/c008eb888eb9/btad712f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/a317bf613296/btad712f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/0eebfebf5b9a/btad712f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/4eceda066226/btad712f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4ce/10699847/9e934203d8f0/btad712f6.jpg

相似文献

1
DeepFold: enhancing protein structure prediction through optimized loss functions, improved template features, and re-optimized energy function.DeepFold:通过优化损失函数、改进模板特征和重新优化能量函数来增强蛋白质结构预测。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad712.
2
Fast and accurate Ab Initio Protein structure prediction using deep learning potentials.使用深度学习势能进行快速准确的从头开始蛋白质结构预测。
PLoS Comput Biol. 2022 Sep 16;18(9):e1010539. doi: 10.1371/journal.pcbi.1010539. eCollection 2022 Sep.
3
Learning structural motif representations for efficient protein structure search.学习结构基元表示以实现高效的蛋白质结构搜索。
Bioinformatics. 2018 Sep 1;34(17):i773-i780. doi: 10.1093/bioinformatics/bty585.
4
MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.基于深度学习和距离预测的 MULTICOM2 开源蛋白质结构预测系统。
Sci Rep. 2021 Jun 23;11(1):13155. doi: 10.1038/s41598-021-92395-6.
5
Protein structure modeling for CASP10 by multiple layers of global optimization.通过多层全局优化进行CASP10的蛋白质结构建模。
Proteins. 2014 Feb;82 Suppl 2:188-95. doi: 10.1002/prot.24397. Epub 2013 Oct 24.
6
Template based protein structure modeling by global optimization in CASP11.在蛋白质结构预测技术关键评估第11轮(CASP11)中基于模板的蛋白质结构全局优化建模
Proteins. 2016 Sep;84 Suppl 1:221-32. doi: 10.1002/prot.24917. Epub 2015 Sep 14.
7
SELECTpro: effective protein model selection using a structure-based energy function resistant to BLUNDERs.SELECTpro:使用基于结构的抗BLUNDERs能量函数进行有效的蛋白质模型选择。
BMC Struct Biol. 2008 Dec 3;8:52. doi: 10.1186/1472-6807-8-52.
8
High accuracy template based modeling by global optimization.通过全局优化实现基于高精度模板的建模。
Proteins. 2007;69 Suppl 8:83-9. doi: 10.1002/prot.21628.
9
Assessment of template-based modeling of protein structure in CASP11.CASP11中基于模板的蛋白质结构建模评估。
Proteins. 2016 Sep;84 Suppl 1(Suppl 1):200-20. doi: 10.1002/prot.25049. Epub 2016 Jun 15.
10
Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15.在 CASP15 中,通过深度学习、线程比对和多 MSAs 策略,实现高质量蛋白质单体和复合物结构预测。
Proteins. 2023 Dec;91(12):1684-1703. doi: 10.1002/prot.26585. Epub 2023 Aug 31.

引用本文的文献

1
Computational and Experimental Study of Metal-Organic Frameworks (MOFs) as Antimicrobial Agents against .金属有机框架材料(MOFs)作为抗菌剂的计算与实验研究
ACS Appl Mater Interfaces. 2025 Apr 9;17(14):20628-20646. doi: 10.1021/acsami.4c15851. Epub 2025 Mar 27.
2
Novel lineage of anelloviruses with large genomes identified in dolphins.在海豚中鉴定出具有大基因组的新型环曲病毒谱系。
J Virol. 2025 Jan 31;99(1):e0137024. doi: 10.1128/jvi.01370-24. Epub 2024 Dec 12.
3
Easy and accurate protein structure prediction using ColabFold.

本文引用的文献

1
OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization.OpenFold:重新训练 AlphaFold2 可深入了解其学习机制和泛化能力。
Nat Methods. 2024 Aug;21(8):1514-1524. doi: 10.1038/s41592-024-02272-z. Epub 2024 May 14.
2
Improving AlphaFold2-based protein tertiary structure prediction with MULTICOM in CASP15.在蛋白质结构预测关键评估第15轮(CASP15)中使用MULTICOM改进基于AlphaFold2的蛋白质三级结构预测
Commun Chem. 2023 Sep 7;6(1):188. doi: 10.1038/s42004-023-00991-6.
3
Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15.
使用ColabFold进行简单而准确的蛋白质结构预测。
Nat Protoc. 2025 Mar;20(3):620-642. doi: 10.1038/s41596-024-01060-5. Epub 2024 Oct 14.
4
Natural history of eukaryotic DNA viruses with double jelly-roll major capsid proteins.双发夹状主要衣壳蛋白真核 DNA 病毒的自然史。
Proc Natl Acad Sci U S A. 2024 Jun 4;121(23):e2405771121. doi: 10.1073/pnas.2405771121. Epub 2024 May 28.
5
Natural history of eukaryotic DNA viruses with double jelly-roll major capsid proteins.具有双果冻卷主要衣壳蛋白的真核DNA病毒的自然史。
bioRxiv. 2024 Mar 18:2024.03.18.585575. doi: 10.1101/2024.03.18.585575.
6
PS-GO parametric protein search engine.PS-GO参数化蛋白质搜索引擎。
Comput Struct Biotechnol J. 2024 Apr 8;23:1499-1509. doi: 10.1016/j.csbj.2024.04.003. eCollection 2024 Dec.
在 CASP15 中,通过深度学习、线程比对和多 MSAs 策略,实现高质量蛋白质单体和复合物结构预测。
Proteins. 2023 Dec;91(12):1684-1703. doi: 10.1002/prot.26585. Epub 2023 Aug 31.
4
Improved protein structure prediction with trRosettaX2, AlphaFold2, and optimized MSAs in CASP15.利用 trRosettaX2、AlphaFold2 和优化的 MSAs 在 CASP15 中提高蛋白质结构预测。
Proteins. 2023 Dec;91(12):1704-1711. doi: 10.1002/prot.26570. Epub 2023 Aug 10.
5
Improving protein structure prediction with extended sequence similarity searches and deep-learning-based refinement in CASP15.在 CASP15 中使用扩展序列相似性搜索和基于深度学习的精修来改进蛋白质结构预测。
Proteins. 2023 Dec;91(12):1712-1723. doi: 10.1002/prot.26551. Epub 2023 Jul 24.
6
De novo design of protein structure and function with RFdiffusion.利用 RFdiffusion 从头设计蛋白质结构和功能。
Nature. 2023 Aug;620(7976):1089-1100. doi: 10.1038/s41586-023-06415-8. Epub 2023 Jul 11.
7
Prediction of protein structures, functions and interactions using the IntFOLD7, MultiFOLD and ModFOLDdock servers.使用 IntFOLD7、MultiFOLD 和 ModFOLDdock 服务器预测蛋白质结构、功能和相互作用。
Nucleic Acids Res. 2023 Jul 5;51(W1):W274-W280. doi: 10.1093/nar/gkad297.
8
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
9
MGnify: the microbiome sequence data analysis resource in 2023.MGnify:2023 年的微生物组序列数据分析资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D753-D759. doi: 10.1093/nar/gkac1080.
10
Improving protein structure prediction using templates and sequence embedding.利用模板和序列嵌入改进蛋白质结构预测。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac723.