通过受 RNA 启发的 Ansatz 实现蛋白质环结构的原子精度预测。

Atomic-accuracy prediction of protein loop structures through an RNA-inspired Ansatz.

机构信息

Departments of Biochemistry and Physics, Stanford University, Stanford, California, United States of America.

出版信息

PLoS One. 2013 Oct 21;8(10):e74830. doi: 10.1371/journal.pone.0074830. eCollection 2013.

DOI:10.1371/journal.pone.0074830

PMID:24204571

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3804535/

Abstract

Consistently predicting biopolymer structure at atomic resolution from sequence alone remains a difficult problem, even for small sub-segments of large proteins. Such loop prediction challenges, which arise frequently in comparative modeling and protein design, can become intractable as loop lengths exceed 10 residues and if surrounding side-chain conformations are erased. Current approaches, such as the protein local optimization protocol or kinematic inversion closure (KIC) Monte Carlo, involve stages that coarse-grain proteins, simplifying modeling but precluding a systematic search of all-atom configurations. This article introduces an alternative modeling strategy based on a 'stepwise ansatz', recently developed for RNA modeling, which posits that any realistic all-atom molecular conformation can be built up by residue-by-residue stepwise enumeration. When harnessed to a dynamic-programming-like recursion in the Rosetta framework, the resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12 residue loop at a significant but achievable cost of thousands of CPU-hours. In a previously established benchmark, SWA recovers crystallographic conformations with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC modeling with a comparable expenditure of computational power. Furthermore, SWA gives high accuracy results on an additional set of 15 loops highlighted in the biological literature for their irregularity or unusual length. Successes include cis-Pro touch turns, loops that pass through tunnels of other side-chains, and loops of lengths up to 24 residues. Remaining problem cases are traced to inaccuracies in the Rosetta all-atom energy function. In five additional blind tests, SWA achieves sub-Angstrom accuracy models, including the first such success in a protein/RNA binding interface, the YbxF/kink-turn interaction in the fourth 'RNA-puzzle' competition. These results establish all-atom enumeration as an unusually systematic approach to ab initio protein structure modeling that can leverage high performance computing and physically realistic energy functions to more consistently achieve atomic accuracy.

摘要

仅从序列预测生物聚合物的原子分辨率结构仍然是一个难题，即使对于大型蛋白质的小亚段也是如此。在比较建模和蛋白质设计中经常出现的这种环预测挑战，如果环的长度超过 10 个残基并且周围的侧链构象被擦除，则会变得难以处理。当前的方法，如蛋白质局部优化协议或运动学反转封闭 (KIC) 蒙特卡罗，涉及简化建模但排除所有原子构型系统搜索的蛋白质粗粒化阶段。本文介绍了一种替代建模策略，该策略基于最近为 RNA 建模开发的“逐步假设”，该假设认为任何现实的全原子分子构象都可以通过残基逐步枚举来构建。当与 Rosetta 框架中的类似于动态编程的递归相结合时，所得的逐步组装 (SWA) 协议可以以数千个 CPU 小时的可观但可实现的成本对 12 个残基环进行枚举采样。在之前建立的基准测试中，与 KIC 建模相比，SWA 在 20 个环中恢复了 19 个具有亚埃精度的晶体学构象，而 KIC 建模的 14 个具有可比的计算能力支出。此外，SWA 在生物文献中突出显示的另外 15 个环的额外数据集上给出了高精度结果，这些环因其不规则性或不寻常的长度而引人注目。成功案例包括顺式-Pro 触摸环、穿过其他侧链隧道的环以及长度达 24 个残基的环。剩余的问题案例可追溯到 Rosetta 全原子能量函数的不准确性。在另外五个盲测中，SWA 实现了亚埃精度的模型，包括在蛋白质/RNA 结合界面中的首次成功，以及在第四个“RNA 谜题”竞赛中的 YbxF/kink-turn 相互作用。这些结果确立了全原子枚举作为一种异常系统的从头蛋白质结构建模方法，该方法可以利用高性能计算和物理现实的能量函数更一致地实现原子精度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed39/3804535/fe0feeb0ca87/pone.0074830.g001.jpg

相似文献

Atomic-accuracy prediction of protein loop structures through an RNA-inspired Ansatz.通过受 RNA 启发的 Ansatz 实现蛋白质环结构的原子精度预测。

PLoS One. 2013 Oct 21;8(10):e74830. doi: 10.1371/journal.pone.0074830. eCollection 2013.

An enumerative stepwise ansatz enables atomic-accuracy RNA loop modeling.枚举逐步逼近方法可实现原子精度的 RNA 环建模。

Proc Natl Acad Sci U S A. 2011 Dec 20;108(51):20573-8. doi: 10.1073/pnas.1106516108. Epub 2011 Dec 5.

Ab Initio Prediction of 3-D Conformations for Protein Long Loops with High Accuracy and Applications to Antibody CDRH3 Modeling.高精度从头预测蛋白质长环的三维构象及其在抗体互补决定区3建模中的应用

J Chem Inf Model. 2023 Dec 11;63(23):7568-7577. doi: 10.1021/acs.jcim.3c01051. Epub 2023 Nov 28.

Modeling structurally variable regions in homologous proteins with rosetta.使用Rosetta对同源蛋白中的结构可变区域进行建模。

Proteins. 2004 May 15;55(3):656-77. doi: 10.1002/prot.10629.

A hierarchical approach to all-atom protein loop prediction.一种用于全原子蛋白质环预测的分层方法。

Proteins. 2004 May 1;55(2):351-67. doi: 10.1002/prot.10613.

LEAP: highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains.LEAP：通过整合粗粒采样和优化能量评分，并结合主链和侧链的全原子精修，实现对蛋白质环构象的高精度预测。

J Comput Chem. 2014 Feb 5;35(4):335-41. doi: 10.1002/jcc.23509. Epub 2013 Dec 10.

Toward better refinement of comparative models: predicting loops in inexact environments.迈向比较模型的更好优化：在不精确环境中预测环区

Proteins. 2008 Aug 15;72(3):959-71. doi: 10.1002/prot.21990.

The 6th Computational Structural Bioinformatics Workshop.第六届计算结构生物信息学研讨会

BMC Struct Biol. 2013;13 Suppl 1(Suppl 1):I1. doi: 10.1186/1472-6807-13-S1-I1. Epub 2013 Nov 8.

Improvements to robotics-inspired conformational sampling in rosetta.在 Rosetta 中改进基于机器人灵感的构象采样。

PLoS One. 2013 May 21;8(5):e63090. doi: 10.1371/journal.pone.0063090. Print 2013.

Fast protein loop sampling and structure prediction using distance-guided sequential chain-growth Monte Carlo method.使用距离引导的顺序链增长蒙特卡罗方法进行快速蛋白质环采样和结构预测。

PLoS Comput Biol. 2014 Apr 24;10(4):e1003539. doi: 10.1371/journal.pcbi.1003539. eCollection 2014 Apr.

引用本文的文献

The HIV-1 nuclear export complex reveals the role of RNA in CRM1 cargo recognition.HIV-1核输出复合物揭示了RNA在CRM1货物识别中的作用。

Mol Cell. 2025 Aug 21;85(16):3108-3122.e7. doi: 10.1016/j.molcel.2025.07.015.

Pairing a high-resolution statistical potential with a nucleobase-centric sampling algorithm for improving RNA model refinement.利用高分辨率统计势能与碱基中心采样算法相结合提高 RNA 模型精修。

Nat Commun. 2021 May 13;12(1):2777. doi: 10.1038/s41467-021-23100-4.

Macromolecular modeling and design in Rosetta: recent methods and frameworks.罗塞塔中的大分子建模和设计：最新方法和框架。

Nat Methods. 2020 Jul;17(7):665-680. doi: 10.1038/s41592-020-0848-2. Epub 2020 Jun 1.

RNA-Puzzles Round IV: 3D structure predictions of four ribozymes and two aptamers.RNA 谜题第四轮：四种核酶和两种适体的 3D 结构预测。

RNA. 2020 Aug;26(8):982-995. doi: 10.1261/rna.075341.120. Epub 2020 May 5.

A key interaction with RPA orients XPA in NER complexes.与 RPA 的关键相互作用使 XPA 在 NER 复合物中定向。

Nucleic Acids Res. 2020 Feb 28;48(4):2173-2188. doi: 10.1093/nar/gkz1231.

Computational design of structured loops for new protein functions.用于新蛋白质功能的结构化环的计算设计。

Biol Chem. 2019 Feb 25;400(3):275-288. doi: 10.1515/hsz-2018-0348.

Blind prediction of noncanonical RNA structure at atomic accuracy.原子精度的非规范 RNA 结构的盲预测。

Sci Adv. 2018 May 25;4(5):eaar5316. doi: 10.1126/sciadv.aar5316. eCollection 2018 May.

RosettaAntibodyDesign (RAbD): A general framework for computational antibody design.罗塞塔抗体设计（RAbD）：一种通用的计算抗体设计框架。

PLoS Comput Biol. 2018 Apr 27;14(4):e1006112. doi: 10.1371/journal.pcbi.1006112. eCollection 2018 Apr.

RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme.RNA谜题第三轮：五个核糖开关和一个核酶的三维RNA结构预测

RNA. 2017 May;23(5):655-672. doi: 10.1261/rna.060368.116. Epub 2017 Jan 30.

Accurate Structure Prediction of CDR H3 Loops Enabled by a Novel Structure-Based C-Terminal Constraint.基于新型结构的C端约束实现CDR H3环的精确结构预测。

J Immunol. 2017 Jan 1;198(1):505-515. doi: 10.4049/jimmunol.1601137. Epub 2016 Nov 21.

本文引用的文献

Structure determination of noncanonical RNA motifs guided by ¹H NMR chemical shifts.基于 ¹H NMR 化学位移的非规范 RNA 基序结构测定。

Nat Methods. 2014 Apr;11(4):413-6. doi: 10.1038/nmeth.2876. Epub 2014 Mar 2.

Serverification of molecular modeling applications: the Rosetta Online Server that Includes Everyone (ROSIE).分子建模应用的服务器化：包含每个人的罗塞塔在线服务器（ROSIE）。

PLoS One. 2013 May 22;8(5):e63906. doi: 10.1371/journal.pone.0063906. Print 2013.

T box RNA decodes both the information content and geometry of tRNA to affect gene expression.T 盒 RNA 解码 tRNA 的信息内容和几何形状，以影响基因表达。

Proc Natl Acad Sci U S A. 2013 Apr 30;110(18):7240-5. doi: 10.1073/pnas.1222214110. Epub 2013 Apr 15.

Advances, interactions, and future developments in the CNS, Phenix, and Rosetta structural biology software systems.中枢神经系统、菲尼克斯和罗塞塔结构生物学软件系统的进展、相互作用和未来发展。

Annu Rev Biophys. 2013;42:265-87. doi: 10.1146/annurev-biophys-083012-130253. Epub 2013 Feb 28.

Flexible backbone sampling methods to model and design protein alternative conformations.用于模拟和设计蛋白质替代构象的灵活主链采样方法。

Methods Enzymol. 2013;523:61-85. doi: 10.1016/B978-0-12-394292-0.00004-7.

Correcting pervasive errors in RNA crystallography through enumerative structure prediction.通过枚举结构预测纠正 RNA 晶体学中的普遍错误。

Nat Methods. 2013 Jan;10(1):74-6. doi: 10.1038/nmeth.2262. Epub 2012 Dec 2.

Studying "invisible" excited protein states in slow exchange with a major state conformation.研究与主要构象缓慢交换的“不可见”的激发态蛋白质。

J Am Chem Soc. 2012 May 16;134(19):8148-61. doi: 10.1021/ja3001419. Epub 2012 May 3.

Refinement of protein structure homology models via long, all-atom molecular dynamics simulations.通过长程、全原子分子动力学模拟来完善蛋白质结构同源模型。

Proteins. 2012 Aug;80(8):2071-9. doi: 10.1002/prot.24098. Epub 2012 May 15.

Role of the biomolecular energy gap in protein design, structure, and evolution.生物分子能量间隙在蛋白质设计、结构和进化中的作用。

Cell. 2012 Apr 13;149(2):262-73. doi: 10.1016/j.cell.2012.03.016.

RNA-Puzzles: a CASP-like evaluation of RNA three-dimensional structure prediction.RNA 难题：一种类似于 CASP 的 RNA 三维结构预测评估方法。

RNA. 2012 Apr;18(4):610-25. doi: 10.1261/rna.031054.111. Epub 2012 Feb 23.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过受 RNA 启发的 Ansatz 实现蛋白质环结构的原子精度预测。

Atomic-accuracy prediction of protein loop structures through an RNA-inspired Ansatz.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献