面向蛋白质侧链堆积的准确性和速度：构象文库的系统研究。

Toward the Accuracy and Speed of Protein Side-Chain Packing: A Systematic Study on Rotamer Libraries.

出版信息

J Chem Inf Model. 2020 Jan 27;60(1):410-420. doi: 10.1021/acs.jcim.9b00812. Epub 2019 Dec 31.

DOI:10.1021/acs.jcim.9b00812

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7938712/

Abstract

Protein rotamers refer to the conformational isomers taken by the side-chains of amino acids to accommodate specific structural folding environments. Since accurate modeling of atomic interactions is difficult, rotamer information collected from experimentally solved protein structures is often used to guide side-chain packing in protein folding and sequence design studies. Many rotamer libraries have been built in the literature but there is little quantitative guidance on which libraries should be chosen for different structural modeling studies. Here, we performed a comparative study of six widely used rotamer libraries and systematically examined their suitability for protein folding and sequence design in four aspects: (1) side-chain match accuracy, (2) side-chain conformation prediction, (3) protein sequence design, and (4) computational time cost. We demonstrated that, compared to the backbone-dependent rotamer libraries (BBDRLs), the backbone-independent rotamer libraries (BBIRLs) generated conformations that more closely matched the native conformations due to the larger number of rotamers in the local rotamer search spaces. However, more practically, using an optimized physical energy function incorporated into a simulated annealing Monte Carlo searching scheme, we showed that utilization of the BBDRLs could result in higher accuracies in side-chain prediction and higher sequence recapitulation rates in protein design experiments. Detailed data analyses showed that the major advantage of BBDRLs lies in the energy term derived from the rotamer probabilities that are associated with the individual backbone torsion angle subspaces. This term is important for distinguishing between amino acid identities as well as the rotamer conformations of an amino acid. Meanwhile, the backbone torsion angle subspace-specific rotamer search drastically speeds up the searching time, despite the significantly larger number of total rotamers in the BBDRLs. These results should provide important guidance for the development and selection of rotamer libraries for practical protein design and structure prediction studies.

摘要

蛋白质构象异构体是指氨基酸侧链采取的构象异构，以适应特定的结构折叠环境。由于原子相互作用的精确建模较为困难，因此经常使用从实验解决的蛋白质结构中收集的构象异构体信息来指导蛋白质折叠和序列设计研究中的侧链包装。文献中已经构建了许多构象异构体库，但对于不同的结构建模研究应该选择哪些库，几乎没有定量的指导。在这里，我们对六种广泛使用的构象异构体库进行了比较研究，并从四个方面系统地检查了它们在蛋白质折叠和序列设计中的适用性：（1）侧链匹配精度，（2）侧链构象预测，（3）蛋白质序列设计和（4）计算时间成本。我们证明，与依赖于骨架的构象异构体库（BBDRLs）相比，由于局部构象异构体搜索空间中的构象异构体数量更多，独立于骨架的构象异构体库（BBIRLs）生成的构象更接近天然构象。然而，更实际的是，通过使用优化的物理能量函数和模拟退火蒙特卡罗搜索方案，我们表明，在蛋白质设计实验中，使用 BBDRL 可以提高侧链预测的准确性和更高的序列再现率。详细的数据分析表明，BBDRL 的主要优势在于与单个骨架扭转角子空间相关的构象异构体概率衍生的能量项。该术语对于区分氨基酸身份以及氨基酸的构象异构体非常重要。同时，骨架扭转角子空间特定的构象异构体搜索尽管 BBDRL 中的总构象异构体数量明显增加，但大大加快了搜索时间。这些结果应为实际的蛋白质设计和结构预测研究中构象异构体库的开发和选择提供重要指导。

相似文献

Toward the Accuracy and Speed of Protein Side-Chain Packing: A Systematic Study on Rotamer Libraries.

J Chem Inf Model. 2020 Jan 27;60(1):410-420. doi: 10.1021/acs.jcim.9b00812. Epub 2019 Dec 31.

A protein-dependent side-chain rotamer library.

BMC Bioinformatics. 2011 Dec 14;12 Suppl 14(Suppl 14):S10. doi: 10.1186/1471-2105-12-S14-S10.

Rotamer libraries and probabilities of transition between rotamers for the side chains in protein-protein binding.

Proteins. 2012 Aug;80(8):2089-98. doi: 10.1002/prot.24103. Epub 2012 Jun 12.

A backbone-dependent rotamer library with high (ϕ, ψ) coverage using metadynamics simulations.

Protein Sci. 2022 Dec;31(12):e4491. doi: 10.1002/pro.4491.

Incorporating knowledge-based biases into an energy-based side-chain modeling method: application to comparative modeling of protein structure.

Biopolymers. 2001 Aug;59(2):72-86. doi: 10.1002/1097-0282(200108)59:2<72::AID-BIP1007>3.0.CO;2-S.

Rotamers: to be or not to be? An analysis of amino acid side-chain conformations in globular proteins.

J Mol Biol. 1993 Mar 20;230(2):592-612. doi: 10.1006/jmbi.1993.1172.

Design of a rotamer library for coarse-grained models in protein-folding simulations.

J Chem Inf Model. 2014 Jan 27;54(1):302-13. doi: 10.1021/ci4005833. Epub 2013 Dec 31.

Advantages of fine-grained side chain conformer libraries.

Protein Eng. 2003 Dec;16(12):963-9. doi: 10.1093/protein/gzg143.

A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions.

Structure. 2011 Jun 8;19(6):844-58. doi: 10.1016/j.str.2011.03.019.

Exploiting Sequence-Dependent Rotamer Information in Global Optimization of Proteins.

J Phys Chem B. 2022 Oct 27;126(42):8381-8390. doi: 10.1021/acs.jpcb.2c04647. Epub 2022 Oct 18.

引用本文的文献

To pack or not to pack: revisiting protein side-chain packing in the post-AlphaFold era.

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf297.

To pack or not to pack: revisiting protein side-chain packing in the post-AlphaFold era.

bioRxiv. 2025 Feb 27:2025.02.22.639681. doi: 10.1101/2025.02.22.639681.

Amino-Acid Characteristics in Protein Native State Structures.

Biomolecules. 2024 Jul 7;14(7):805. doi: 10.3390/biom14070805.

Invariant point message passing for protein side chain packing.

Proteins. 2024 Oct;92(10):1220-1233. doi: 10.1002/prot.26705. Epub 2024 May 24.

Invariant point message passing for protein side chain packing.

bioRxiv. 2023 Dec 21:2023.08.03.551328. doi: 10.1101/2023.08.03.551328.

A feature engineering-based machine learning technique to detect and classify lung and colon cancer from histopathological images.

Med Biol Eng Comput. 2024 Mar;62(3):913-924. doi: 10.1007/s11517-023-02984-y. Epub 2023 Dec 13.

Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs.

J Chem Inf Model. 2023 Jul 24;63(14):4405-4422. doi: 10.1021/acs.jcim.3c00134. Epub 2023 Jul 6.

Decoding CRISPR-Cas PAM recognition with UniDesign.

Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad133.

GeoPacker: A novel deep learning framework for protein side-chain modeling.

Protein Sci. 2022 Dec;31(12):e4484. doi: 10.1002/pro.4484.

Exploiting Sequence-Dependent Rotamer Information in Global Optimization of Proteins.

J Phys Chem B. 2022 Oct 27;126(42):8381-8390. doi: 10.1021/acs.jpcb.2c04647. Epub 2022 Oct 18.

本文引用的文献

EvoEF2: accurate and fast energy function for computational protein design.

Bioinformatics. 2020 Feb 15;36(4):1135-1142. doi: 10.1093/bioinformatics/btz740.

EvoDesign: Designing Protein-Protein Binding Interactions Using Evolutionary Interface Profiles in Conjunction with an Optimized Physical Energy Function.

J Mol Biol. 2019 Jun 14;431(13):2467-2476. doi: 10.1016/j.jmb.2019.02.028. Epub 2019 Mar 7.

The Rosetta All-Atom Energy Function for Macromolecular Modeling and Design.

J Chem Theory Comput. 2017 Jun 13;13(6):3031-3048. doi: 10.1021/acs.jctc.7b00125. Epub 2017 May 12.

Quantifying side-chain conformational variations in protein structure.

Sci Rep. 2016 Nov 15;6:37024. doi: 10.1038/srep37024.

Simultaneous Optimization of Biomolecular Energy Functions on Features from Small Molecules and Macromolecules.

J Chem Theory Comput. 2016 Dec 13;12(12):6201-6212. doi: 10.1021/acs.jctc.6b00819. Epub 2016 Nov 7.

Protein side-chain packing problem: is there still room for improvement?

Brief Bioinform. 2017 Nov 1;18(6):1033-1043. doi: 10.1093/bib/bbw079.

Use of an Improved Matching Algorithm to Select Scaffolds for Enzyme Design Based on a Complex Active Site Model.

PLoS One. 2016 May 31;11(5):e0156559. doi: 10.1371/journal.pone.0156559. eCollection 2016.

Computational design of enzyme-ligand binding using a combined energy function and deterministic sequence optimization algorithm.

J Mol Model. 2015 Aug;21(8):191. doi: 10.1007/s00894-015-2742-x. Epub 2015 Jul 11.

Combined covalent-electrostatic model of hydrogen bonding improves structure prediction with Rosetta.

J Chem Theory Comput. 2015 Feb 10;11(2):609-22. doi: 10.1021/ct500864r.

The I-TASSER Suite: protein structure and function prediction.

Nat Methods. 2015 Jan;12(1):7-8. doi: 10.1038/nmeth.3213.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

面向蛋白质侧链堆积的准确性和速度：构象文库的系统研究。

Toward the Accuracy and Speed of Protein Side-Chain Packing: A Systematic Study on Rotamer Libraries.

出版信息

J Chem Inf Model. 2020 Jan 27;60(1):410-420. doi: 10.1021/acs.jcim.9b00812. Epub 2019 Dec 31.

DOI:10.1021/acs.jcim.9b00812

PMID:31851497

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7938712/

Abstract

摘要

面向蛋白质侧链堆积的准确性和速度：构象文库的系统研究。

Toward the Accuracy and Speed of Protein Side-Chain Packing: A Systematic Study on Rotamer Libraries.

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

面向蛋白质侧链堆积的准确性和速度：构象文库的系统研究。

Toward the Accuracy and Speed of Protein Side-Chain Packing: A Systematic Study on Rotamer Libraries.

出版信息