• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于β-折叠蛋白质均聚物模型的自适应箱框架搜索方法。

An adaptive bin framework search method for a beta-sheet protein homopolymer model.

作者信息

Shmygelska Alena, Hoos Holger H

机构信息

Department of Structural Biology, Stanford University, Stanford, CA 94305, USA.

出版信息

BMC Bioinformatics. 2007 Apr 24;8:136. doi: 10.1186/1471-2105-8-136.

DOI:10.1186/1471-2105-8-136
PMID:17451609
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1894818/
Abstract

BACKGROUND

The problem of protein structure prediction consists of predicting the functional or native structure of a protein given its linear sequence of amino acids. This problem has played a prominent role in the fields of biomolecular physics and algorithm design for over 50 years. Additionally, its importance increases continually as a result of an exponential growth over time in the number of known protein sequences in contrast to a linear increase in the number of determined structures. Our work focuses on the problem of searching an exponentially large space of possible conformations as efficiently as possible, with the goal of finding a global optimum with respect to a given energy function. This problem plays an important role in the analysis of systems with complex search landscapes, and particularly in the context of ab initio protein structure prediction.

RESULTS

In this work, we introduce a novel approach for solving this conformation search problem based on the use of a bin framework for adaptively storing and retrieving promising locally optimal solutions. Our approach provides a rich and general framework within which a broad range of adaptive or reactive search strategies can be realized. Here, we introduce adaptive mechanisms for choosing which conformations should be stored, based on the set of conformations already stored in memory, and for biasing choices when retrieving conformations from memory in order to overcome search stagnation.

CONCLUSION

We show that our bin framework combined with a widely used optimization method, Monte Carlo search, achieves significantly better performance than state-of-the-art generalized ensemble methods for a well-known protein-like homopolymer model on the face-centered cubic lattice.

摘要

背景

蛋白质结构预测问题包括根据蛋白质的氨基酸线性序列预测其功能结构或天然结构。在过去50多年里,这个问题在生物分子物理学和算法设计领域一直扮演着重要角色。此外,由于已知蛋白质序列数量随时间呈指数增长,而已确定结构的数量呈线性增长,其重要性也在不断增加。我们的工作重点是尽可能高效地搜索指数级庞大的可能构象空间,目标是找到相对于给定能量函数的全局最优解。这个问题在具有复杂搜索格局的系统分析中,特别是在从头开始的蛋白质结构预测背景下,起着重要作用。

结果

在这项工作中,我们引入了一种新颖的方法来解决这个构象搜索问题,该方法基于使用一个箱式框架来自适应地存储和检索有前景的局部最优解。我们的方法提供了一个丰富且通用的框架,在其中可以实现广泛的自适应或反应式搜索策略。在这里,我们引入了自适应机制,用于根据已存储在内存中的构象集选择应存储哪些构象,以及在从内存中检索构象时偏向选择以克服搜索停滞。

结论

我们表明,对于面心立方晶格上的一个著名的类蛋白质均聚物模型,我们的箱式框架与广泛使用的优化方法蒙特卡罗搜索相结合,比现有最先进的广义系综方法具有显著更好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/d19bc1f81873/1471-2105-8-136-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/2dee7958566c/1471-2105-8-136-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/76579e9a4930/1471-2105-8-136-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/1117c1b7ef8f/1471-2105-8-136-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/8c0f1b95137f/1471-2105-8-136-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/0e0f912a479b/1471-2105-8-136-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/20007907c87f/1471-2105-8-136-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/d19bc1f81873/1471-2105-8-136-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/2dee7958566c/1471-2105-8-136-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/76579e9a4930/1471-2105-8-136-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/1117c1b7ef8f/1471-2105-8-136-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/8c0f1b95137f/1471-2105-8-136-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/0e0f912a479b/1471-2105-8-136-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/20007907c87f/1471-2105-8-136-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56bb/1894818/d19bc1f81873/1471-2105-8-136-7.jpg

相似文献

1
An adaptive bin framework search method for a beta-sheet protein homopolymer model.一种用于β-折叠蛋白质均聚物模型的自适应箱框架搜索方法。
BMC Bioinformatics. 2007 Apr 24;8:136. doi: 10.1186/1471-2105-8-136.
2
A replica exchange Monte Carlo algorithm for protein folding in the HP model.用于HP模型中蛋白质折叠的复制交换蒙特卡罗算法。
BMC Bioinformatics. 2007 Sep 17;8:342. doi: 10.1186/1471-2105-8-342.
3
A Novel Method Using Abstract Convex Underestimation in Ab-Initio Protein Structure Prediction for Guiding Search in Conformational Feature Space.一种在从头算蛋白质结构预测中使用抽象凸低估来指导构象特征空间搜索的新方法。
IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):887-900. doi: 10.1109/TCBB.2015.2497226. Epub 2015 Nov 2.
4
Can molecular dynamics simulations help in discriminating correct from erroneous protein 3D models?分子动力学模拟能否有助于区分正确与错误的蛋白质三维模型?
BMC Bioinformatics. 2008 Jan 7;9:6. doi: 10.1186/1471-2105-9-6.
5
On distance and similarity in fold space.关于折叠空间中的距离和相似性。
Bioinformatics. 2008 Mar 15;24(6):872-3. doi: 10.1093/bioinformatics/btn040. Epub 2008 Jan 28.
6
Three-dimensional shape-structure comparison method for protein classification.用于蛋白质分类的三维形状结构比较方法
IEEE/ACM Trans Comput Biol Bioinform. 2006 Jul-Sep;3(3):193-207. doi: 10.1109/TCBB.2006.43.
7
Generalized pattern search algorithm for Peptide structure prediction.用于肽结构预测的广义模式搜索算法。
Biophys J. 2008 Nov 15;95(10):4988-99. doi: 10.1529/biophysj.107.124016. Epub 2008 May 16.
8
A multi-objective evolutionary approach to the protein structure prediction problem.一种用于蛋白质结构预测问题的多目标进化方法。
J R Soc Interface. 2006 Feb 22;3(6):139-51. doi: 10.1098/rsif.2005.0083.
9
Fitting protein chains to cubic lattice is NP-complete.将蛋白质链拟合到立方晶格是NP完全问题。
J Bioinform Comput Biol. 2008 Feb;6(1):93-106. doi: 10.1142/s0219720008003308.
10
PSI: indexing protein structures for fast similarity search.PSI:为快速相似性搜索对蛋白质结构进行索引。
Bioinformatics. 2003;19 Suppl 1:i81-3. doi: 10.1093/bioinformatics/btg1009.

引用本文的文献

1
Understanding the evolutionary trend of intrinsically structural disorders in cancer relevant proteins as probed by Shannon entropy scoring and structure network analysis.理解由香农熵评分和结构网络分析探测到的癌症相关蛋白质中固有结构障碍的进化趋势。
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):549. doi: 10.1186/s12859-018-2552-0.

本文引用的文献

1
Improving protein structure prediction with model-based search.利用基于模型的搜索改进蛋白质结构预测。
Bioinformatics. 2005 Jun;21 Suppl 1:i66-74. doi: 10.1093/bioinformatics/bti1029.
2
Generalized-ensemble algorithms: enhanced sampling techniques for Monte Carlo and molecular dynamics simulations.广义系综算法:蒙特卡罗和分子动力学模拟的增强采样技术
J Mol Graph Model. 2004 May;22(5):425-39. doi: 10.1016/j.jmgm.2003.12.009.
3
Predictions without templates: new folds, secondary structure, and contacts in CASP5.无模板预测:CASP5中的新折叠、二级结构及相互作用
Proteins. 2003;53 Suppl 6:436-56. doi: 10.1002/prot.10546.
4
A minimal physically realistic protein-like lattice model: designing an energy landscape that ensures all-or-none folding to a unique native state.一个最小的物理现实蛋白质样晶格模型:设计一种能量景观,确保全或无折叠到唯一的天然状态。
Biophys J. 2003 Mar;84(3):1518-26. doi: 10.1016/S0006-3495(03)74964-9.
5
Understanding protein folding with energy landscape theory. Part I: Basic concepts.用能量景观理论理解蛋白质折叠。第一部分:基本概念。
Q Rev Biophys. 2002 May;35(2):111-67. doi: 10.1017/s0033583502003761.
6
Local energy landscape flattening: parallel hyperbolic Monte Carlo sampling of protein folding.局部能量景观平坦化:蛋白质折叠的并行双曲蒙特卡罗采样
Proteins. 2002 Aug 1;48(2):192-201. doi: 10.1002/prot.10141.
7
Ab initio protein structure prediction via a combination of threading, lattice folding, clustering, and structure refinement.通过穿线法、晶格折叠、聚类和结构优化相结合的从头算蛋白质结构预测。
Proteins. 2001;Suppl 5:149-56. doi: 10.1002/prot.1172.
8
Predicting novel protein folds by using FRAGFOLD.使用FRAGFOLD预测新型蛋白质折叠。
Proteins. 2001;Suppl 5:127-32. doi: 10.1002/prot.1171.
9
Generalized-ensemble algorithms for molecular simulations of biopolymers.用于生物聚合物分子模拟的广义系综算法。
Biopolymers. 2001;60(2):96-123. doi: 10.1002/1097-0282(2001)60:2<96::AID-BIP1007>3.0.CO;2-F.
10
Ab initio protein structure prediction: progress and prospects.从头算蛋白质结构预测:进展与展望。
Annu Rev Biophys Biomol Struct. 2001;30:173-89. doi: 10.1146/annurev.biophys.30.1.173.