Suppr超能文献

用于从头蛋白质结构预测的SimFold能量函数:与Rosetta的一致性。

SimFold energy function for de novo protein structure prediction: consensus with Rosetta.

作者信息

Fujitsuka Yoshimi, Chikenji George, Takada Shoji

机构信息

Graduate School of Natural Science and Technology, Kobe University, Kobe, Japan.

出版信息

Proteins. 2006 Feb 1;62(2):381-98. doi: 10.1002/prot.20748.

Abstract

Predicting protein tertiary structures by in silico folding is still very difficult for proteins that have new folds. Here, we developed a coarse-grained energy function, SimFold, for de novo structure prediction, performed a benchmark test of prediction with fragment assembly simulations for 38 test proteins, and proposed consensus prediction with Rosetta. The SimFold energy consists of many terms that take into account solvent-induced effects on the basis of physicochemical consideration. In the benchmark test, SimFold succeeded in predicting native structures within 6.5 A for 12 of 38 proteins; this success rate was the same as that by the publicly available version of Rosetta (ab initio version 1.2) run with default parameters. We investigated which energy terms in SimFold contribute to structure prediction performance, finding that the hydrophobic interaction is the most crucial for the prediction, whereas other sequence-specific terms have weak but positive roles. In the benchmark, well-predicted proteins by SimFold and by Rosetta were not the same for 5 of 12 proteins, which led us to introduce consensus prediction. With combined decoys, we succeeded in prediction for 16 proteins, four more than SimFold or Rosetta separately. For each of 38 proteins, structural ensembles generated by SimFold and by Rosetta were qualitatively compared by mapping sampled structural space onto two dimensions. For proteins of which one of the two methods succeeded and the other failed in prediction, the former had a less scattered ensemble located around the native. For proteins of which both methods succeeded in prediction, often two ensembles were mixed up.

摘要

对于具有新折叠结构的蛋白质,通过计算机模拟折叠来预测其三级结构仍然非常困难。在此,我们开发了一种粗粒度能量函数SimFold用于从头结构预测,对38个测试蛋白质进行了片段组装模拟预测的基准测试,并提出了与Rosetta的共识预测。SimFold能量由许多基于物理化学考虑而考虑溶剂诱导效应的项组成。在基准测试中,SimFold成功地为38个蛋白质中的12个预测了6.5埃以内的天然结构;这一成功率与使用默认参数运行的公开可用版本的Rosetta(从头版本1.2)相同。我们研究了SimFold中的哪些能量项对结构预测性能有贡献,发现疏水相互作用对预测最为关键,而其他序列特异性项虽作用较弱但为正向作用。在基准测试中,SimFold和Rosetta预测良好的蛋白质对于12个蛋白质中的5个并不相同,这促使我们引入共识预测。通过组合诱饵,我们成功地对16个蛋白质进行了预测,比SimFold或Rosetta单独预测的多了4个。对于38个蛋白质中的每一个,通过将采样的结构空间映射到二维上,对SimFold和Rosetta生成的结构集合进行了定性比较。对于两种方法中一种成功而另一种预测失败的蛋白质,前者在天然结构周围的集合分布较不分散。对于两种方法都成功预测的蛋白质,通常两个集合会混在一起。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验