• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的新型非线性基于知识的平均力势。

Novel nonlinear knowledge-based mean force potentials based on machine learning.

机构信息

Shanghai Key Lab of Intelligent Information Processing and the School of Computer Science, Fudan University, Old Yifu Building, Room 202-5, 220 Handan Road, Shanhai 200433, China.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2011 Mar-Apr;8(2):476-86. doi: 10.1109/TCBB.2010.86.

DOI:10.1109/TCBB.2010.86
PMID:20820079
Abstract

The prediction of 3D structures of proteins from amino acid sequences is one of the most challenging problems in molecular biology. An essential task for solving this problem with coarse-grained models is to deduce effective interaction potentials. The development and evaluation of new energy functions is critical to accurately modeling the properties of biological macromolecules. Knowledge-based mean force potentials are derived from statistical analysis of proteins of known structures. Current knowledge-based potentials are almost in the form of weighted linear sum of interaction pairs. In this study, a class of novel nonlinear knowledge-based mean force potentials is presented. The potential parameters are obtained by nonlinear classifiers, instead of relative frequencies of interaction pairs against a reference state or linear classifiers. The support vector machine is used to derive the potential parameters on data sets that contain both native structures and decoy structures. Five knowledge-based mean force Boltzmann-based or linear potentials are introduced and their corresponding nonlinear potentials are implemented. They are the DIH potential (single-body residue-level Boltzmann-based potential), the DFIRE-SCM potential (two-body residue-level Boltzmann-based potential), the FS potential (two-body atom-level Boltzmann-based potential), the HR potential (two-body residue-level linear potential), and the T32S3 potential (two-body atom-level linear potential). Experiments are performed on well-established decoy sets, including the LKF data set, the CASP7 data set, and the Decoys “R”Us data set. The evaluation metrics include the energy Z score and the ability of each potential to discriminate native structures from a set of decoy structures. Experimental results show that all nonlinear potentials significantly outperform the corresponding Boltzmann-based or linear potentials, and the proposed discriminative framework is effective in developing knowledge-based mean force potentials. The nonlinear potentials can be widely used for ab initio protein structure prediction, model quality assessment, protein docking, and other challenging problems in computational biology.

摘要

从氨基酸序列预测蛋白质的 3D 结构是分子生物学中最具挑战性的问题之一。使用粗粒度模型解决此问题的一个基本任务是推导出有效的相互作用势。开发和评估新的能量函数对于准确建模生物大分子的性质至关重要。基于知识的平均力势是从已知结构的蛋白质的统计分析中得出的。目前基于知识的势几乎都是相互作用对相对于参考状态的加权线性和的形式。在这项研究中,提出了一类新的非线性基于知识的平均力势。势参数是通过非线性分类器而不是相互作用对的相对频率或线性分类器获得的。支持向量机用于从包含天然结构和诱饵结构的数据集中推导出势参数。介绍了五个基于知识的平均力 Boltzmann 或线性势,并实现了它们对应的非线性势。它们是 DIH 势(单体重组水平 Boltzmann 势)、DFIRE-SCM 势(双体重组水平 Boltzmann 势)、FS 势(双体重组原子水平 Boltzmann 势)、HR 势(双体重组水平线性势)和 T32S3 势(双体重组原子水平线性势)。在包括 LKF 数据集、CASP7 数据集和 Decoys “R”Us 数据集在内的成熟的诱饵集上进行了实验。评估指标包括能量 Z 得分和每种势区分天然结构和一组诱饵结构的能力。实验结果表明,所有非线性势都明显优于相应的 Boltzmann 势或线性势,并且所提出的判别框架在开发基于知识的平均力势方面是有效的。非线性势可广泛用于从头蛋白质结构预测、模型质量评估、蛋白质对接和计算生物学中的其他挑战性问题。

相似文献

1
Novel nonlinear knowledge-based mean force potentials based on machine learning.基于机器学习的新型非线性基于知识的平均力势。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Mar-Apr;8(2):476-86. doi: 10.1109/TCBB.2010.86.
2
An accurate, residue-level, pair potential of mean force for folding and binding based on the distance-scaled, ideal-gas reference state.一种基于距离缩放的理想气体参考态的、用于折叠和结合的精确到残基水平的平均力对势。
Protein Sci. 2004 Feb;13(2):400-11. doi: 10.1110/ps.03348304.
3
Novel knowledge-based mean force potential at the profile level.轮廓水平上基于新知识的平均力势。
BMC Bioinformatics. 2006 Jun 27;7:324. doi: 10.1186/1471-2105-7-324.
4
How well can we predict native contacts in proteins based on decoy structures and their energies?基于诱饵结构及其能量,我们能多准确地预测蛋白质中的天然接触点?
Proteins. 2003 Sep 1;52(4):598-608. doi: 10.1002/prot.10444.
5
Another look at the conditions for the extraction of protein knowledge-based potentials.再探基于蛋白质知识的势场提取条件。
Proteins. 2009 Jul;76(1):72-85. doi: 10.1002/prot.22320.
6
SVR_CAF: an integrated score function for detecting native protein structures among decoys.SVR_CAF:一种用于在诱饵中检测天然蛋白质结构的综合评分函数。
Proteins. 2014 Apr;82(4):556-64. doi: 10.1002/prot.24421. Epub 2013 Oct 17.
7
Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction.距离缩放的有限理想气体参考态改善了用于结构选择和稳定性预测的基于结构的平均力势。
Protein Sci. 2002 Nov;11(11):2714-26. doi: 10.1110/ps.0217002.
8
On the importance of the distance measures used to train and test knowledge-based potentials for proteins.论用于训练和测试基于知识的蛋白质势的距离度量的重要性。
PLoS One. 2014 Nov 20;9(11):e109335. doi: 10.1371/journal.pone.0109335. eCollection 2014.
9
A global machine learning based scoring function for protein structure prediction.一种基于全局机器学习的蛋白质结构预测评分函数。
Proteins. 2014 May;82(5):752-9. doi: 10.1002/prot.24454. Epub 2013 Nov 22.
10
A distance-dependent atomic knowledge-based potential and force for discrimination of native structures from decoys.一种基于原子知识的距离相关势能和力,用于从诱饵结构中区分天然结构。
Proteins. 2009 Nov 1;77(2):454-63. doi: 10.1002/prot.22457.

引用本文的文献

1
MQAPRank: improved global protein model quality assessment by learning-to-rank.MQAPRank:通过排序学习改进全局蛋白质模型质量评估
BMC Bioinformatics. 2017 May 25;18(1):275. doi: 10.1186/s12859-017-1691-z.
2
Sorting protein decoys by machine-learning-to-rank.基于机器学习排序的蛋白质诱饵分类。
Sci Rep. 2016 Aug 17;6:31571. doi: 10.1038/srep31571.