Deng Haiyou, Jia Ya, Zhang Yang
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA, Department of Physics and Institute of Biophysics, Central China Normal University, Wuhan 430079, China and.
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA.
Bioinformatics. 2016 Feb 1;32(3):378-87. doi: 10.1093/bioinformatics/btv601. Epub 2015 Oct 14.
Computationally generated non-native protein structure conformations (or decoys) are often used for designing protein folding simulation methods and force fields. However, almost all the decoy sets currently used in literature suffer from uneven root mean square deviation (RMSD) distribution with bias to non-protein like hydrogen-bonding and compactness patterns. Meanwhile, most protein decoy sets are pre-calculated and there is a lack of methods for automated generation of high-quality decoys for any target proteins.
We developed a new algorithm, 3DRobot, to create protein structure decoys by free fragment assembly with enhanced hydrogen-bonding and compactness interactions. The method was benchmarked with three widely used decoy sets from ab initio folding and comparative modeling simulations. The decoys generated by 3DRobot are shown to have significantly enhanced diversity and evenness with a continuous distribution in the RMSD space. The new energy terms introduced in 3DRobot improve the hydrogen-bonding network and compactness of decoys, which eliminates the possibility of native structure recognition by trivial potentials. Algorithms that can automatically create such diverse and well-packed non-native conformations from any protein structure should have a broad impact on the development of advanced protein force field and folding simulation methods. AVAILIABLITY AND IMPLEMENTATION: http://zhanglab.ccmb.med.umich.edu/3DRobot/
jiay@phy.ccnu.edu.cn; zhng@umich.edu
Supplementary data are available at Bioinformatics online.
通过计算生成的非天然蛋白质结构构象(或诱饵)常用于设计蛋白质折叠模拟方法和力场。然而,目前文献中使用的几乎所有诱饵集均存在均方根偏差(RMSD)分布不均匀的问题,且偏向于非蛋白质样的氢键和紧密模式。同时,大多数蛋白质诱饵集是预先计算的,并且缺乏为任何目标蛋白质自动生成高质量诱饵的方法。
我们开发了一种新算法3DRobot,通过具有增强氢键和紧密相互作用的自由片段组装来创建蛋白质结构诱饵。该方法用来自从头折叠和比较建模模拟的三个广泛使用的诱饵集进行了基准测试。结果表明,3DRobot生成的诱饵在RMSD空间中具有显著增强的多样性和均匀性以及连续分布。3DRobot中引入的新能量项改善了诱饵的氢键网络和紧密性,消除了通过简单势识别天然结构的可能性。能够从任何蛋白质结构自动创建这种多样且紧密堆积的非天然构象的算法,应该会对先进蛋白质力场和折叠模拟方法的发展产生广泛影响。
http://zhanglab.ccmb.med.umich.edu/3DRobot/
jiay@phy.ccnu.edu.cn;zhng@umich.edu
补充数据可在《生物信息学》在线获取。