Fernández Michael, Caballero Julio, Fernández Leyden, Abreu José Ignacio, Garriga Miguel
Molecular Modeling Group, Center for Biotechnological Studies, Faculty of Agronomy, University of Matanzas, 44740 Matanzas, Cuba.
J Mol Graph Model. 2007 Nov;26(4):748-59. doi: 10.1016/j.jmgm.2007.04.011. Epub 2007 May 3.
Development of novel computational approaches for modeling protein properties is a main goal in applied Proteomics. In this work, we reported the extension of the radial distribution function (RDF) scores formalism to proteins for encoding 3D structural information with modeling purposes. Protein-RDF (P-RDF) scores measure spherical distributions on protein 3D structure of 48 amino acids/residues properties selected from the AAindex data base. P-RDF scores were tested for building predictive models of the change of thermal unfolding Gibbs free energy change (DeltaDeltaG) of chymotrypsin inhibitor 2 upon mutations. In this sense, an ensemble of Bayesian-Regularized Genetic Neural Networks (BRGNNs) yielded an optimum nonlinear model for the conformational stability. The ensemble predictor described about 84% and 70% variance of the data in training and test sets, respectively.
开发用于蛋白质特性建模的新型计算方法是应用蛋白质组学的主要目标。在这项工作中,我们报告了将径向分布函数(RDF)评分形式扩展到蛋白质,以便在建模时编码三维结构信息。蛋白质-RDF(P-RDF)评分用于测量从AAindex数据库中选取的48种氨基酸/残基特性在蛋白质三维结构上的球形分布。测试了P-RDF评分用于构建胰凝乳蛋白酶抑制剂2突变时热解折叠吉布斯自由能变化(DeltaDeltaG)变化的预测模型。从这个意义上说,一组贝叶斯正则化遗传神经网络(BRGNNs)产生了一个用于构象稳定性的最优非线性模型。该集成预测器分别描述了训练集和测试集中约84%和70%的数据方差。