INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France.
INSERM, U 1134, DSIMB, Univ Paris, Univ de La Réunion, Univ des Antilles, F-75739, Paris, France; Institut National de La Transfusion Sanguine (INTS), F-75739, Paris, France; Laboratoire D'Excellence GR-Ex, F-75739, Paris, France; Molecular Graphics Laboratory, Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA.
Biochimie. 2019 Oct;165:150-155. doi: 10.1016/j.biochi.2019.07.025. Epub 2019 Aug 1.
Flexibility is an intrinsic essential feature of protein structures, directly linked to their functions. To this day, most of the prediction methods use the crystallographic data (namely B-factors) as the only indicator of protein's inner flexibility and predicts them as rigid or flexible. PredyFlexy stands differently from other approaches as it relies on the definition of protein flexibility (i) not only taken from crystallographic data, but also (ii) from Root Mean Square Fluctuation (RMSFs) observed in Molecular Dynamics simulations. It also uses a specific representation of protein structures, named Long Structural Prototypes (LSPs). From Position-Specific Scoring Matrix, the 120 LSPs are predicted with a good accuracy and directly used to predict (i) the protein flexibility in three categories (flexible, intermediate and rigid), (ii) the normalized B-factors, (iii) the normalized RMSFs, and (iv) a confidence index. Prediction accuracy among these three classes is equivalent to the best two class prediction methods, while the normalized B-factors and normalized RMSFs have a good correlation with experimental and in silico values. Thus, PredyFlexy is a unique approach, which is of major utility for the scientific community. It support parallelization features and can be run on a local cluster using multiple cores.
蛋白质结构的柔韧性是其内在的基本特征,直接与其功能相关。迄今为止,大多数预测方法都使用晶体学数据(即 B 因子)作为蛋白质内部柔韧性的唯一指标,并将其预测为刚性或柔性。PredyFlexy 与其他方法不同,因为它依赖于对蛋白质柔韧性的定义 (i) 不仅来自晶体学数据,而且还来自分子动力学模拟中观察到的均方根波动 (RMSFs)。它还使用了一种名为长结构原型 (LSP) 的特定蛋白质结构表示形式。从位置特异性评分矩阵中,以较高的准确性预测了 120 个 LSP,并直接用于预测 (i) 蛋白质的三种柔韧性类别(柔性、中间和刚性)、(ii) 归一化 B 因子、(iii) 归一化 RMSFs 和 (iv) 置信指数。这三个类别的预测准确性与最佳的两类预测方法相当,而归一化 B 因子和归一化 RMSFs 与实验值和计算值具有良好的相关性。因此,PredyFlexy 是一种独特的方法,对科学界具有重要的实用价值。它支持并行化功能,可以在本地集群上使用多个核心进行运行。