Department of Computer Science, School of Computing, Tokyo Institute of Technology, Ookayama, Meguro-ku, Tokyo, Japan.
PLoS One. 2019 Sep 5;14(9):e0221347. doi: 10.1371/journal.pone.0221347. eCollection 2019.
In protein tertiary structure prediction, model quality assessment programs (MQAPs) are often used to select the final structural models from a pool of candidate models generated by multiple templates and prediction methods. The 3-dimensional convolutional neural network (3DCNN) is an expansion of the 2DCNN and has been applied in several fields, including object recognition. The 3DCNN is also used for MQA tasks, but the performance is low due to several technical limitations related to protein tertiary structures, such as orientation alignment. We proposed a novel single-model MQA method based on local structure quality evaluation using a deep neural network containing 3DCNN layers. The proposed method first assesses the quality of local structures for each residue and then evaluates the quality of whole structures by integrating estimated local qualities. We analyzed the model using the CASP11, CASP12, and 3D-Robot datasets and compared the performance of the model with that of the previous 3DCNN method based on whole protein structures. The proposed method showed a significant improvement compared to the previous 3DCNN method for multiple evaluation measures. We also compared the proposed method to other state-of-the-art methods. Our method showed better performance than the previous 3DCNN-based method and comparable accuracy as the current best single-model methods; particularly, in CASP11 stage2, our method showed a Pearson coefficient of 0.486, which was better than those of the best single-model methods (0.366-0.405). A standalone version of the proposed method and data files are available at https://github.com/ishidalab-titech/3DCNN_MQA.
在蛋白质三级结构预测中,模型质量评估程序 (MQAP) 通常用于从多个模板和预测方法生成的候选模型池中选择最终的结构模型。三维卷积神经网络 (3DCNN) 是二维卷积神经网络的扩展,已应用于多个领域,包括目标识别。3DCNN 也用于 MQA 任务,但由于与蛋白质三级结构相关的几个技术限制,如方向对齐,性能较低。我们提出了一种基于局部结构质量评估的新型单模型 MQA 方法,该方法使用包含 3DCNN 层的深度神经网络。该方法首先评估每个残基的局部结构质量,然后通过整合估计的局部质量来评估整个结构的质量。我们使用 CASP11、CASP12 和 3D-Robot 数据集对模型进行了分析,并将模型的性能与基于整个蛋白质结构的先前 3DCNN 方法进行了比较。与基于整个蛋白质结构的先前 3DCNN 方法相比,该方法在多个评估指标上均有显著提高。我们还将该方法与其他最先进的方法进行了比较。与基于先前 3DCNN 的方法相比,我们的方法表现出更好的性能,与当前最好的单模型方法相当;特别是在 CASP11 stage2 中,我们的方法的 Pearson 系数为 0.486,优于最好的单模型方法(0.366-0.405)。该方法的独立版本和数据文件可在 https://github.com/ishidalab-titech/3DCNN_MQA 上获得。