He Jiahua, Huang Sheng-You
School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China.
Bioinformatics. 2021 Oct 25;37(20):3480-3490. doi: 10.1093/bioinformatics/btab357.
Advances in microscopy instruments and image processing algorithms have led to an increasing number of Cryo-electron microscopy (cryo-EM) maps. However, building accurate models for the EM maps at 3-5 Å resolution remains a challenging and time-consuming process. With the rapid growth of deposited EM maps, there is an increasing gap between the maps and reconstructed/modeled three-dimensional (3D) structures. Therefore, automatic reconstruction of atomic-accuracy full-atom structures from EM maps is pressingly needed.
We present a semi-automatic de novo structure determination method using a deep learning-based framework, named as DeepMM, which builds atomic-accuracy all-atom models from cryo-EM maps at near-atomic resolution. In our method, the main-chain and Cα positions as well as their amino acid and secondary structure types are predicted in the EM map using Densely Connected Convolutional Networks. DeepMM was extensively validated on 40 simulated maps at 5 Å resolution and 30 experimental maps at 2.6-4.8 Å resolution as well as an Electron Microscopy Data Bank-wide dataset of 2931 experimental maps at 2.6-4.9 Å resolution, and compared with state-of-the-art algorithms including RosettaES, MAINMAST and Phenix. Overall, our DeepMM algorithm obtained a significant improvement over existing methods in terms of both accuracy and coverage in building full-length protein structures on all test sets, demonstrating the efficacy and general applicability of DeepMM.
http://huanglab.phys.hust.edu.cn/DeepMM.
Supplementary data are available at Bioinformatics online.
显微镜仪器和图像处理算法的进步使得冷冻电子显微镜(cryo-EM)图谱数量不断增加。然而,为3-5埃分辨率的EM图谱构建精确模型仍然是一个具有挑战性且耗时的过程。随着沉积EM图谱的快速增长,图谱与重建/建模的三维(3D)结构之间的差距越来越大。因此,迫切需要从EM图谱中自动重建原子精度的全原子结构。
我们提出了一种基于深度学习框架的半自动从头结构测定方法,名为DeepMM,它可以从近原子分辨率的cryo-EM图谱构建原子精度的全原子模型。在我们的方法中,使用密集连接卷积网络在EM图谱中预测主链和Cα位置以及它们的氨基酸和二级结构类型。DeepMM在40个5埃分辨率的模拟图谱、30个2.6-4.8埃分辨率的实验图谱以及一个包含2931个2.6-4.9埃分辨率实验图谱的电子显微镜数据库全集中进行了广泛验证,并与包括RosettaES、MAINMAST和Phenix在内的现有算法进行了比较。总体而言,我们的DeepMM算法在所有测试集上构建全长蛋白质结构的准确性和覆盖率方面均比现有方法有显著提高,证明了DeepMM的有效性和普遍适用性。
http://huanglab.phys.hust.edu.cn/DeepMM。
补充数据可在《生物信息学》在线获取。