Selvaraj Joel, Wang Liguo, Cheng Jianlin
Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, United States.
NextGen Precision Health, University of Missouri, Columbia, MO 65211, United States.
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf092.
Cryogenic electron microscopy (cryo-EM) is a core experimental technique used to determine the structure of macromolecules such as proteins. However, the effectiveness of cryo-EM is often hindered by the noise and missing density values in cryo-EM density maps caused by experimental conditions such as low contrast and conformational heterogeneity. Although various global and local map-sharpening techniques are widely employed to improve cryo-EM density maps, it is still challenging to efficiently improve their quality for building better protein structures from them.
In this study, we introduce CryoTEN-a 3D UNETR++ style transformer to improve cryo-EM maps effectively. CryoTEN is trained using a diverse set of 1295 cryo-EM maps as inputs and their corresponding simulated maps generated from known protein structures as targets. An independent test set containing 150 maps is used to evaluate CryoTEN, and the results demonstrate that it can robustly enhance the quality of cryo-EM density maps. In addition, automatic de novo protein structure modeling shows that protein structures built from the density maps processed by CryoTEN have substantially better quality than those built from the original maps. Compared to the existing state-of-the-art deep learning methods for enhancing cryo-EM density maps, CryoTEN ranks second in improving the quality of density maps, while running >10 times faster and requiring much less GPU memory than them.
The source code and data are freely available at https://github.com/jianlin-cheng/cryoten.
低温电子显微镜(cryo-EM)是用于确定蛋白质等大分子结构的核心实验技术。然而,cryo-EM的有效性常常受到诸如低对比度和构象异质性等实验条件导致的cryo-EM密度图中的噪声和缺失密度值的阻碍。尽管各种全局和局部图锐化技术被广泛用于改善cryo-EM密度图,但要从这些图中高效地提高其质量以构建更好的蛋白质结构仍然具有挑战性。
在本研究中,我们引入了CryoTEN——一种3D UNETR++风格的变换器,以有效改善cryo-EM图。CryoTEN使用一组多样的1295张cryo-EM图作为输入,并将从已知蛋白质结构生成的相应模拟图作为目标进行训练。一个包含150张图的独立测试集用于评估CryoTEN,结果表明它可以稳健地提高cryo-EM密度图的质量。此外,自动从头蛋白质结构建模表明,从CryoTEN处理后的密度图构建的蛋白质结构比从原始图构建的蛋白质结构质量要好得多。与现有的用于增强cryo-EM密度图的深度学习方法相比,CryoTEN在提高密度图质量方面排名第二,同时运行速度比它们快10倍以上,并且所需的GPU内存要少得多。