Suppr超能文献

低温电子显微镜/低温电子断层扫描中的精度要求和数据压缩。

Precision requirements and data compression in CryoEM/CryoET.

机构信息

Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, United States.

Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, United States.

出版信息

J Struct Biol. 2022 Sep;214(3):107875. doi: 10.1016/j.jsb.2022.107875. Epub 2022 Jun 17.

Abstract

With larger, higher speed detectors and improved automation, individual CryoEM instruments are capable of producing a prodigious amount of data each day, which must then be stored, processed and archived. While it has become routine to use lossless compression on raw counting-mode movies, the averages which result after correcting these movies no longer compress well. These averages could be considered sufficient for long term archival, yet they are conventionally stored with 32 bits of precision, despite high noise levels. Derived images are similarly stored with excess precision, providing an opportunity to decrease project sizes and improve processing speed. We present a simple argument based on propagation of uncertainty for safe bit truncation of flat-fielded images combined with lossless compression. The same method can be used for most derived images throughout the processing pipeline. We test the proposed strategy on two standard, data-limited CryoEM data sets, demonstrating that these limits are safe for real-world use. We find that 5 bits of precision is sufficient for virtually any raw CryoEM data and that 8-12 bits is sufficient for intermediate averages or final 3-D structures. Additionally, we detail and recommend specific rules for discretization of data as well as a practical compressed data representation that is tuned to the specific needs of CryoEM.

摘要

随着更大、更快的探测器和改进的自动化技术的出现,单个 CryoEM 仪器每天都能够产生大量的数据,这些数据必须进行存储、处理和归档。虽然在原始计数模式电影上使用无损压缩已经成为常规操作,但在纠正这些电影后得到的平均值仍然不能很好地压缩。这些平均值可以被认为足以进行长期存档,但它们通常以 32 位精度存储,尽管噪声水平很高。衍生图像也以多余的精度存储,为减小项目规模和提高处理速度提供了机会。我们提出了一个简单的基于不确定性传播的论点,用于对平场化图像进行安全的比特截断,并结合无损压缩。这种方法可以用于处理管道中的大多数衍生图像。我们在两个标准的、数据受限的 CryoEM 数据集上测试了所提出的策略,证明这些限制在实际使用中是安全的。我们发现,对于几乎任何原始的 CryoEM 数据,5 位精度就足够了,而对于中间平均值或最终的 3D 结构,8-12 位精度就足够了。此外,我们详细介绍并推荐了数据离散化的具体规则,以及一种针对 CryoEM 特定需求进行优化的实用压缩数据表示。

相似文献

1
Precision requirements and data compression in CryoEM/CryoET.
J Struct Biol. 2022 Sep;214(3):107875. doi: 10.1016/j.jsb.2022.107875. Epub 2022 Jun 17.
2
Reducing cryoEM file storage using lossy image formats.
J Struct Biol. 2019 Jul 1;207(1):49-55. doi: 10.1016/j.jsb.2019.04.013. Epub 2019 May 21.
3
Topaz-Denoise: general deep denoising models for cryoEM and cryoET.
Nat Commun. 2020 Oct 15;11(1):5208. doi: 10.1038/s41467-020-18952-1.
4
Learning to automate cryo-electron microscopy data collection with Ptolemy.
IUCrJ. 2023 Jan 1;10(Pt 1):90-102. doi: 10.1107/S2052252522010612.
5
Big data in cryoEM: automated collection, processing and accessibility of EM data.
Curr Opin Microbiol. 2018 Jun;43:1-8. doi: 10.1016/j.mib.2017.10.005. Epub 2017 Oct 31.
6
Lossless Compression on MRI Images Using SWT.
J Digit Imaging. 2014 Oct;27(5):594-600. doi: 10.1007/s10278-014-9697-9.
8
Understanding and controlling the effect of lossy raw data compression on CT images.
Med Phys. 2009 Aug;36(8):3643-53. doi: 10.1118/1.3158738.
9
Using cryoEM and cryoET to visualize membrane penetration of a non-enveloped virus.
STAR Protoc. 2022 Dec 16;3(4):101825. doi: 10.1016/j.xpro.2022.101825. Epub 2022 Nov 11.

引用本文的文献

1
The advent of preventive high-resolution structural histopathology by artificial-intelligence-powered cryogenic electron tomography.
Front Mol Biosci. 2024 May 29;11:1390858. doi: 10.3389/fmolb.2024.1390858. eCollection 2024.

本文引用的文献

1
Electron-event representation data enable efficient cryoEM file storage with full preservation of spatial and temporal resolution.
IUCrJ. 2020 Aug 7;7(Pt 5):860-869. doi: 10.1107/S205225252000929X. eCollection 2020 Sep 1.
2
Reducing cryoEM file storage using lossy image formats.
J Struct Biol. 2019 Jul 1;207(1):49-55. doi: 10.1016/j.jsb.2019.04.013. Epub 2019 May 21.
3
The first single particle analysis Map Challenge: A summary of the assessments.
J Struct Biol. 2018 Nov;204(2):291-300. doi: 10.1016/j.jsb.2018.08.010. Epub 2018 Aug 13.
4
MRCZ - A file format for cryo-TEM data with fast compression.
J Struct Biol. 2018 Mar;201(3):252-257. doi: 10.1016/j.jsb.2017.11.012. Epub 2017 Nov 23.
5
Automated tilt series alignment and tomographic reconstruction in IMOD.
J Struct Biol. 2017 Feb;197(2):102-113. doi: 10.1016/j.jsb.2016.07.011. Epub 2016 Jul 19.
6
Fixed-Rate Compressed Floating-Point Arrays.
IEEE Trans Vis Comput Graph. 2014 Dec;20(12):2674-83. doi: 10.1109/TVCG.2014.2346458.
7
EMRinger: side chain-directed model and map validation for 3D cryo-electron microscopy.
Nat Methods. 2015 Oct;12(10):943-6. doi: 10.1038/nmeth.3541. Epub 2015 Aug 17.
8
2.2 Å resolution cryo-EM structure of β-galactosidase in complex with a cell-permeant inhibitor.
Science. 2015 Jun 5;348(6239):1147-51. doi: 10.1126/science.aab1576. Epub 2015 May 7.
9
MRC2014: Extensions to the MRC format header for electron cryo-microscopy and tomography.
J Struct Biol. 2015 Nov;192(2):146-50. doi: 10.1016/j.jsb.2015.04.002. Epub 2015 Apr 13.
10
Structure of β-galactosidase at 3.2-Å resolution obtained by cryo-electron microscopy.
Proc Natl Acad Sci U S A. 2014 Aug 12;111(32):11709-14. doi: 10.1073/pnas.1402809111. Epub 2014 Jul 28.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验