• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过统计流形学习实现大规模并行无监督单粒子冷冻电镜数据聚类

Massively parallel unsupervised single-particle cryo-EM data clustering via statistical manifold learning.

作者信息

Wu Jiayi, Ma Yong-Bei, Congdon Charles, Brett Bevin, Chen Shuobing, Xu Yaofang, Ouyang Qi, Mao Youdong

机构信息

State Key Laboratory for Artificial Microstructure and Mesoscopic Physics, Institute of Condensed Matter Physics, School of Physics, Center for Quantitative Biology, Peking University, Beijing, China.

Intel Parallel Computing Center for Structural Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America.

出版信息

PLoS One. 2017 Aug 7;12(8):e0182130. doi: 10.1371/journal.pone.0182130. eCollection 2017.

DOI:10.1371/journal.pone.0182130
PMID:28786986
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5546606/
Abstract

Structural heterogeneity in single-particle cryo-electron microscopy (cryo-EM) data represents a major challenge for high-resolution structure determination. Unsupervised classification may serve as the first step in the assessment of structural heterogeneity. However, traditional algorithms for unsupervised classification, such as K-means clustering and maximum likelihood optimization, may classify images into wrong classes with decreasing signal-to-noise-ratio (SNR) in the image data, yet demand increased computational costs. Overcoming these limitations requires further development of clustering algorithms for high-performance cryo-EM data processing. Here we introduce an unsupervised single-particle clustering algorithm derived from a statistical manifold learning framework called generative topographic mapping (GTM). We show that unsupervised GTM clustering improves classification accuracy by about 40% in the absence of input references for data with lower SNRs. Applications to several experimental datasets suggest that our algorithm can detect subtle structural differences among classes via a hierarchical clustering strategy. After code optimization over a high-performance computing (HPC) environment, our software implementation was able to generate thousands of reference-free class averages within hours in a massively parallel fashion, which allows a significant improvement on ab initio 3D reconstruction and assists in the computational purification of homogeneous datasets for high-resolution visualization.

摘要

单颗粒冷冻电子显微镜(cryo-EM)数据中的结构异质性是高分辨率结构测定的一个主要挑战。无监督分类可作为评估结构异质性的第一步。然而,传统的无监督分类算法,如K均值聚类和最大似然优化,可能会在图像数据信噪比(SNR)降低时将图像分类到错误的类别中,同时还需要增加计算成本。克服这些限制需要进一步开发用于高性能冷冻电镜数据处理的聚类算法。在此,我们引入一种源自称为生成地形映射(GTM)的统计流形学习框架的无监督单颗粒聚类算法。我们表明,对于较低信噪比的数据,在没有输入参考的情况下,无监督GTM聚类可将分类准确率提高约40%。对几个实验数据集的应用表明,我们的算法可以通过层次聚类策略检测不同类别之间的细微结构差异。在高性能计算(HPC)环境中进行代码优化后,我们的软件实现能够在数小时内以大规模并行方式生成数千个无参考类平均图像,这在从头开始的三维重建方面有显著改进,并有助于对均匀数据集进行计算纯化以实现高分辨率可视化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/7982d20ad053/pone.0182130.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/e2b4e6887d30/pone.0182130.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/684a1cdcc0d7/pone.0182130.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/550f79bc6e80/pone.0182130.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/dfed81ad0fd9/pone.0182130.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/bda5e0fb2162/pone.0182130.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/b9cb7b1007d1/pone.0182130.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/7982d20ad053/pone.0182130.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/e2b4e6887d30/pone.0182130.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/684a1cdcc0d7/pone.0182130.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/550f79bc6e80/pone.0182130.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/dfed81ad0fd9/pone.0182130.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/bda5e0fb2162/pone.0182130.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/b9cb7b1007d1/pone.0182130.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e24/5546606/7982d20ad053/pone.0182130.g007.jpg

相似文献

1
Massively parallel unsupervised single-particle cryo-EM data clustering via statistical manifold learning.通过统计流形学习实现大规模并行无监督单粒子冷冻电镜数据聚类
PLoS One. 2017 Aug 7;12(8):e0182130. doi: 10.1371/journal.pone.0182130. eCollection 2017.
2
Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.通过自适应约束K均值算法进行无监督冷冻电镜数据聚类
PLoS One. 2016 Dec 13;11(12):e0167765. doi: 10.1371/journal.pone.0167765. eCollection 2016.
3
AutoCryoPicker: an unsupervised learning approach for fully automated single particle picking in Cryo-EM images.AutoCryoPicker:一种用于 Cryo-EM 图像全自动单颗粒挑选的无监督学习方法。
BMC Bioinformatics. 2019 Jun 13;20(1):326. doi: 10.1186/s12859-019-2926-y.
4
An Unsupervised Classification Algorithm for Heterogeneous Cryo-EM Projection Images Based on Autoencoders.基于自动编码器的异质冷冻电镜投影图像无监督分类算法。
Int J Mol Sci. 2023 May 6;24(9):8380. doi: 10.3390/ijms24098380.
5
A Stochastic Hill Climbing Approach for Simultaneous 2D Alignment and Clustering of Cryogenic Electron Microscopy Images.一种用于低温电子显微镜图像二维同时对齐和聚类的随机爬山方法。
Structure. 2016 Jun 7;24(6):988-96. doi: 10.1016/j.str.2016.04.006. Epub 2016 May 12.
6
cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination.cryoSPARC:用于快速无监督低温电子显微镜结构测定的算法。
Nat Methods. 2017 Mar;14(3):290-296. doi: 10.1038/nmeth.4169. Epub 2017 Feb 6.
7
Heterogeneous cryo-EM projection image classification using a two-stage spectral clustering based on novel distance measures.基于新型距离度量的两阶段谱聚类在异质冷冻电镜投影图像分类中的应用。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac032.
8
A particle-filter framework for robust cryo-EM 3D reconstruction.用于稳健冷冻电镜 3D 重建的粒子滤波框架。
Nat Methods. 2018 Dec;15(12):1083-1089. doi: 10.1038/s41592-018-0223-8. Epub 2018 Nov 30.
9
A Fast Image Alignment Approach for 2D Classification of Cryo-EM Images Using Spectral Clustering.基于谱聚类的冷冻电镜图像 2D 分类的快速图像配准方法。
Curr Issues Mol Biol. 2021 Oct 18;43(3):1652-1668. doi: 10.3390/cimb43030117.
10
Simcryocluster: a semantic similarity clustering method of cryo-EM images by adopting contrastive learning.Simcryocluster:一种采用对比学习的 cryo-EM 图像语义相似性聚类方法。
BMC Bioinformatics. 2024 Feb 20;25(1):77. doi: 10.1186/s12859-023-05565-w.

引用本文的文献

1
The membrane-proximal external region of human immunodeficiency virus (HIV-1) envelope glycoprotein trimers in A18-lipid nanodiscs.A18脂质纳米盘中人类免疫缺陷病毒(HIV-1)包膜糖蛋白三聚体的膜近端外部区域。
Commun Biol. 2025 Mar 15;8(1):442. doi: 10.1038/s42003-025-07852-z.
2
Simcryocluster: a semantic similarity clustering method of cryo-EM images by adopting contrastive learning.Simcryocluster:一种采用对比学习的 cryo-EM 图像语义相似性聚类方法。
BMC Bioinformatics. 2024 Feb 20;25(1):77. doi: 10.1186/s12859-023-05565-w.
3
Determining subunit-subunit interaction from statistics of cryo-EM images: observation of nearest-neighbor coupling in a circadian clock protein complex.

本文引用的文献

1
Conformational Landscape of the p28-Bound Human Proteasome Regulatory Particle.与p28结合的人蛋白酶体调节颗粒的构象景观
Mol Cell. 2017 Jul 20;67(2):322-333.e6. doi: 10.1016/j.molcel.2017.06.007. Epub 2017 Jul 6.
2
Robust Neighborhood Preserving Projection by Nuclear/L2,1-Norm Regularization for Image Feature Extraction.基于核范数/L2,1-范数正则化的鲁棒邻域保持投影的图像特征提取。
IEEE Trans Image Process. 2017 Apr;26(4):1607-1622. doi: 10.1109/TIP.2017.2654163. Epub 2017 Jan 16.
3
Structural basis for dynamic regulation of the human 26S proteasome.
从冷冻电镜图像的统计数据中确定亚基-亚基相互作用:在生物钟蛋白复合物中观察最近邻耦合。
Nat Commun. 2023 Sep 22;14(1):5907. doi: 10.1038/s41467-023-41575-1.
4
Probing the Conformational Space of the Cannabinoid Receptor 2 and a Systematic Investigation of DNP-Enhanced MAS NMR Spectroscopy of Proteins in Detergent Micelles.探索大麻素受体2的构象空间以及对去污剂胶束中蛋白质的动态核极化增强魔角旋转核磁共振光谱的系统研究。
ACS Omega. 2023 Aug 28;8(36):32963-32976. doi: 10.1021/acsomega.3c04681. eCollection 2023 Sep 12.
5
Asymmetric conformations of cleaved HIV-1 envelope glycoprotein trimers in styrene-maleic acid lipid nanoparticles.裂解的 HIV-1 包膜糖蛋白三聚体在苯乙烯-马来酸脂纳米颗粒中的非对称构象。
Commun Biol. 2023 May 18;6(1):535. doi: 10.1038/s42003-023-04916-w.
6
An Unsupervised Classification Algorithm for Heterogeneous Cryo-EM Projection Images Based on Autoencoders.基于自动编码器的异质冷冻电镜投影图像无监督分类算法。
Int J Mol Sci. 2023 May 6;24(9):8380. doi: 10.3390/ijms24098380.
7
Visualizing Conformational Space of Functional Biomolecular Complexes by Deep Manifold Learning.通过深度流形学习可视化功能生物分子复合物的构象空间。
Int J Mol Sci. 2022 Aug 9;23(16):8872. doi: 10.3390/ijms23168872.
8
Emerging Themes in CryoEM─Single Particle Analysis Image Processing.新兴主题在 CryoEM-单颗粒分析图像处理。
Chem Rev. 2022 Sep 14;122(17):13915-13951. doi: 10.1021/acs.chemrev.1c00850. Epub 2022 Jul 4.
9
Probing Structural Perturbation of Biomolecules by Extracting Cryo-EM Data Heterogeneity.探究生物分子结构扰动的方法:从低温电子显微镜数据异质性中提取信息。
Biomolecules. 2022 Apr 24;12(5):628. doi: 10.3390/biom12050628.
10
USP14-regulated allostery of the human proteasome by time-resolved cryo-EM.USP14 通过时间分辨冷冻电镜调节人蛋白酶体的变构。
Nature. 2022 May;605(7910):567-574. doi: 10.1038/s41586-022-04671-8. Epub 2022 Apr 27.
人类26S蛋白酶体动态调节的结构基础
Proc Natl Acad Sci U S A. 2016 Nov 15;113(46):12991-12996. doi: 10.1073/pnas.1614614113. Epub 2016 Oct 21.
4
The development of cryo-EM into a mainstream structural biology technique.冷冻电镜发展成为一种主流的结构生物学技术。
Nat Methods. 2016 Jan;13(1):24-7. doi: 10.1038/nmeth.3694.
5
Cryo-EM structure of the activated NAIP2-NLRC4 inflammasome reveals nucleated polymerization.活化的NAIP2-NLRC4炎性小体的冷冻电镜结构揭示了成核聚合作用。
Science. 2015 Oct 23;350(6259):404-9. doi: 10.1126/science.aac5789. Epub 2015 Oct 8.
6
Directly reconstructing principal components of heterogeneous particles from cryo-EM images.从冷冻电镜图像直接重建异质颗粒的主成分
J Struct Biol. 2015 Aug;191(2):245-62. doi: 10.1016/j.jsb.2015.05.007. Epub 2015 Jun 4.
7
Covariance Matrix Estimation for the Cryo-EM Heterogeneity Problem.用于冷冻电镜异质性问题的协方差矩阵估计
SIAM J Imaging Sci. 2015 Jan 22;8(1):126-185. doi: 10.1137/130935434.
8
Trajectories of the ribosome as a Brownian nanomachine.作为布朗运动纳米机器的核糖体轨迹。
Proc Natl Acad Sci U S A. 2014 Dec 9;111(49):17492-7. doi: 10.1073/pnas.1419276111. Epub 2014 Nov 24.
9
Conformations of macromolecules and their complexes from heterogeneous datasets.来自异构数据集的大分子及其复合物的构象
Philos Trans R Soc Lond B Biol Sci. 2014 Jul 17;369(1647):20130567. doi: 10.1098/rstb.2013.0567.
10
Rotationally invariant image representation for viewing direction classification in cryo-EM.用于冷冻电镜中观察方向分类的旋转不变图像表示
J Struct Biol. 2014 Apr;186(1):153-66. doi: 10.1016/j.jsb.2014.03.003. Epub 2014 Mar 12.