• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较分子模拟数据的几何和动力簇算法。

Comparing geometric and kinetic cluster algorithms for molecular simulation data.

机构信息

Laboratory of Physical Chemistry, Swiss Federal Institute of Technology, ETH, CH-8093 Zürich, Switzerland.

出版信息

J Chem Phys. 2010 Feb 21;132(7):074110. doi: 10.1063/1.3301140.

DOI:10.1063/1.3301140
PMID:20170218
Abstract

The identification of metastable states of a molecule plays an important role in the interpretation of molecular simulation data because the free-energy surface, the relative populations in this landscape, and ultimately also the dynamics of the molecule under study can be described in terms of these states. We compare the results of three different geometric cluster algorithms (neighbor algorithm, K-medoids algorithm, and common-nearest-neighbor algorithm) among each other and to the results of a kinetic cluster algorithm. First, we demonstrate the characteristics of each of the geometric cluster algorithms using five two-dimensional data sets. Second, we analyze the molecular dynamics data of a beta-heptapeptide in methanol--a molecule that exhibits a distinct folded state, a structurally diverse unfolded state, and a fast folding/unfolding equilibrium--using both geometric and kinetic cluster algorithms. We find that geometric clustering strongly depends on the algorithm used and that the density based common-nearest-neighbor algorithm is the most robust of the three geometric cluster algorithms with respect to variations in the input parameters and the distance metric. When comparing the geometric cluster results to the metastable states of the beta-heptapeptide as identified by kinetic clustering, we find that in most cases the folded state is identified correctly but the overlap of geometric clusters with further metastable states is often at best approximate.

摘要

确定分子的亚稳态在解释分子模拟数据方面起着重要作用,因为可以根据这些状态来描述自由能面、该景观中的相对种群,以及最终研究分子的动力学。我们相互比较了三种不同的几何聚类算法(邻域算法、K-均值算法和常见最近邻算法)以及动力学聚类算法的结果。首先,我们使用五个二维数据集展示了每种几何聚类算法的特征。其次,我们使用几何和动力学聚类算法分析了甲醇中β-七肽的分子动力学数据——该分子表现出明显的折叠态、结构多样的展开态以及快速折叠/展开平衡。我们发现几何聚类强烈依赖于所使用的算法,并且基于密度的常见最近邻算法是三种几何聚类算法中最稳健的,相对于输入参数和距离度量的变化。当将几何聚类结果与动力学聚类确定的β-七肽的亚稳态进行比较时,我们发现折叠态通常能正确识别,但几何聚类与进一步的亚稳态之间的重叠通常最多只是近似。

相似文献

1
Comparing geometric and kinetic cluster algorithms for molecular simulation data.比较分子模拟数据的几何和动力簇算法。
J Chem Phys. 2010 Feb 21;132(7):074110. doi: 10.1063/1.3301140.
2
Prediction of protein three-dimensional structures in insertion and deletion regions: a procedure for searching data bases of representative protein fragments using geometric scoring criteria.插入和缺失区域中蛋白质三维结构的预测:一种使用几何评分标准搜索代表性蛋白质片段数据库的方法。
J Mol Biol. 1995 Oct 13;253(1):114-31. doi: 10.1006/jmbi.1995.0540.
3
Density-based clustering of small peptide conformations sampled from a molecular dynamics simulation.基于密度的分子动力学模拟中小肽构象聚类。
J Chem Inf Model. 2009 Nov;49(11):2528-36. doi: 10.1021/ci800434e.
4
Free energy landscape and folding mechanism of a beta-hairpin in explicit water: a replica exchange molecular dynamics study.在显式水环境中β-发夹的自由能景观与折叠机制:一项副本交换分子动力学研究
Proteins. 2005 Dec 1;61(4):795-808. doi: 10.1002/prot.20696.
5
Fast k-nearest neighbor classification using cluster-based trees.使用基于聚类的树进行快速k近邻分类。
IEEE Trans Pattern Anal Mach Intell. 2004 Apr;26(4):525-8. doi: 10.1109/TPAMI.2004.1265868.
6
Construction of the free energy landscape of biomolecules via dihedral angle principal component analysis.通过二面角主成分分析构建生物分子的自由能景观
J Chem Phys. 2008 Jun 28;128(24):245102. doi: 10.1063/1.2945165.
7
Evaluation of stability of k-means cluster ensembles with respect to random initialization.关于随机初始化的k均值聚类集成稳定性评估。
IEEE Trans Pattern Anal Mach Intell. 2006 Nov;28(11):1798-808. doi: 10.1109/TPAMI.2006.226.
8
Molecular dynamics simulations of protein unfolding and limited refolding: characterization of partially unfolded states of ubiquitin in 60% methanol and in water.蛋白质展开与有限复性的分子动力学模拟:泛素在60%甲醇和水中部分展开状态的表征
J Mol Biol. 1995 Mar 31;247(3):501-20. doi: 10.1006/jmbi.1994.0156.
9
A shorter peptide model from staphylococcal nuclease for the folding-unfolding equilibrium of a beta-hairpin shows that unfolded state has significant contribution from compact conformational states.一种来自葡萄球菌核酸酶的较短肽模型,用于β-发夹结构的折叠-去折叠平衡,该模型表明,未折叠状态对紧密构象状态有显著贡献。
J Struct Biol. 2008 Oct;164(1):60-74. doi: 10.1016/j.jsb.2008.06.003. Epub 2008 Jun 15.
10
Unsupervised clustering algorithm for N-dimensional data.用于N维数据的无监督聚类算法。
J Neurosci Methods. 2005 May 15;144(1):19-24. doi: 10.1016/j.jneumeth.2004.10.015. Epub 2004 Dec 18.

引用本文的文献

1
An INS-1 832/13 𝛽-Cell Proteome Highlights the Rapid Regulation of Fatty Acid Biosynthesis in Glucose-Stimulated Insulin Secretion.INS-1 832/13β细胞蛋白质组揭示了葡萄糖刺激的胰岛素分泌中脂肪酸生物合成的快速调节。
Proteomics. 2025 Aug;25(15):13-26. doi: 10.1002/pmic.70005. Epub 2025 Jul 20.
2
Divide and Cluster: The DIVINE Framework for Deterministic Top-Down Analysis of Molecular Dynamics Trajectories.划分与聚类:用于分子动力学轨迹确定性自上而下分析的DIVINE框架。
bioRxiv. 2025 Jun 26:2025.06.20.660828. doi: 10.1101/2025.06.20.660828.
3
Scaling -Means for Multi-Million Frames: A Stratified NANI Approach for Large-Scale MD Simulations.
数百万帧的缩放方法:一种用于大规模分子动力学模拟的分层非自适应邻居搜索方法
bioRxiv. 2025 Jun 18:2025.06.15.659780. doi: 10.1101/2025.06.15.659780.
4
Extended Quality (eQual): Radial Threshold Clustering Based on -ary Similarity.扩展质量(eQual):基于 - 元相似度的径向阈值聚类
J Chem Inf Model. 2025 May 26;65(10):5062-5070. doi: 10.1021/acs.jcim.4c02341. Epub 2025 May 1.
5
CADENCE: Clustering Algorithm - Density-based Exploration and Novelty Clustering with Efficiency.CADENCE:聚类算法——基于密度的探索与高效新颖性聚类
bioRxiv. 2025 Feb 28:2025.02.24.639863. doi: 10.1101/2025.02.24.639863.
6
Extended Quality (eQual): Radial threshold clustering based on n-ary similarity.扩展质量(eQual):基于n元相似度的径向阈值聚类
bioRxiv. 2024 Dec 5:2024.12.05.627001. doi: 10.1101/2024.12.05.627001.
7
Exploring the Impact of Protein Chain Selection in Binding Energy Calculations with DFT.探索蛋白质链选择对基于密度泛函理论的结合能计算的影响。
Chemphyschem. 2024 Dec 16;25(24):e202400119. doi: 10.1002/cphc.202400119. Epub 2024 Nov 1.
8
Quantifying Unbiased Conformational Ensembles from Biased Simulations Using ShapeGMM.使用 ShapeGMM 从有偏模拟中定量无偏构象集合。
J Chem Theory Comput. 2024 May 14;20(9):3492-3502. doi: 10.1021/acs.jctc.4c00223. Epub 2024 Apr 25.
9
Clustering molecular dynamics conformations of the CC'-loop of the PD-1 immuno-checkpoint receptor.对程序性死亡蛋白1(PD-1)免疫检查点受体CC'环的分子动力学构象进行聚类分析。
Comput Struct Biotechnol J. 2023 Jul 13;21:3920-3932. doi: 10.1016/j.csbj.2023.07.004. eCollection 2023.
10
Size-and-Shape Space Gaussian Mixture Models for Structural Clustering of Molecular Dynamics Trajectories.用于分子动力学轨迹结构聚类的大小和形状空间高斯混合模型。
J Chem Theory Comput. 2022 May 10;18(5):3218-3230. doi: 10.1021/acs.jctc.1c01290. Epub 2022 Apr 28.