• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分层扩展链接方法(HELM)对混合聚类策略的深入探讨。

Hierarchical Extended Linkage Method (HELM)'s Deep Dive into Hybrid Clustering Strategies.

作者信息

Chen Lexin, Brylle Woody Santos Jherome, Gaza Jokent, Perez Alberto, Miranda-Quintana Ramón Alain

机构信息

Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville 32611, Florida, United States.

出版信息

J Chem Inf Model. 2025 Jun 23;65(12):6209-6220. doi: 10.1021/acs.jcim.5c00539. Epub 2025 Jun 2.

DOI:10.1021/acs.jcim.5c00539
PMID:40452401
Abstract

Clustering remains a key tool in the analysis of molecular dynamics (MD) simulations, from the preparation of kinetic models to the study of mechanistic pathways and structural determination. It is no surprise then that multiple algorithms are currently used in the MD community, with -means and hierarchical approaches being arguably the two most popular approaches. The former is very attractive from a purely computational point of view, demanding minimal memory and time resources, but at the price of being able to partition the data in very restrictive ways. Hierarchical strategies, on the other hand, can generate arbitrary partitions, but with steep memory and time requirements due to their need to build a pairwise distance matrix for all the considered conformations/frames. Here we propose a new hybrid paradigm, the hierarchical extended linkage method (HELM), that retains the efficiency of -means while incorporating the flexibility of hierarchical methods. The key ingredient is the use of -ary difference functions as a way to stabilize the -means results and efficiently build the hierarchy of subsets. We showcase the applicability of this strategy over protein-DNA and protein folding studies, including the complete analysis of simulations with over 1.5 million frames. HELM is freely available in our MDANCE clustering package.

摘要

聚类仍然是分子动力学(MD)模拟分析中的关键工具,从动力学模型的构建到机理途径的研究以及结构确定。因此,毫不奇怪MD领域目前使用了多种算法,其中k均值和层次聚类方法可以说是最受欢迎的两种方法。从纯粹的计算角度来看,前者非常有吸引力,只需要极少的内存和时间资源,但代价是只能以非常受限的方式对数据进行划分。另一方面,层次聚类策略可以生成任意划分,但由于需要为所有考虑的构象/帧构建成对距离矩阵,因此对内存和时间的要求很高。在此,我们提出了一种新的混合范式,即层次扩展链接方法(HELM),它保留了k均值的效率,同时融入了层次聚类方法的灵活性。关键要素是使用q元差分函数来稳定k均值的结果并有效地构建子集层次结构。我们展示了该策略在蛋白质-DNA和蛋白质折叠研究中的适用性,包括对超过150万个帧的模拟进行完整分析。HELM可在我们的MDANCE聚类软件包中免费获取。

相似文献

1
Hierarchical Extended Linkage Method (HELM)'s Deep Dive into Hybrid Clustering Strategies.分层扩展链接方法(HELM)对混合聚类策略的深入探讨。
J Chem Inf Model. 2025 Jun 23;65(12):6209-6220. doi: 10.1021/acs.jcim.5c00539. Epub 2025 Jun 2.
2
Hierarchical Extended Linkage Method (HELM)'s Deep Dive into Hybrid Clustering Strategies.分层扩展链接方法(HELM)对混合聚类策略的深入研究。
bioRxiv. 2025 Mar 10:2025.03.05.641742. doi: 10.1101/2025.03.05.641742.
3
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
4
Incentives for preventing smoking in children and adolescents.预防儿童和青少年吸烟的激励措施。
Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.
5
Family-centred interventions for Indigenous early childhood well-being by primary healthcare services.以初级医疗保健服务为中心的家庭干预措施,促进土著儿童早期的身心健康。
Cochrane Database Syst Rev. 2022 Dec 13;12(12):CD012463. doi: 10.1002/14651858.CD012463.pub2.
6
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
7
Taxane monotherapy regimens for the treatment of recurrent epithelial ovarian cancer.紫杉烷类单药治疗方案用于复发性上皮性卵巢癌。
Cochrane Database Syst Rev. 2022 Jul 12;7(7):CD008766. doi: 10.1002/14651858.CD008766.pub3.
8
Antidepressants for chronic non-cancer pain in children and adolescents.用于治疗儿童和青少年慢性非癌性疼痛的抗抑郁药。
Cochrane Database Syst Rev. 2017 Aug 5;8(8):CD012535. doi: 10.1002/14651858.CD012535.pub2.
9
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

引用本文的文献

1
Scaling -Means for Multi-Million Frames: A Stratified NANI Approach for Large-Scale MD Simulations.数百万帧的缩放方法:一种用于大规模分子动力学模拟的分层非自适应邻居搜索方法
bioRxiv. 2025 Jun 18:2025.06.15.659780. doi: 10.1101/2025.06.15.659780.
2
clusttraj: A Solvent-Informed Clustering Tool for Molecular Modeling.Clusttraj:一种用于分子建模的溶剂信息聚类工具。
J Chem Theory Comput. 2025 Jul 22;21(14):6759-6768. doi: 10.1021/acs.jctc.5c00634. Epub 2025 Jul 3.