Suppr超能文献

DONKEY:一种灵活且准确的聚类算法。

DONKEY: A Flexible and Accurate Algorithm for Clustering.

作者信息

Kára Jakub, Acheson Kyle, Kirrander Adam

机构信息

Physical and Theoretical Chemistry Laboratory, Department of Chemistry, University of Oxford, South Parks Road, Oxford OX1 3QZ, United Kingdom.

Department of Chemistry, University of Warwick, Coventry CV4 7AL, United Kingdom.

出版信息

J Chem Theory Comput. 2025 Jun 24;21(12):5789-5802. doi: 10.1021/acs.jctc.4c01750. Epub 2025 May 2.

Abstract

We propose an accurate clustering algorithm suitable for the varied and multidimensional data sets that correspond to temporal snapshots from nonadiabatic trajectory-based simulations of photoexcited dynamics. The algorithm approximates the underlying probability density function using variable kernel density estimation, with local maxima corresponding to cluster centers. Each data point is then assigned to one of the maxima by employing a maximization procedure. Finally, clusters artificially separated by minor fluctuations in the probability density are merged. The algorithm does not require parameter tuning, which ensures flexibility and reduces the risk of bias. It is tested on several synthetic data sets, where it consistently outperforms conventional clustering algorithms. As a final example, the algorithm is applied to the excited dynamics of the norbornadiene ⇌ quadricyclane (CH) molecular photoswitch, demonstrating how distinct reaction pathways can be identified.

摘要

我们提出了一种精确的聚类算法,适用于与光激发动力学的非绝热轨迹模拟中的时间快照相对应的多样且多维的数据集。该算法使用可变核密度估计来近似潜在的概率密度函数,局部最大值对应于聚类中心。然后通过采用最大化过程将每个数据点分配到其中一个最大值。最后,合并由概率密度中的微小波动人为分离的聚类。该算法不需要参数调整,这确保了灵活性并降低了偏差风险。它在几个合成数据集上进行了测试,在这些数据集上它始终优于传统的聚类算法。作为最后一个例子,该算法应用于降冰片二烯⇌四环烷(CH)分子光开关的激发动力学,展示了如何识别不同的反应途径。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验