Clustering files of chemical structures using the Székely-Rizzo generalization of Ward's method.

Authors

Varin Thibault, Bureau Ronan, Mueller Christoph, Willett Peter

Affiliation

Centre d'Etudes et de Recherche sur le Médicament de Normandie, UPRES EA4258, INC3M FR CNRS 3038, Université de Caen, Boulevard Becquerel, 14032 Caen Cedex, France.

Publication

J Mol Graph Model. 2009 Sep;28(2):187-95. doi: 10.1016/j.jmgm.2009.06.006. Epub 2009 Jul 4.

DOI: 10.1016/j.jmgm.2009.06.006
PMID: 19640752

Abstract

Ward's method is extensively used for clustering chemical structures represented by 2D fingerprints. This paper compares Ward clusterings of 14 datasets (containing between 278 and 4332 molecules) with those obtained using the Székely-Rizzo clustering method, a generalization of Ward's method. The clusters resulting from these two methods were evaluated by the extent to which the various classifications were able to group active molecules together, using a novel criterion of clustering effectiveness. Analysis of a total of 1400 classifications (Ward and Székely-Rizzo clustering methods, 14 different datasets, 5 different fingerprints and 10 different distance coefficients) demonstrated the general superiority of the Székely-Rizzo method. The distance coefficient first described by Soergel performed extremely well in these experiments, and this was also the case when it was used in simulated virtual screening experiments.
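The two ingredients named in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes the standard definition of the Soergel distance (sum of absolute differences divided by the sum of element-wise maxima) and the between-cluster "e-distance" from Székely and Rizzo's published formulation, which with exponent `alpha = 2` and Euclidean distances reduces to Ward's criterion up to a constant factor. The function names, the choice `alpha = 1.0`, and the naive quadratic search are illustrative assumptions; the abstract does not state which settings the paper used.

```python
from itertools import combinations


def soergel(x, y):
    """Soergel distance between two non-negative vectors.

    For binary fingerprints this equals 1 - Tanimoto similarity.
    Two all-zero vectors are treated as identical (distance 0).
    """
    denom = sum(max(a, b) for a, b in zip(x, y))
    if denom == 0:
        return 0.0
    return sum(abs(a - b) for a, b in zip(x, y)) / denom


def e_distance(A, B, dist, alpha=1.0):
    """Szekely-Rizzo e-distance between clusters A and B (lists of points).

    e(A, B) = n1*n2/(n1+n2) * (2*mean d(a,b)^alpha
                               - mean d(a,a')^alpha - mean d(b,b')^alpha)
    """
    def mean_pow(P, Q):
        total = sum(dist(p, q) ** alpha for p in P for q in Q)
        return total / (len(P) * len(Q))

    n1, n2 = len(A), len(B)
    return (n1 * n2 / (n1 + n2)) * (
        2 * mean_pow(A, B) - mean_pow(A, A) - mean_pow(B, B)
    )


def agglomerate(points, k, dist=soergel, alpha=1.0):
    """Naive agglomerative clustering: repeatedly merge the pair of
    clusters with the smallest e-distance until k clusters remain."""
    clusters = [[p] for p in points]
    while len(clusters) > k:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: e_distance(
                clusters[ij[0]], clusters[ij[1]], dist, alpha
            ),
        )
        # i < j always holds here, so deleting index j leaves i valid.
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


if __name__ == "__main__":
    # Hypothetical 6-bit fingerprints, invented for illustration only.
    fps = [(1, 1, 1, 0, 0, 0), (1, 1, 0, 0, 0, 0),
           (0, 0, 0, 1, 1, 1), (0, 0, 0, 0, 1, 1)]
    print(agglomerate(fps, k=2))
```

This sketch recomputes every pairwise e-distance at each step, which is only practical for toy inputs. For datasets of the size mentioned in the abstract, a recursive distance update (the e-distance satisfies the same Lance-Williams update as Ward's method) would be the natural choice.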

Similar articles

1
Clustering files of chemical structures using the Székely-Rizzo generalization of Ward's method.
J Mol Graph Model. 2009 Sep;28(2):187-95. doi: 10.1016/j.jmgm.2009.06.006. Epub 2009 Jul 4.
2
Effect of data standardization on chemical clustering and similarity searching.
J Chem Inf Model. 2009 Feb;49(2):155-61. doi: 10.1021/ci800224h.
3
Clustering files of chemical structures using the fuzzy k-means clustering method.
J Chem Inf Comput Sci. 2004 May-Jun;44(3):894-902. doi: 10.1021/ci0342674.
4
Graph-Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures.
Mol Inform. 2013 Feb;32(2):165-78. doi: 10.1002/minf.201200110. Epub 2013 Feb 5.
5
Clustering and rule-based classifications of chemical structures evaluated in the biological activity space.
J Chem Inf Model. 2007 Mar-Apr;47(2):325-36. doi: 10.1021/ci6004004. Epub 2007 Feb 8.
6
Combining multiple classifications of chemical structures using consensus clustering.
Bioorg Med Chem. 2012 Sep 15;20(18):5366-71. doi: 10.1016/j.bmc.2012.03.010. Epub 2012 Mar 10.
7
Voting-based consensus clustering for combining multiple clusterings of chemical structures.
J Cheminform. 2012 Dec 17;4(1):37. doi: 10.1186/1758-2946-4-37.
8
Information Theory and Voting Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures.
Mol Inform. 2013 Jul;32(7):591-8. doi: 10.1002/minf.201300004. Epub 2013 May 15.
9
Consensus methods for combining multiple clusterings of chemical structures.
J Chem Inf Model. 2013 May 24;53(5):1026-34. doi: 10.1021/ci300442u. Epub 2013 Apr 26.
10
Generalising Ward's Method for Use with Manhattan Distances.
PLoS One. 2017 Jan 13;12(1):e0168288. doi: 10.1371/journal.pone.0168288. eCollection 2017.

Cited by

1
Single-cell somatic copy number alteration profiling of vitreous humor seeds in retinoblastoma.
Ophthalmic Genet. 2024 Dec;45(6):646-649. doi: 10.1080/13816810.2024.2374886. Epub 2024 Jul 17.
2
Pharmacological affinity fingerprints derived from bioactivity data for the identification of designer drugs.
J Cheminform. 2022 Jun 7;14(1):35. doi: 10.1186/s13321-022-00607-6.
3
An integrated method for optimized identification of effective natural inhibitors against SARS-CoV-2 3CLpro.
Sci Rep. 2021 Nov 23;11(1):22796. doi: 10.1038/s41598-021-02266-3.
4
Estimating Linear and Nonlinear Gene Coexpression Networks by Semiparametric Neighborhood Selection.
Genetics. 2020 Jul;215(3):597-607. doi: 10.1534/genetics.120.303186. Epub 2020 May 15.
5
Genomic cfDNA Analysis of Aqueous Humor in Retinoblastoma Predicts Eye Salvage: The Surrogate Tumor Biopsy for Retinoblastoma.
Mol Cancer Res. 2018 Nov;16(11):1701-1712. doi: 10.1158/1541-7786.MCR-18-0369. Epub 2018 Jul 30.
6
The comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix.
J Cheminform. 2017 Mar 23;9(1):21. doi: 10.1186/s13321-017-0208-0.
7
Weighted voting-based consensus clustering for chemical structure databases.
J Comput Aided Mol Des. 2014 Jun;28(6):675-84. doi: 10.1007/s10822-014-9750-2. Epub 2014 May 15.
8
Open-source platform to benchmark fingerprints for ligand-based virtual screening.
J Cheminform. 2013 May 30;5(1):26. doi: 10.1186/1758-2946-5-26.
9
Voting-based consensus clustering for combining multiple clusterings of chemical structures.
J Cheminform. 2012 Dec 17;4(1):37. doi: 10.1186/1758-2946-4-37.