Molecular diversity in chemical databases: comparison of medicinal chemistry knowledge bases and databases of commercially available compounds.

Authors

Cummins D J, Andrews C W, Bentley J A, Cory M

Affiliation

Division of Medicinal Chemistry, Glaxo Wellcome, Research Triangle Park, North Carolina 27709, USA.

Publication

J Chem Inf Comput Sci. 1996 Jul-Aug;36(4):750-63. doi: 10.1021/ci950168h.

DOI: 10.1021/ci950168h
PMID: 8768767

Abstract

A molecular descriptor space has been developed which describes structural diversity. Large databases of molecules have been mapped into it and compared. This analysis used five chemical databases, CMC and MDDR, which represent knowledge bases containing active medicinal agents, ACD and SPECS, two databases of commercially available compounds, and finally the Wellcome Registry. Together these databases contained more than 300,000 structures. Topological indices and the free energy of solvation were computed for each compound in the databases. Factor analysis was used to reduce the dimensionality of the descriptor space. Low density observations were deleted as a way of removing outliers, which allowed a further reduction in the descriptor space of interest. The five databases could then be compared on an efficient basis using a metric developed for this purpose. A Riemann gridding scheme was used to subdivide the factor space into subhypercubes to obtain accurate comparisons. Most of the 300,000 structures were highly clustered, but unique structures were found. An analysis of overlap between the biological and commercial databases was carried out. The metric provides a useful algorithm for choosing screening sets of diverse compounds from large databases.
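The gridding and comparison steps described in the abstract can be illustrated with a small sketch. The abstract does not define the metric itself, so the code below substitutes a simple cell-occupancy overlap as a stand-in; it is not the authors' method. It assumes the compounds have already been reduced to a few factor scores, and every name here (`cell_ids`, `occupied_cells`, `overlap_fraction`, `one_per_cell`, `min_count`) is hypothetical.

```python
import numpy as np

def cell_ids(points, lo, hi, bins):
    """Map each point to the integer index of the grid cell (subhypercube) containing it."""
    points = np.asarray(points, dtype=float)
    scaled = (points - lo) / (hi - lo)           # rescale each axis to [0, 1]
    idx = np.floor(scaled * bins).astype(int)
    return np.clip(idx, 0, bins - 1)             # out-of-range points fall into the edge cells

def occupied_cells(points, lo, hi, bins, min_count=1):
    """Set of cells holding at least `min_count` points (sparser cells are treated as outliers)."""
    cells, counts = np.unique(cell_ids(points, lo, hi, bins), axis=0, return_counts=True)
    return {tuple(c.tolist()) for c, n in zip(cells, counts) if n >= min_count}

def overlap_fraction(a, b, lo, hi, bins, min_count=1):
    """Assumed stand-in metric: share of cells occupied by `a` that `b` also occupies."""
    cells_a = occupied_cells(a, lo, hi, bins, min_count)
    cells_b = occupied_cells(b, lo, hi, bins, min_count)
    if not cells_a:
        return 0.0
    return len(cells_a & cells_b) / len(cells_a)

def one_per_cell(points, lo, hi, bins):
    """Indices of the first point found in each occupied cell: a crude diverse subset."""
    _, first = np.unique(cell_ids(points, lo, hi, bins), axis=0, return_index=True)
    return np.sort(first)

# Toy factor scores for two made-up "databases" in a 2-D factor space.
lo, hi = np.array([0.0, 0.0]), np.array([1.0, 1.0])
db_a = [[0.10, 0.10], [0.15, 0.12], [0.90, 0.90]]
db_b = [[0.12, 0.18], [0.50, 0.50]]

print(overlap_fraction(db_a, db_b, lo, hi, bins=4))   # 0.5
print(one_per_cell(db_a, lo, hi, bins=4))             # [0 2]
```

Picking one compound per occupied cell is only one way a grid could drive screening-set selection; the cell size (`bins`) and the density cutoff (`min_count`) are free parameters in this sketch and would need tuning against real descriptor data.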
