• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于库尔贝克-莱布勒散度分析的贝叶斯相似性搜索指纹约简方法的开发。

Development of a fingerprint reduction approach for Bayesian similarity searching based on Kullback-Leibler divergence analysis.

作者信息

Nisius Britta, Vogt Martin, Bajorath Jürgen

机构信息

Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universitat, D-53113 Bonn, Germany.

出版信息

J Chem Inf Model. 2009 Jun;49(6):1347-58. doi: 10.1021/ci900087y.

DOI:10.1021/ci900087y
PMID:19480403
Abstract

The contribution of individual fingerprint bit positions to similarity search performance is systematically evaluated. A method is introduced to determine bit significance on the basis of Kullback-Leibler divergence analysis of bit distributions in active and database compounds. Bit divergence analysis and Bayesian compound screening share a common methodological foundation. Hence, given the significance ranking of all individual bit positions comprising a fingerprint, subsets of bits are evaluated in the context of Bayesian screening, and minimal fingerprint representations are determined that meet or exceed the search performance of unmodified fingerprints. For fingerprints of different design evaluated on many compound activity classes, we consistently find that subsets of fingerprint bit positions are responsible for search performance. In part, these subsets are very small and contain in some cases only a few fingerprint bit positions. Structural or pharmacophore patterns captured by preferred bit positions can often be directly associated with characteristic features of active compounds. In some cases, reduced fingerprint representations clearly exceed the search performance of the original fingerprints. Thus, fingerprint reduction likely represents a promising approach for practical applications.

摘要

系统地评估了各个指纹位位置对相似性搜索性能的贡献。引入了一种基于活性化合物和数据库化合物中位分布的库尔贝克-莱布勒散度分析来确定位重要性的方法。位散度分析和贝叶斯化合物筛选有着共同的方法基础。因此,给定构成指纹的所有单个位位置的重要性排名,在位散度分析的背景下评估位的子集,并确定满足或超过未修改指纹搜索性能的最小指纹表示。对于在许多化合物活性类别上评估的不同设计的指纹,我们一致发现指纹位位置的子集决定了搜索性能。部分情况下,这些子集非常小,在某些情况下仅包含几个指纹位位置。优选位位置捕获的结构或药效团模式通常可以直接与活性化合物的特征相关联。在某些情况下,简化的指纹表示明显超过了原始指纹的搜索性能。因此,指纹简化可能是一种很有前景的实际应用方法。

相似文献

1
Development of a fingerprint reduction approach for Bayesian similarity searching based on Kullback-Leibler divergence analysis.基于库尔贝克-莱布勒散度分析的贝叶斯相似性搜索指纹约简方法的开发。
J Chem Inf Model. 2009 Jun;49(6):1347-58. doi: 10.1021/ci900087y.
2
Predicting the performance of fingerprint similarity searching.预测指纹相似性搜索的性能。
Methods Mol Biol. 2011;672:159-73. doi: 10.1007/978-1-60761-839-3_6.
3
Bayesian screening for active compounds in high-dimensional chemical spaces combining property descriptors and molecular fingerprints.结合性质描述符和分子指纹的高维化学空间中活性化合物的贝叶斯筛选
Chem Biol Drug Des. 2008 Jan;71(1):8-14. doi: 10.1111/j.1747-0285.2007.00602.x. Epub 2007 Dec 7.
4
Reduction and recombination of fingerprints of different design increase compound recall and the structural diversity of hits.减少和重新组合不同设计的指纹可以提高化合物的召回率和命中结构的多样性。
Chem Biol Drug Des. 2010 Feb;75(2):152-60. doi: 10.1111/j.1747-0285.2009.00930.x. Epub 2009 Dec 18.
5
Bit silencing in fingerprints enables the derivation of compound class-directed similarity metrics.指纹中的位沉默能够推导出化合物类别导向的相似性度量。
J Chem Inf Model. 2008 Sep;48(9):1754-9. doi: 10.1021/ci8002045. Epub 2008 Aug 13.
6
Rendering conventional molecular fingerprints for virtual screening independent of molecular complexity and size effects.生成与分子复杂性和大小效应无关的虚拟筛选常规分子指纹。
ChemMedChem. 2010 Jun 7;5(6):859-68. doi: 10.1002/cmdc.201000089.
7
Random reduction in fingerprint bit density improves compound recall in search calculations using complex reference molecules.在使用复杂参考分子的搜索计算中,随机降低指纹位密度可提高化合物召回率。
Chem Biol Drug Des. 2008 Jun;71(6):511-7. doi: 10.1111/j.1747-0285.2008.00664.x. Epub 2008 May 7.
8
RelACCS-FP: a structural minimalist approach to fingerprint design.RelACCS-FP:一种指纹设计的结构极简主义方法。
Chem Biol Drug Des. 2008 Nov;72(5):341-9. doi: 10.1111/j.1747-0285.2008.00723.x.
9
Introduction of a generally applicable method to estimate retrieval of active molecules for similarity searching using fingerprints.介绍一种使用指纹来估计活性分子检索以进行相似性搜索的通用方法。
ChemMedChem. 2007 Sep;2(9):1311-20. doi: 10.1002/cmdc.200700090.
10
How do 2D fingerprints detect structurally diverse active compounds? Revealing compound subset-specific fingerprint features through systematic selection.2D 指纹如何检测结构多样的活性化合物?通过系统选择揭示化合物子集特异性指纹特征。
J Chem Inf Model. 2011 Sep 26;51(9):2254-65. doi: 10.1021/ci200275m. Epub 2011 Aug 8.

引用本文的文献

1
Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach.使用特征值熵方法分析相关指纹对分子相似性的影响。
J Cheminform. 2021 Mar 23;13(1):27. doi: 10.1186/s13321-021-00506-2.
2
Practical application of the Average Information Content Maximization (AIC-MAX) algorithm: selection of the most important structural features for serotonin receptor ligands.平均信息含量最大化(AIC-MAX)算法的实际应用:血清素受体配体最重要结构特征的选择
Mol Divers. 2017 May;21(2):407-412. doi: 10.1007/s11030-017-9729-8. Epub 2017 Feb 9.
3
Average Information Content Maximization--A New Approach for Fingerprint Hybridization and Reduction.
平均信息内容最大化——指纹杂交与简化的新方法。
PLoS One. 2016 Jan 19;11(1):e0146666. doi: 10.1371/journal.pone.0146666. eCollection 2016.
4
QSAR with experimental and predictive distributions: an information theoretic approach for assessing model quality.定量构效关系(QSAR)与实验和预测分布:评估模型质量的信息理论方法。
J Comput Aided Mol Des. 2013 Mar;27(3):203-19. doi: 10.1007/s10822-013-9639-5. Epub 2013 Mar 16.