Profile scaling increases the similarity search performance of molecular fingerprints containing numerical descriptors and structural keys.

Authors

Xue Ling, Godden Jeffrey W, Stahura Florence L, Bajorath Jürgen

Affiliation

Department of Computer-Aided Drug Discovery, Albany Molecular Research, Inc., Bothell Research Center, 18804 North Creek Parkway, Bothell, Washington 98011, USA.

Publication details

J Chem Inf Comput Sci. 2003 Jul-Aug;43(4):1218-25. doi: 10.1021/ci030287u.

DOI: 10.1021/ci030287u
PMID: 12870914

Abstract

The concept of compound class-specific profiling and scaling of molecular fingerprints for similarity searching is discussed and applied to newly designed fingerprint representations. The approach is based on the analysis of characteristic patterns of bits in keyed fingerprints that are set on in compounds having equivalent biological activity. Once a fingerprint profile is generated for a particular activity class, scaling factors that are weighted according to observed bit frequencies are applied to signature bit positions when searching for similar compounds. In systematic similarity search calculations over 23 diverse activity classes, profile scaling consistently increased the performance of fingerprints containing property descriptors and/or structural keys. A significant improvement of approximately 15% was observed for a new fingerprint consisting of binary encoded molecular property descriptors and structural keys. Under scaling conditions, this fingerprint, termed MP-MFP, correctly recognized on average close to 60% of all active test compounds, with only a few false positives. MP-MFP outperformed MACCS keys and other reference fingerprints. In general, optimum performance in scaling calculations was achieved at higher threshold values of the Tanimoto coefficient than in nonscaled calculations, thereby increasing the search selectivity. In general, putting relatively high weight on signature bit positions that were always, or almost always, set on was found to be the most effective scaling procedure. Analysis of class-specific search performance revealed that profile scaling of MP-MFP improved the similarity search results for each of the 23 activity classes.
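The profiling and scaling steps described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact scaling formula, so everything below is an assumption made for illustration: the function names, the 0.9 frequency cutoff for "signature" bits, the scaling factor of 3, and the use of a weighted Tanimoto coefficient are hypothetical choices, not the authors' published procedure.

```python
from typing import List, Sequence


def bit_frequencies(class_fps: Sequence[Sequence[int]]) -> List[float]:
    """Fingerprint profile: fraction of class compounds with each bit set on."""
    n = len(class_fps)
    return [sum(column) / n for column in zip(*class_fps)]


def scaling_weights(freqs: Sequence[float],
                    min_freq: float = 0.9,   # assumed cutoff for signature bits
                    factor: float = 3.0      # assumed scaling factor
                    ) -> List[float]:
    """Give signature bits (set on in almost all class members) a higher weight."""
    return [factor if f >= min_freq else 1.0 for f in freqs]


def weighted_tanimoto(a: Sequence[int], b: Sequence[int],
                      weights: Sequence[float]) -> float:
    """Tanimoto coefficient in which each bit counts with its weight.

    With all weights equal to 1.0 this reduces to the standard Tanimoto
    coefficient c / (a + b - c).
    """
    common = sum(w for x, y, w in zip(a, b, weights) if x and y)
    on_a = sum(w for x, w in zip(a, weights) if x)
    on_b = sum(w for y, w in zip(b, weights) if y)
    denom = on_a + on_b - common
    return common / denom if denom else 0.0


# Toy 4-bit fingerprints for one hypothetical activity class.
class_fps = [[1, 1, 0, 1], [1, 1, 1, 0], [1, 1, 0, 0]]
profile = bit_frequencies(class_fps)      # bits 0 and 1 are always set on
weights = scaling_weights(profile)        # signature bits get weight 3.0
unit = [1.0] * len(profile)               # unscaled reference weights

member_a, member_b = class_fps[0], class_fps[1]
outsider = [0, 0, 1, 1]                   # shares no signature bits

print(weighted_tanimoto(member_a, member_b, unit))     # unscaled, class members
print(weighted_tanimoto(member_a, member_b, weights))  # scaled, class members
print(weighted_tanimoto(member_a, outsider, unit))     # unscaled, outsider
print(weighted_tanimoto(member_a, outsider, weights))  # scaled, outsider
```

In this toy example the similarity between the two class members rises from 0.5 to 0.75 under scaling, while the similarity to the outsider drops from 0.25 to 0.125. This is consistent with the abstract's observation that scaled searches can use a higher Tanimoto threshold and thereby become more selective, although the numbers here come from made-up fingerprints rather than the paper's data.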

Similar articles

1
Profile scaling increases the similarity search performance of molecular fingerprints containing numerical descriptors and structural keys.
J Chem Inf Comput Sci. 2003 Jul-Aug;43(4):1218-25. doi: 10.1021/ci030287u.
2
Similarity search profiling reveals effects of fingerprint scaling in virtual screening.
J Chem Inf Comput Sci. 2004 Nov-Dec;44(6):2032-9. doi: 10.1021/ci0400819.
3
Design and evaluation of a molecular fingerprint involving the transformation of property descriptor values into a binary classification scheme.
J Chem Inf Comput Sci. 2003 Jul-Aug;43(4):1151-7. doi: 10.1021/ci030285+.
4
Bit silencing in fingerprints enables the derivation of compound class-directed similarity metrics.
J Chem Inf Model. 2008 Sep;48(9):1754-9. doi: 10.1021/ci8002045. Epub 2008 Aug 13.
5
Design and evaluation of a novel class-directed 2D fingerprint to search for structurally diverse active compounds.
J Chem Inf Model. 2006 Nov-Dec;46(6):2515-26. doi: 10.1021/ci600303b.
6
Development of a fingerprint reduction approach for Bayesian similarity searching based on Kullback-Leibler divergence analysis.
J Chem Inf Model. 2009 Jun;49(6):1347-58. doi: 10.1021/ci900087y.
7
Rendering conventional molecular fingerprints for virtual screening independent of molecular complexity and size effects.
ChemMedChem. 2010 Jun 7;5(6):859-68. doi: 10.1002/cmdc.201000089.
8
Bayesian screening for active compounds in high-dimensional chemical spaces combining property descriptors and molecular fingerprints.
Chem Biol Drug Des. 2008 Jan;71(1):8-14. doi: 10.1111/j.1747-0285.2007.00602.x. Epub 2007 Dec 7.
9
Random reduction in fingerprint bit density improves compound recall in search calculations using complex reference molecules.
Chem Biol Drug Des. 2008 Jun;71(6):511-7. doi: 10.1111/j.1747-0285.2008.00664.x. Epub 2008 May 7.
10
Similarity search profiles as a diagnostic tool for the analysis of virtual screening calculations.
J Chem Inf Comput Sci. 2004 Jul-Aug;44(4):1275-81. doi: 10.1021/ci040120g.

Cited by

1
Turbo prediction: a new approach for bioactivity prediction.
J Comput Aided Mol Des. 2022 Jan;36(1):77-85. doi: 10.1007/s10822-021-00440-3. Epub 2022 Jan 21.
2
Large-Scale Comparison of Alternative Similarity Search Strategies with Varying Chemical Information Contents.
ACS Omega. 2019 Sep 5;4(12):15304-15311. doi: 10.1021/acsomega.9b02470. eCollection 2019 Sep 17.
3
Statistical-based database fingerprint: chemical space dependent representation of compound databases.
J Cheminform. 2018 Nov 22;10(1):55. doi: 10.1186/s13321-018-0311-x.
4
CarcinoPred-EL: Novel models for predicting the carcinogenicity of chemicals using molecular fingerprints and ensemble learning methods.
Sci Rep. 2017 May 18;7(1):2118. doi: 10.1038/s41598-017-02365-0.
5
Speeding up chemical searches using the inverted index: the convergence of chemoinformatics and text search methods.
J Chem Inf Model. 2012 Apr 23;52(4):891-900. doi: 10.1021/ci200552r. Epub 2012 Apr 10.
6
Comprehensive structural and functional characterization of the human kinome by protein structure modeling and ligand virtual screening.
J Chem Inf Model. 2010 Oct 25;50(10):1839-54. doi: 10.1021/ci100235n.
7
The utility of geometrical and chemical restraint information extracted from predicted ligand-binding sites in protein structure refinement.
J Struct Biol. 2011 Mar;173(3):558-69. doi: 10.1016/j.jsb.2010.09.009. Epub 2010 Sep 17.
8
Hashing algorithms and data structures for rapid searches of fingerprint vectors.
J Chem Inf Model. 2010 Aug 23;50(8):1358-68. doi: 10.1021/ci100132g.
9
When is chemical similarity significant? The statistical distribution of chemical similarity scores and its extreme values.
J Chem Inf Model. 2010 Jul 26;50(7):1205-22. doi: 10.1021/ci100010v.
10
Large scale study of multiple-molecule queries.
J Cheminform. 2009 Jun 4;1(1):7. doi: 10.1186/1758-2946-1-7.