• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PubChem3D:具有生物学相关性的三维相似性。

PubChem3D: Biologically relevant 3-D similarity.

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, 8600 Rockville Pike, Bethesda, MD 20894, USA.

出版信息

J Cheminform. 2011 Jul 22;3(1):26. doi: 10.1186/1758-2946-3-26.

DOI:10.1186/1758-2946-3-26
PMID:21781288
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3223603/
Abstract

BACKGROUND

The use of 3-D similarity techniques in the analysis of biological data and virtual screening is pervasive, but what is a biologically meaningful 3-D similarity value? Can one find statistically significant separation between "active/active" and "active/inactive" spaces? These questions are explored using 734,486 biologically tested chemical structures, 1,389 biological assay data sets, and six different 3-D similarity types utilized by PubChem analysis tools.

RESULTS

The similarity value distributions of 269.7 billion unique conformer pairs from 734,486 biologically tested compounds (all-against-all) from PubChem were utilized to help work towards an answer to the question: what is a biologically meaningful 3-D similarity score? The average and standard deviation for the six similarity measures STST-opt, CTST-opt, ComboTST-opt, STCT-opt, CTCT-opt, and ComboTCT-opt were 0.54 ± 0.10, 0.07 ± 0.05, 0.62 ± 0.13, 0.41 ± 0.11, 0.18 ± 0.06, and 0.59 ± 0.14, respectively. Considering that this random distribution of biologically tested compounds was constructed using a single theoretical conformer per compound (the "default" conformer provided by PubChem), further study may be necessary using multiple diverse conformers per compound; however, given the breadth of the compound set, the single conformer per compound results may still apply to the case of multi-conformer per compound 3-D similarity value distributions. As such, this work is a critical step, covering a very wide corpus of chemical structures and biological assays, creating a statistical framework to build upon.The second part of this study explored the question of whether it was possible to realize a statistically meaningful 3-D similarity value separation between reputed biological assay "inactives" and "actives". Using the terminology of noninactive-noninactive (NN) pairs and the noninactive-inactive (NI) pairs to represent comparison of the "active/active" and "active/inactive" spaces, respectively, each of the 1,389 biological assays was examined by their 3-D similarity score differences between the NN and NI pairs and analyzed across all assays and by assay category types. While a consistent trend of separation was observed, this result was not statistically unambiguous after considering the respective standard deviations. While not all "actives" in a biological assay are amenable to this type of analysis, e.g., due to different mechanisms of action or binding configurations, the ambiguous separation may also be due to employing a single conformer per compound in this study. With that said, there were a subset of biological assays where a clear separation between the NN and NI pairs found. In addition, use of combo Tanimoto (ComboT) alone, independent of superposition optimization type, appears to be the most efficient 3-D score type in identifying these cases.

CONCLUSION

This study provides a statistical guideline for analyzing biological assay data in terms of 3-D similarity and PubChem structure-activity analysis tools. When using a single conformer per compound, a relatively small number of assays appear to be able to separate "active/active" space from "active/inactive" space.

摘要

背景

在生物数据和虚拟筛选的分析中,三维相似性技术的使用非常普遍,但是什么是具有生物学意义的三维相似性值呢?是否可以在“活性/活性”和“活性/非活性”空间之间找到具有统计学意义的分离?本研究使用 PubChem 分析工具的 6 种不同的 3-D 相似性类型,通过 734,486 种经过生物学测试的化学结构和 1,389 个生物测定数据集来探索这些问题。

结果

利用 PubChem 中 734,486 种经过生物学测试的化合物(所有化合物对所有化合物)的 269.7 亿个独特构象对的相似性值分布,有助于回答以下问题:什么是具有生物学意义的三维相似性评分?6 种相似性度量 STST-opt、CTST-opt、ComboTST-opt、STCT-opt、CTCT-opt 和 ComboTCT-opt 的平均值和标准差分别为 0.54±0.10、0.07±0.05、0.62±0.13、0.41±0.11、0.18±0.06 和 0.59±0.14。考虑到使用单一理论构象(PubChem 提供的“默认”构象)构建这种经过生物学测试的化合物的随机分布,可能需要进一步研究使用每种化合物的多个不同构象;但是,鉴于化合物集的广度,单一构象化合物的结果可能仍然适用于具有多种构象的化合物的 3-D 相似值分布。因此,这项工作是一个关键步骤,涵盖了非常广泛的化学结构和生物测定数据集,为进一步建立统计框架奠定了基础。本研究的第二部分探讨了是否有可能在声誉良好的生物测定“非活性”和“活性”之间实现具有统计学意义的三维相似性值分离的问题。使用术语非非活性-非非活性(NN)对和非非活性-活性(NI)对分别表示“活性/活性”和“活性/非活性”空间的比较,分别对 1,389 个生物测定进行了分析,比较了 NN 和 NI 对之间的 3-D 相似性评分差异,并对所有测定和测定类别类型进行了分析。尽管观察到了一致的分离趋势,但在考虑各自的标准差后,这一结果并不具有统计学意义。虽然生物测定中的所有“活性”物质都不能进行这种类型的分析,例如由于不同的作用机制或结合构象,但这种不明确的分离也可能是由于在这项研究中对每种化合物都使用了单一构象。话虽如此,有一部分生物测定能够清楚地分离 NN 和 NI 对。此外,单独使用组合 Tanimoto(ComboT),而不考虑叠加优化类型,似乎是识别这些情况的最有效的 3-D 评分类型。

结论

本研究为分析生物测定数据的三维相似性和 PubChem 结构-活性分析工具提供了统计指南。当对每种化合物使用单一构象时,似乎只有相对较少的测定能够将“活性/活性”空间与“活性/非活性”空间分离。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/35ffc19700d1/1758-2946-3-26-16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/421b5717abe7/1758-2946-3-26-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/8cc77986de17/1758-2946-3-26-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/382bd1e6808d/1758-2946-3-26-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/7e32a64bfcfa/1758-2946-3-26-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/182fba9f2287/1758-2946-3-26-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/a9b3f113e35d/1758-2946-3-26-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/8c72832dd010/1758-2946-3-26-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/9f2281e423b1/1758-2946-3-26-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/243f872a0ee0/1758-2946-3-26-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/8a6641cbdb85/1758-2946-3-26-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/1717e1094b9a/1758-2946-3-26-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/c40de387ae9f/1758-2946-3-26-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/7fdf2febf6d7/1758-2946-3-26-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/4e6f27cfc034/1758-2946-3-26-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/204d066e7a55/1758-2946-3-26-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/35ffc19700d1/1758-2946-3-26-16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/421b5717abe7/1758-2946-3-26-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/8cc77986de17/1758-2946-3-26-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/382bd1e6808d/1758-2946-3-26-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/7e32a64bfcfa/1758-2946-3-26-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/182fba9f2287/1758-2946-3-26-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/a9b3f113e35d/1758-2946-3-26-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/8c72832dd010/1758-2946-3-26-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/9f2281e423b1/1758-2946-3-26-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/243f872a0ee0/1758-2946-3-26-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/8a6641cbdb85/1758-2946-3-26-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/1717e1094b9a/1758-2946-3-26-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/c40de387ae9f/1758-2946-3-26-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/7fdf2febf6d7/1758-2946-3-26-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/4e6f27cfc034/1758-2946-3-26-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/204d066e7a55/1758-2946-3-26-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/657c/3223603/35ffc19700d1/1758-2946-3-26-16.jpg

相似文献

1
PubChem3D: Biologically relevant 3-D similarity.PubChem3D:具有生物学相关性的三维相似性。
J Cheminform. 2011 Jul 22;3(1):26. doi: 10.1186/1758-2946-3-26.
2
Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。
J Cheminform. 2012 Nov 7;4(1):28. doi: 10.1186/1758-2946-4-28.
3
PubChem3D: conformer ensemble accuracy.PubChem3D:构象系综精度。
J Cheminform. 2013 Jan 7;5(1):1. doi: 10.1186/1758-2946-5-1.
4
PubChem3D: Similar conformers.PubChem3D:相似构象。
J Cheminform. 2011 May 9;3:13. doi: 10.1186/1758-2946-3-13.
5
PubChem3D: Shape compatibility filtering using molecular shape quadrupoles.PubChem3D:使用分子形状四极矩进行形状兼容性过滤。
J Cheminform. 2011 Jul 20;3:25. doi: 10.1186/1758-2946-3-25.
6
PubChem3D: a new resource for scientists.PubChem3D:科学家的新资源。
J Cheminform. 2011 Sep 20;3(1):32. doi: 10.1186/1758-2946-3-32.
7
PubChem3D: Conformer generation.PubChem3D:构象生成。
J Cheminform. 2011 Jan 27;3(1):4. doi: 10.1186/1758-2946-3-4.
8
PubChem3D: Diversity of shape.PubChem3D:形状的多样性。
J Cheminform. 2011 Mar 21;3:9. doi: 10.1186/1758-2946-3-9.
9
Similar compounds versus similar conformers: complementarity between PubChem 2-D and 3-D neighboring sets.相似化合物与相似构象异构体:PubChem二维和三维相邻集之间的互补性。
J Cheminform. 2016 Nov 4;8:62. doi: 10.1186/s13321-016-0163-1. eCollection 2016.
10
Teaching an Old Dog New Tricks: Strategies That Improve Early Recognition in Similarity-Based Virtual Screening.老狗学新招:提高基于相似性虚拟筛选早期识别能力的策略
Front Chem. 2019 Oct 23;7:701. doi: 10.3389/fchem.2019.00701. eCollection 2019.

引用本文的文献

1
Exploring Chemical Information in PubChem.探索 PubChem 中的化学信息。
Curr Protoc. 2021 Aug;1(8):e217. doi: 10.1002/cpz1.217.
2
Finding Potential Multitarget Ligands Using PubChem.利用化学物质数据库(PubChem)寻找潜在的多靶点配体。
Methods Mol Biol. 2018;1825:63-91. doi: 10.1007/978-1-4939-8639-2_2.
3
Similar compounds versus similar conformers: complementarity between PubChem 2-D and 3-D neighboring sets.相似化合物与相似构象异构体:PubChem二维和三维相邻集之间的互补性。

本文引用的文献

1
PubChem3D: Shape compatibility filtering using molecular shape quadrupoles.PubChem3D:使用分子形状四极矩进行形状兼容性过滤。
J Cheminform. 2011 Jul 20;3:25. doi: 10.1186/1758-2946-3-25.
2
PubChem3D: Similar conformers.PubChem3D:相似构象。
J Cheminform. 2011 May 9;3:13. doi: 10.1186/1758-2946-3-13.
3
PubChem3D: Diversity of shape.PubChem3D:形状的多样性。
J Cheminform. 2016 Nov 4;8:62. doi: 10.1186/s13321-016-0163-1. eCollection 2016.
4
Leveraging 3D chemical similarity, target and phenotypic data in the identification of drug-protein and drug-adverse effect associations.利用3D化学相似性、靶点和表型数据来识别药物-蛋白质和药物-不良反应关联。
J Cheminform. 2016 Jul 1;8:35. doi: 10.1186/s13321-016-0147-1. eCollection 2016.
5
PubChem Substance and Compound databases.美国国立医学图书馆化学物质数据库和化合物数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1202-13. doi: 10.1093/nar/gkv951. Epub 2015 Sep 22.
6
PubChem structure-activity relationship (SAR) clusters.美国国立医学图书馆化学数据库结构-活性关系(SAR)聚类
J Cheminform. 2015 Jul 7;7:33. doi: 10.1186/s13321-015-0070-x. eCollection 2015.
7
3D pharmacophoric similarity improves multi adverse drug event identification in pharmacovigilance.3D药效团相似性可改善药物警戒中多种不良药物事件的识别。
Sci Rep. 2015 Mar 6;5:8809. doi: 10.1038/srep08809.
8
Global analysis reveals families of chemical motifs enriched for HERG inhibitors.全局分析揭示了富含HERG抑制剂的化学基序家族。
PLoS One. 2015 Feb 20;10(2):e0118324. doi: 10.1371/journal.pone.0118324. eCollection 2015.
9
Identification and Characterization of Noncovalent Interactions That Drive Binding and Specificity in DD-Peptidases and β-Lactamases.驱动DD-肽酶和β-内酰胺酶结合及特异性的非共价相互作用的鉴定与表征
J Chem Theory Comput. 2014 Feb 11;10(2):855-864. doi: 10.1021/ct400968v. Epub 2014 Jan 10.
10
PubChem3D: conformer ensemble accuracy.PubChem3D:构象系综精度。
J Cheminform. 2013 Jan 7;5(1):1. doi: 10.1186/1758-2946-5-1.
J Cheminform. 2011 Mar 21;3:9. doi: 10.1186/1758-2946-3-9.
4
Use of structure-activity landscape index curves and curve integrals to evaluate the performance of multiple machine learning prediction models.利用结构-活性景观指数曲线和曲线积分来评估多个机器学习预测模型的性能。
J Cheminform. 2011 Feb 7;3(1):7. doi: 10.1186/1758-2946-3-7.
5
Design of multitarget activity landscapes that capture hierarchical activity cliff distributions.设计多靶点活性景观以捕获分层活性悬崖分布。
J Chem Inf Model. 2011 Feb 28;51(2):258-66. doi: 10.1021/ci100477m. Epub 2011 Jan 28.
6
PubChem3D: Conformer generation.PubChem3D:构象生成。
J Cheminform. 2011 Jan 27;3(1):4. doi: 10.1186/1758-2946-3-4.
7
Chemical substitutions that introduce activity cliffs across different compound classes and biological targets.在不同化合物类别和生物靶标中引入活性悬崖的化学取代。
J Chem Inf Model. 2010 Jul 26;50(7):1248-56. doi: 10.1021/ci1001845.
8
Molecular shape and medicinal chemistry: a perspective.分子形状与药物化学:一个视角。
J Med Chem. 2010 May 27;53(10):3862-86. doi: 10.1021/jm900818s.
9
An overview of the PubChem BioAssay resource.PubChem 生物测定资源概述。
Nucleic Acids Res. 2010 Jan;38(Database issue):D255-66. doi: 10.1093/nar/gkp965. Epub 2009 Nov 19.
10
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2010 Jan;38(Database issue):D5-16. doi: 10.1093/nar/gkp967. Epub 2009 Nov 12.