化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。

Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, 8600 Rockville Pike, Bethesda, 20894, MD, USA.

出版信息

J Cheminform. 2012 Nov 7;4(1):28. doi: 10.1186/1758-2946-4-28.

DOI:10.1186/1758-2946-4-28

PMID:23134593

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3537644/

Abstract

BACKGROUND

To improve the utility of PubChem, a public repository containing biological activities of small molecules, the PubChem3D project adds computationally-derived three-dimensional (3-D) descriptions to the small-molecule records contained in the PubChem Compound database and provides various search and analysis tools that exploit 3-D molecular similarity. Therefore, the efficient use of PubChem3D resources requires an understanding of the statistical and biological meaning of computed 3-D molecular similarity scores between molecules.

RESULTS

The present study investigated effects of employing multiple conformers per compound upon the 3-D similarity scores between ten thousand randomly selected biologically-tested compounds (10-K set) and between non-inactive compounds in a given biological assay (156-K set). When the "best-conformer-pair" approach, in which a 3-D similarity score between two compounds is represented by the greatest similarity score among all possible conformer pairs arising from a compound pair, was employed with ten diverse conformers per compound, the average 3-D similarity scores for the 10-K set increased by 0.11, 0.09, 0.15, 0.16, 0.07, and 0.18 for STST-opt, CTST-opt, ComboTST-opt, STCT-opt, CTCT-opt, and ComboTCT-opt, respectively, relative to the corresponding averages computed using a single conformer per compound. Interestingly, the best-conformer-pair approach also increased the average 3-D similarity scores for the non-inactive-non-inactive (NN) pairs for a given assay, by comparable amounts to those for the random compound pairs, although some assays showed a pronounced increase in the per-assay NN-pair 3-D similarity scores, compared to the average increase for the random compound pairs.

CONCLUSION

These results suggest that the use of ten diverse conformers per compound in PubChem bioassay data analysis using 3-D molecular similarity is not expected to increase the separation of non-inactive from random and inactive spaces "on average", although some assays show a noticeable separation between the non-inactive and random spaces when multiple conformers are used for each compound. The present study is a critical next step to understand effects of conformational diversity of the molecules upon the 3-D molecular similarity and its application to biological activity data analysis in PubChem. The results of this study may be helpful to build search and analysis tools that exploit 3-D molecular similarity between compounds archived in PubChem and other molecular libraries in a more efficient way.

摘要

背景

为了提高 PubChem 的实用性，PubChem3D 项目为 PubChem 化合物数据库中包含的小分子记录添加了计算得出的三维 (3-D) 描述，并提供了各种利用 3-D 分子相似性的搜索和分析工具。因此，有效利用 PubChem3D 资源需要了解分子间计算得出的 3-D 分子相似性得分的统计和生物学意义。

结果

本研究调查了在一万个随机选择的经过生物测试的化合物（10-K 集）和给定生物测定中的非活性化合物（156-K 集）之间，每个化合物采用多个构象对时，对 3-D 相似性得分的影响。当采用“最佳构象对”方法时，即两个化合物之间的 3-D 相似性得分由化合物对中所有可能构象对的最大相似性得分表示，每个化合物采用十种不同构象时，10-K 集的平均 3-D 相似性得分相对于每个化合物采用单一构象时，分别增加了 0.11、0.09、0.15、0.16、0.07 和 0.18，对于 STST-opt、CTST-opt、ComboTST-opt、STCT-opt、CTCT-opt 和 ComboTCT-opt。有趣的是，最佳构象对方法还增加了给定测定中非活性-非活性（NN）对的平均 3-D 相似性得分，增加幅度与随机化合物对的相似，尽管一些测定显示出与随机化合物对的平均增加相比，NN 对 3-D 相似性得分的明显增加。

结论

这些结果表明，在使用 3-D 分子相似性对 PubChem 生物测定数据进行分析时，每个化合物采用十种不同构象，预计不会“平均”增加非活性与随机和无活性空间的分离，尽管一些测定在每个化合物采用多种构象时显示出非活性和随机空间之间的明显分离。本研究是理解分子构象多样性对 3-D 分子相似性及其在 PubChem 中生物活性数据分析中的应用的关键下一步。本研究的结果可能有助于构建更有效地利用 PubChem 和其他分子库中化合物之间的 3-D 分子相似性的搜索和分析工具。

相似文献

Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.

J Cheminform. 2012 Nov 7;4(1):28. doi: 10.1186/1758-2946-4-28.

PubChem3D: Biologically relevant 3-D similarity.

J Cheminform. 2011 Jul 22;3(1):26. doi: 10.1186/1758-2946-3-26.

PubChem3D: conformer ensemble accuracy.

J Cheminform. 2013 Jan 7;5(1):1. doi: 10.1186/1758-2946-5-1.

PubChem3D: Similar conformers.

J Cheminform. 2011 May 9;3:13. doi: 10.1186/1758-2946-3-13.

PubChem3D: a new resource for scientists.

J Cheminform. 2011 Sep 20;3(1):32. doi: 10.1186/1758-2946-3-32.

PubChem3D: Shape compatibility filtering using molecular shape quadrupoles.

J Cheminform. 2011 Jul 20;3:25. doi: 10.1186/1758-2946-3-25.

J Cheminform. 2016 Nov 4;8:62. doi: 10.1186/s13321-016-0163-1. eCollection 2016.

PubChem3D: Conformer generation.

J Cheminform. 2011 Jan 27;3(1):4. doi: 10.1186/1758-2946-3-4.

PubChem3D: Diversity of shape.

J Cheminform. 2011 Mar 21;3:9. doi: 10.1186/1758-2946-3-9.

PubChem structure-activity relationship (SAR) clusters.

J Cheminform. 2015 Jul 7;7:33. doi: 10.1186/s13321-015-0070-x. eCollection 2015.

引用本文的文献

Chemical Constituents of Stinging Nettle ( L.): A Comprehensive Review on Phenolic and Polyphenolic Compounds and Their Bioactivity.

Int J Mol Sci. 2024 Mar 18;25(6):3430. doi: 10.3390/ijms25063430.

Caffeoylquinic acids: chemistry, biosynthesis, occurrence, analytical challenges, and bioactivity.

Plant J. 2021 Sep;107(5):1299-1319. doi: 10.1111/tpj.15390. Epub 2021 Jul 23.

Integration of mass spectral fingerprinting analysis with precursor ion (MS1) quantification for the characterisation of botanical extracts: application to extracts of Centella asiatica (L.) Urban.

Phytochem Anal. 2020 Nov;31(6):722-738. doi: 10.1002/pca.2936. Epub 2020 Apr 12.

- Phytochemistry and mechanisms of neuroprotection and cognitive enhancement.

Phytochem Rev. 2018 Feb;17(1):161-194. doi: 10.1007/s11101-017-9528-y. Epub 2017 Sep 20.

Teratological and Behavioral Screening of the National Toxicology Program 91-Compound Library in Zebrafish (Danio rerio).

Toxicol Sci. 2019 Jan 1;167(1):77-91. doi: 10.1093/toxsci/kfy266.

Finding Potential Multitarget Ligands Using PubChem.

Methods Mol Biol. 2018;1825:63-91. doi: 10.1007/978-1-4939-8639-2_2.

An update on PUG-REST: RESTful interface for programmatic access to PubChem.

Nucleic Acids Res. 2018 Jul 2;46(W1):W563-W570. doi: 10.1093/nar/gky294.

J Cheminform. 2016 Nov 4;8:62. doi: 10.1186/s13321-016-0163-1. eCollection 2016.

A dual-targeting approach to inhibit Brucella abortus replication in human cells.

Sci Rep. 2016 Oct 21;6:35835. doi: 10.1038/srep35835.

In silico Screening and Evaluation of the Anticonvulsant Activity of Docosahexaenoic Acid-Like Molecules in Experimental Models of Seizures.

Iran Biomed J. 2017 Jan;21(1):32-9. doi: 10.6091/.21.1.32. Epub 2016 Sep 4.

本文引用的文献

PubChem3D: conformer ensemble accuracy.

J Cheminform. 2013 Jan 7;5(1):1. doi: 10.1186/1758-2946-5-1.

Recognizing pitfalls in virtual screening: a critical review.

J Chem Inf Model. 2012 Apr 23;52(4):867-81. doi: 10.1021/ci200528d. Epub 2012 Apr 6.

PubChem's BioAssay Database.

Nucleic Acids Res. 2012 Jan;40(Database issue):D400-12. doi: 10.1093/nar/gkr1132. Epub 2011 Dec 2.

Database resources of the National Center for Biotechnology Information.

Nucleic Acids Res. 2012 Jan;40(Database issue):D13-25. doi: 10.1093/nar/gkr1184. Epub 2011 Dec 2.

PubChem3D: a new resource for scientists.

J Cheminform. 2011 Sep 20;3(1):32. doi: 10.1186/1758-2946-3-32.

PubChem3D: Biologically relevant 3-D similarity.

J Cheminform. 2011 Jul 22;3(1):26. doi: 10.1186/1758-2946-3-26.

PubChem3D: Shape compatibility filtering using molecular shape quadrupoles.

J Cheminform. 2011 Jul 20;3:25. doi: 10.1186/1758-2946-3-25.

PubChem3D: Similar conformers.

J Cheminform. 2011 May 9;3:13. doi: 10.1186/1758-2946-3-13.

PubChem3D: Diversity of shape.

J Cheminform. 2011 Mar 21;3:9. doi: 10.1186/1758-2946-3-9.

PubChem3D: Conformer generation.

J Cheminform. 2011 Jan 27;3(1):4. doi: 10.1186/1758-2946-3-4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。

Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, 8600 Rockville Pike, Bethesda, 20894, MD, USA.

出版信息

J Cheminform. 2012 Nov 7;4(1):28. doi: 10.1186/1758-2946-4-28.

DOI:10.1186/1758-2946-4-28

PMID:23134593

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3537644/

Abstract

BACKGROUND

RESULTS

CONCLUSION

摘要

化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。

Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。

Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献