• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PubChem3D:构象系综精度。

PubChem3D: conformer ensemble accuracy.

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, 8600 Rockville Pike, Bethesda, MD 20894, USA.

出版信息

J Cheminform. 2013 Jan 7;5(1):1. doi: 10.1186/1758-2946-5-1.

DOI:10.1186/1758-2946-5-1
PMID:23289532
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3547714/
Abstract

BACKGROUND

PubChem is a free and publicly available resource containing substance descriptions and their associated biological activity information. PubChem3D is an extension to PubChem containing computationally-derived three-dimensional (3-D) structures of small molecules. All the tools and services that are a part of PubChem3D rely upon the quality of the 3-D conformer models. Construction of the conformer models currently available in PubChem3D involves a clustering stage to sample the conformational space spanned by the molecule. While this stage allows one to downsize the conformer models to more manageable size, it may result in a loss of the ability to reproduce experimentally determined "bioactive" conformations, for example, found for PDB ligands. This study examines the extent of this accuracy loss and considers its effect on the 3-D similarity analysis of molecules.

RESULTS

The conformer models consisting of up to 100,000 conformers per compound were generated for 47,123 small molecules whose structures were experimentally determined, and the conformers in each conformer model were clustered to reduce the size of the conformer model to a maximum of 500 conformers per molecule. The accuracy of the conformer models before and after clustering was evaluated using five different measures: root-mean-square distance (RMSD), shape-optimized shape-Tanimoto (STST-opt) and combo-Tanimoto (ComboTST-opt), and color-optimized color-Tanimoto (CTCT-opt) and combo-Tanimoto (ComboTCT-opt). On average, the effect of clustering decreased the conformer model accuracy, increasing the conformer ensemble's RMSD to the bioactive conformer (by 0.18 ± 0.12 Å), and decreasing the STST-opt, ComboTST-opt, CTCT-opt, and ComboTCT-opt scores (by 0.04 ± 0.03, 0.16 ± 0.09, 0.09 ± 0.05, and 0.15 ± 0.09, respectively).

CONCLUSION

This study shows the RMSD accuracy performance of the PubChem3D conformer models is operating as designed. In addition, the effect of PubChem3D sampling on 3-D similarity measures shows that there is a linear degradation of average accuracy with respect to molecular size and flexibility. Generally speaking, one can likely expect the worst-case minimum accuracy of 90% or more of the PubChem3D ensembles to be 0.75, 1.09, 0.43, and 1.13, in terms of STST-opt, ComboTST-opt, CTCT-opt, and ComboTCT-opt, respectively. This expected accuracy improves linearly as the molecule becomes smaller or less flexible.

摘要

背景

PubChem 是一个免费的、公开的资源,包含物质描述及其相关生物活性信息。PubChem3D 是 PubChem 的一个扩展,包含了小分子的计算三维(3-D)结构。PubChem3D 中的所有工具和服务都依赖于 3-D 构象模型的质量。目前在 PubChem3D 中构建构象模型涉及到一个聚类阶段,用于采样分子所跨越的构象空间。虽然这个阶段可以将构象模型缩小到更易于管理的大小,但它可能会导致无法重现实验确定的“生物活性”构象的能力,例如pdb 配体中的构象。本研究检查了这种准确性损失的程度,并考虑了其对分子 3-D 相似性分析的影响。

结果

为 47123 个实验确定结构的小分子生成了每个化合物最多包含 100000 个构象的构象模型,并将每个构象模型中的构象进行聚类,将构象模型的大小最大减少到每个分子 500 个构象。使用五种不同的度量标准评估聚类前后构象模型的准确性:均方根距离(RMSD)、形状优化形状-塔尼托(STST-opt)和组合-塔尼托(ComboTST-opt),以及颜色优化颜色-塔尼托(CTCT-opt)和组合-颜色-塔尼托(ComboTCT-opt)。平均而言,聚类的影响降低了构象模型的准确性,使构象集合的 RMSD 增加到生物活性构象(增加 0.18±0.12 Å),并降低了 STST-opt、ComboTST-opt、CTCT-opt 和 ComboTCT-opt 的分数(分别降低 0.04±0.03、0.16±0.09、0.09±0.05 和 0.15±0.09)。

结论

本研究表明,PubChem3D 构象模型的 RMSD 准确性性能按设计运行。此外,PubChem3D 采样对 3-D 相似性度量的影响表明,平均准确性随分子大小和灵活性的线性下降。一般来说,可以预期 PubChem3D 集合的最差情况下的最小准确性为 90%或更高,在 STST-opt、ComboTST-opt、CTCT-opt 和 ComboTCT-opt 方面分别为 0.75、1.09、0.43 和 1.13。随着分子变得更小或更灵活,这种预期的准确性会线性提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/739533ddadeb/1758-2946-5-1-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/be99f4b13dee/1758-2946-5-1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/5bf4926b82d3/1758-2946-5-1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/3a460e74be3b/1758-2946-5-1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/9a0f8c14b00b/1758-2946-5-1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/f4f7d5d049f0/1758-2946-5-1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/378868d83ba4/1758-2946-5-1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/4c377fe3ab65/1758-2946-5-1-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/72245a8b9962/1758-2946-5-1-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/32a8a09a0fe1/1758-2946-5-1-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/3bc821b73e96/1758-2946-5-1-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/26df109440fc/1758-2946-5-1-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/03ffb286f571/1758-2946-5-1-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/739533ddadeb/1758-2946-5-1-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/be99f4b13dee/1758-2946-5-1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/5bf4926b82d3/1758-2946-5-1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/3a460e74be3b/1758-2946-5-1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/9a0f8c14b00b/1758-2946-5-1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/f4f7d5d049f0/1758-2946-5-1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/378868d83ba4/1758-2946-5-1-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/4c377fe3ab65/1758-2946-5-1-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/72245a8b9962/1758-2946-5-1-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/32a8a09a0fe1/1758-2946-5-1-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/3bc821b73e96/1758-2946-5-1-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/26df109440fc/1758-2946-5-1-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/03ffb286f571/1758-2946-5-1-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a13/3547714/739533ddadeb/1758-2946-5-1-13.jpg

相似文献

1
PubChem3D: conformer ensemble accuracy.PubChem3D:构象系综精度。
J Cheminform. 2013 Jan 7;5(1):1. doi: 10.1186/1758-2946-5-1.
2
Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。
J Cheminform. 2012 Nov 7;4(1):28. doi: 10.1186/1758-2946-4-28.
3
PubChem3D: Biologically relevant 3-D similarity.PubChem3D:具有生物学相关性的三维相似性。
J Cheminform. 2011 Jul 22;3(1):26. doi: 10.1186/1758-2946-3-26.
4
PubChem3D: Conformer generation.PubChem3D:构象生成。
J Cheminform. 2011 Jan 27;3(1):4. doi: 10.1186/1758-2946-3-4.
5
PubChem3D: Similar conformers.PubChem3D:相似构象。
J Cheminform. 2011 May 9;3:13. doi: 10.1186/1758-2946-3-13.
6
PubChem3D: a new resource for scientists.PubChem3D:科学家的新资源。
J Cheminform. 2011 Sep 20;3(1):32. doi: 10.1186/1758-2946-3-32.
7
PubChem3D: Shape compatibility filtering using molecular shape quadrupoles.PubChem3D:使用分子形状四极矩进行形状兼容性过滤。
J Cheminform. 2011 Jul 20;3:25. doi: 10.1186/1758-2946-3-25.
8
Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble.动态聚类阈值可在保持生物相关集合的同时减少构象集合的大小。
J Comput Aided Mol Des. 2010 Aug;24(8):675-86. doi: 10.1007/s10822-010-9365-1. Epub 2010 May 25.
9
Conformational sampling for large-scale virtual screening: accuracy versus ensemble size.大规模虚拟筛选的构象采样:准确性与系综大小
J Chem Inf Model. 2009 Oct;49(10):2303-11. doi: 10.1021/ci9002415.
10
The comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix.用于通过均方根偏差(RMSD)矩阵对代表性构象异构体集合进行重采样的自动聚类算法比较。
J Cheminform. 2017 Mar 23;9(1):21. doi: 10.1186/s13321-017-0208-0.

引用本文的文献

1
Therapeutic Targets and Immune Mechanisms of Yinghuang Decoction in Sepsis: Integrating Network Pharmacology, Molecular Docking, and Pharmacokinetic Approaches.银翘汤治疗脓毒症的治疗靶点及免疫机制:整合网络药理学、分子对接和药代动力学方法
Int J Gen Med. 2025 Aug 27;18:4785-4801. doi: 10.2147/IJGM.S532274. eCollection 2025.
2
Theoretical Study on the Metabolic Mechanism of Heptachlor in Human Cytochrome P450 Enzymes.七氯在人细胞色素P450酶中代谢机制的理论研究
Int J Mol Sci. 2025 Feb 26;26(5):2021. doi: 10.3390/ijms26052021.
3
Borrelial phosphomannose isomerase as a cell surface localized protein that retains enzymatic activity and promotes host-pathogen interaction.

本文引用的文献

1
MMFF VI. MMFF94s option for energy minimization studies.MMFF VI。用于能量最小化研究的MMFF94s选项。
J Comput Chem. 1999 May;20(7):720-729. doi: 10.1002/(SICI)1096-987X(199905)20:7<720::AID-JCC7>3.0.CO;2-X.
2
Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。
J Cheminform. 2012 Nov 7;4(1):28. doi: 10.1186/1758-2946-4-28.
3
Conformer generation with OMEGA: learning from the data set and the analysis of failures.使用 OMEGA 生成构象:从数据集和失败分析中学习。
疏螺旋体磷酸甘露糖异构酶作为一种定位于细胞表面的蛋白质,保留酶活性并促进宿主-病原体相互作用。
mBio. 2025 Mar 12;16(3):e0360924. doi: 10.1128/mbio.03609-24. Epub 2025 Feb 11.
4
Machine-learning-assisted high-throughput identification of potent and stable neutralizing antibodies against all four dengue virus serotypes.基于机器学习的高通量鉴定针对所有 4 种登革热病毒血清型的高效且稳定中和抗体的方法。
Sci Rep. 2024 Jul 26;14(1):17165. doi: 10.1038/s41598-024-67487-8.
5
New advances in clinical application of neostigmine: no longer focusing solely on increasing skeletal muscle strength.新斯的明临床应用的新进展:不再仅仅专注于增强骨骼肌力量。
Front Pharmacol. 2023 Aug 4;14:1227496. doi: 10.3389/fphar.2023.1227496. eCollection 2023.
6
QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy.QUAM-AFM:一个通过原子力显微镜进行分子鉴定的免费数据库。
J Chem Inf Model. 2022 Mar 14;62(5):1214-1223. doi: 10.1021/acs.jcim.1c01323. Epub 2022 Mar 2.
7
Exploring Chemical Information in PubChem.探索 PubChem 中的化学信息。
Curr Protoc. 2021 Aug;1(8):e217. doi: 10.1002/cpz1.217.
8
Fast, efficient fragment-based coordinate generation for Open Babel.为开源化学软件包(Open Babel)快速、高效地生成基于片段的坐标。
J Cheminform. 2019 Aug 1;11(1):49. doi: 10.1186/s13321-019-0372-5.
9
Finding Potential Multitarget Ligands Using PubChem.利用化学物质数据库(PubChem)寻找潜在的多靶点配体。
Methods Mol Biol. 2018;1825:63-91. doi: 10.1007/978-1-4939-8639-2_2.
10
PubChem chemical structure standardization.PubChem化学结构标准化
J Cheminform. 2018 Aug 10;10(1):36. doi: 10.1186/s13321-018-0293-8.
J Chem Inf Model. 2012 Nov 26;52(11):2919-36. doi: 10.1021/ci300314k. Epub 2012 Nov 12.
4
Recognizing pitfalls in virtual screening: a critical review.认识虚拟筛选中的陷阱:批判性评价。
J Chem Inf Model. 2012 Apr 23;52(4):867-81. doi: 10.1021/ci200528d. Epub 2012 Apr 6.
5
MMDB: 3D structures and macromolecular interactions.MMDB:三维结构和大分子相互作用。
Nucleic Acids Res. 2012 Jan;40(Database issue):D461-4. doi: 10.1093/nar/gkr1162. Epub 2011 Dec 1.
6
PubChem3D: a new resource for scientists.PubChem3D:科学家的新资源。
J Cheminform. 2011 Sep 20;3(1):32. doi: 10.1186/1758-2946-3-32.
7
PubChem3D: Biologically relevant 3-D similarity.PubChem3D:具有生物学相关性的三维相似性。
J Cheminform. 2011 Jul 22;3(1):26. doi: 10.1186/1758-2946-3-26.
8
PubChem3D: Shape compatibility filtering using molecular shape quadrupoles.PubChem3D:使用分子形状四极矩进行形状兼容性过滤。
J Cheminform. 2011 Jul 20;3:25. doi: 10.1186/1758-2946-3-25.
9
PubChem3D: Similar conformers.PubChem3D:相似构象。
J Cheminform. 2011 May 9;3:13. doi: 10.1186/1758-2946-3-13.
10
PubChem3D: Diversity of shape.PubChem3D:形状的多样性。
J Cheminform. 2011 Mar 21;3:9. doi: 10.1186/1758-2946-3-9.