• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PubChem3D:构象生成。

PubChem3D: Conformer generation.

机构信息

National Center for Biotechnology Information National Library of Medicine National Institutes of Health Department of Health and Human Services 8600 Rockville Pike, Bethesda, MD 20894, USA.

出版信息

J Cheminform. 2011 Jan 27;3(1):4. doi: 10.1186/1758-2946-3-4.

DOI:10.1186/1758-2946-3-4
PMID:21272340
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3042967/
Abstract

BACKGROUND

PubChem, an open archive for the biological activities of small molecules, provides search and analysis tools to assist users in locating desired information. Many of these tools focus on the notion of chemical structure similarity at some level. PubChem3D enables similarity of chemical structure 3-D conformers to augment the existing similarity of 2-D chemical structure graphs. It is also desirable to relate theoretical 3-D descriptions of chemical structures to experimental biological activity. As such, it is important to be assured that the theoretical conformer models can reproduce experimentally determined bioactive conformations. In the present study, we investigate the effects of three primary conformer generation parameters (the fragment sampling rate, the energy window size, and force field variant) upon the accuracy of theoretical conformer models, and determined optimal settings for PubChem3D conformer model generation and conformer sampling.

RESULTS

Using the software package OMEGA from OpenEye Scientific Software, Inc., theoretical 3-D conformer models were generated for 25,972 small-molecule ligands, whose 3-D structures were experimentally determined. Different values for primary conformer generation parameters were systematically tested to find optimal settings. Employing a greater fragment sampling rate than the default did not improve the accuracy of the theoretical conformer model ensembles. An ever increasing energy window did increase the overall average accuracy, with rapid convergence observed at 10 kcal/mol and 15 kcal/mol for model building and torsion search, respectively; however, subsequent study showed that an energy threshold of 25 kcal/mol for torsion search resulted in slightly improved results for larger and more flexible structures. Exclusion of coulomb terms from the 94s variant of the Merck molecular force field (MMFF94s) in the torsion search stage gave more accurate conformer models at lower energy windows. Overall average accuracy of reproduction of bioactive conformations was remarkably linear with respect to both non-hydrogen atom count ("size") and effective rotor count ("flexibility"). Using these as independent variables, a regression equation was developed to predict the RMSD accuracy of a theoretical ensemble to reproduce bioactive conformations. The equation was modified to give a minimum RMSD conformer sampling value to help ensure that 90% of the sampled theoretical models should contain at least one conformer within the RMSD sampling value to a "bioactive" conformation.

CONCLUSION

Optimal parameters for conformer generation using OMEGA were explored and determined. An equation was developed that provides an RMSD sampling value to use that is based on the relative accuracy to reproduce bioactive conformations. The optimal conformer generation parameters and RMSD sampling values determined are used by the PubChem3D project to generate theoretical conformer models.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/8ddbfb68ab8a/1758-2946-3-4-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/d9a1d2c11348/1758-2946-3-4-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/9b55c62088aa/1758-2946-3-4-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/e33d4beb2da0/1758-2946-3-4-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/5e18330d35e9/1758-2946-3-4-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/8775d190b80c/1758-2946-3-4-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/8dd9c79f47bf/1758-2946-3-4-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/864ebee481dc/1758-2946-3-4-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/eed33108472f/1758-2946-3-4-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/8ddbfb68ab8a/1758-2946-3-4-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/d9a1d2c11348/1758-2946-3-4-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/9b55c62088aa/1758-2946-3-4-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/e33d4beb2da0/1758-2946-3-4-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/5e18330d35e9/1758-2946-3-4-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/8775d190b80c/1758-2946-3-4-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/8dd9c79f47bf/1758-2946-3-4-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/864ebee481dc/1758-2946-3-4-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/eed33108472f/1758-2946-3-4-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd2f/3042967/8ddbfb68ab8a/1758-2946-3-4-9.jpg
摘要

背景

PubChem 是一个小分子生物活性的开放档案库,提供搜索和分析工具,帮助用户定位所需信息。这些工具中的许多都侧重于某种程度上的化学结构相似性概念。PubChem3D 可以对化学结构的 3D 构象进行相似性比较,以增强现有的 2D 化学结构图的相似性。将化学结构的理论 3D 描述与实验生物活性联系起来也是很有必要的。因此,必须确保理论构象模型能够再现实验确定的生物活性构象。在本研究中,我们研究了三个主要构象生成参数(片段采样率、能量窗口大小和力场变体)对理论构象模型准确性的影响,并确定了 PubChem3D 构象模型生成和构象采样的最佳设置。

结果

使用 OpenEye Scientific Software, Inc. 的软件包 OMEGA,为 25972 种小分子配体生成了理论 3D 构象模型,这些配体的 3D 结构是通过实验确定的。系统测试了不同的主要构象生成参数值,以找到最佳设置。采用比默认值更大的片段采样率并不能提高理论构象模型集的准确性。不断增加的能量窗口确实提高了整体平均准确性,分别在模型构建和扭转搜索时以 10 kcal/mol 和 15 kcal/mol 达到快速收敛;然而,随后的研究表明,在扭转搜索时将能量阈值设置为 25 kcal/mol 对于更大和更灵活的结构会产生稍微改进的结果。在扭转搜索阶段排除 Merck 分子力场(MMFF94s)94s 变体中的库仑项,可以在较低的能量窗口下获得更准确的构象模型。生物活性构象的再现整体平均准确性与非氢原子数(“大小”)和有效转子数(“灵活性”)呈显著线性关系。使用这些作为独立变量,开发了一个回归方程来预测理论集合再现生物活性构象的 RMSD 准确性。该方程经过修改后,可以给出最小 RMSD 构象采样值,以帮助确保 90%的采样理论模型中至少有一个构象在 RMSD 采样值内到“生物活性”构象。

结论

探索并确定了使用 OMEGA 进行构象生成的最佳参数。开发了一个方程,该方程提供了一个 RMSD 采样值,该值是基于再现生物活性构象的相对准确性。确定的最佳构象生成参数和 RMSD 采样值被 PubChem3D 项目用于生成理论构象模型。

相似文献

1
PubChem3D: Conformer generation.PubChem3D:构象生成。
J Cheminform. 2011 Jan 27;3(1):4. doi: 10.1186/1758-2946-3-4.
2
PubChem3D: conformer ensemble accuracy.PubChem3D:构象系综精度。
J Cheminform. 2013 Jan 7;5(1):1. doi: 10.1186/1758-2946-5-1.
3
PubChem3D: a new resource for scientists.PubChem3D:科学家的新资源。
J Cheminform. 2011 Sep 20;3(1):32. doi: 10.1186/1758-2946-3-32.
4
PubChem3D: Similar conformers.PubChem3D:相似构象。
J Cheminform. 2011 May 9;3:13. doi: 10.1186/1758-2946-3-13.
5
PubChem3D: Biologically relevant 3-D similarity.PubChem3D:具有生物学相关性的三维相似性。
J Cheminform. 2011 Jul 22;3(1):26. doi: 10.1186/1758-2946-3-26.
6
The comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix.用于通过均方根偏差(RMSD)矩阵对代表性构象异构体集合进行重采样的自动聚类算法比较。
J Cheminform. 2017 Mar 23;9(1):21. doi: 10.1186/s13321-017-0208-0.
7
Conformational sampling for large-scale virtual screening: accuracy versus ensemble size.大规模虚拟筛选的构象采样:准确性与系综大小
J Chem Inf Model. 2009 Oct;49(10):2303-11. doi: 10.1021/ci9002415.
8
Assessment of conformational ensemble sizes necessary for specific resolutions of coverage of conformational space.评估构象空间覆盖范围的特定分辨率所需的构象集合大小。
J Chem Inf Model. 2007 Jul-Aug;47(4):1428-37. doi: 10.1021/ci7000956. Epub 2007 Jun 15.
9
Dynamic clustering threshold reduces conformer ensemble size while maintaining a biologically relevant ensemble.动态聚类阈值可在保持生物相关集合的同时减少构象集合的大小。
J Comput Aided Mol Des. 2010 Aug;24(8):675-86. doi: 10.1007/s10822-010-9365-1. Epub 2010 May 25.
10
Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis.化合物中多个构象对 3-D 相似性搜索和生物测定数据分析的影响。
J Cheminform. 2012 Nov 7;4(1):28. doi: 10.1186/1758-2946-4-28.

引用本文的文献

1
QMugs, quantum mechanical properties of drug-like molecules.QMugs,类药物分子的量子力学性质。
Sci Data. 2022 Jun 7;9(1):273. doi: 10.1038/s41597-022-01390-7.
2
GEOM, energy-annotated molecular conformations for property prediction and molecular generation.GEOM,带能量注释的分子构象,用于性质预测和分子生成。
Sci Data. 2022 Apr 21;9(1):185. doi: 10.1038/s41597-022-01288-4.
3
QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy.QUAM-AFM:一个通过原子力显微镜进行分子鉴定的免费数据库。

本文引用的文献

1
MMFF VII. Characterization of MMFF94, MMFF94s, and other widely available force fields for conformational energies and for intermolecular-interaction energies and geometries.MMFF VII. MMFF94、MMFF94s及其他广泛使用的力场在构象能、分子间相互作用能和几何结构方面的表征。
J Comput Chem. 1999 May;20(7):730-748. doi: 10.1002/(SICI)1096-987X(199905)20:7<730::AID-JCC8>3.0.CO;2-T.
2
MMFF VI. MMFF94s option for energy minimization studies.MMFF VI。用于能量最小化研究的MMFF94s选项。
J Comput Chem. 1999 May;20(7):720-729. doi: 10.1002/(SICI)1096-987X(199905)20:7<720::AID-JCC7>3.0.CO;2-X.
3
Conformer generation with OMEGA: algorithm and validation using high quality structures from the Protein Databank and Cambridge Structural Database.
J Chem Inf Model. 2022 Mar 14;62(5):1214-1223. doi: 10.1021/acs.jcim.1c01323. Epub 2022 Mar 2.
4
Exploring Chemical Information in PubChem.探索 PubChem 中的化学信息。
Curr Protoc. 2021 Aug;1(8):e217. doi: 10.1002/cpz1.217.
5
Combined In Vitro Studies and in Silico Target Fishing for the Evaluation of the Biological Activities of and .联合体外研究和计算机靶点筛选评估 和 的生物活性。
Molecules. 2018 Dec 13;23(12):3303. doi: 10.3390/molecules23123303.
6
Finding Potential Multitarget Ligands Using PubChem.利用化学物质数据库(PubChem)寻找潜在的多靶点配体。
Methods Mol Biol. 2018;1825:63-91. doi: 10.1007/978-1-4939-8639-2_2.
7
PubChem chemical structure standardization.PubChem化学结构标准化
J Cheminform. 2018 Aug 10;10(1):36. doi: 10.1186/s13321-018-0293-8.
8
An update on PUG-REST: RESTful interface for programmatic access to PubChem.PUG-REST 更新:用于对 PubChem 进行编程访问的 RESTful 接口。
Nucleic Acids Res. 2018 Jul 2;46(W1):W563-W570. doi: 10.1093/nar/gky294.
9
The comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix.用于通过均方根偏差(RMSD)矩阵对代表性构象异构体集合进行重采样的自动聚类算法比较。
J Cheminform. 2017 Mar 23;9(1):21. doi: 10.1186/s13321-017-0208-0.
10
Similar compounds versus similar conformers: complementarity between PubChem 2-D and 3-D neighboring sets.相似化合物与相似构象异构体:PubChem二维和三维相邻集之间的互补性。
J Cheminform. 2016 Nov 4;8:62. doi: 10.1186/s13321-016-0163-1. eCollection 2016.
使用 OMEGA 生成构象:使用来自蛋白质数据库和剑桥结构数据库的高质量结构进行算法验证。
J Chem Inf Model. 2010 Apr 26;50(4):572-84. doi: 10.1021/ci100031x.
4
An overview of the PubChem BioAssay resource.PubChem 生物测定资源概述。
Nucleic Acids Res. 2010 Jan;38(Database issue):D255-66. doi: 10.1093/nar/gkp965. Epub 2009 Nov 19.
5
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2010 Jan;38(Database issue):D5-16. doi: 10.1093/nar/gkp967. Epub 2009 Nov 12.
6
PubChem: a public information system for analyzing bioactivities of small molecules.PubChem:一个用于分析小分子生物活性的公共信息系统。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W623-33. doi: 10.1093/nar/gkp456. Epub 2009 Jun 4.
7
Assessment of conformational ensemble sizes necessary for specific resolutions of coverage of conformational space.评估构象空间覆盖范围的特定分辨率所需的构象集合大小。
J Chem Inf Model. 2007 Jul-Aug;47(4):1428-37. doi: 10.1021/ci7000956. Epub 2007 Jun 15.
8
MMDB: annotating protein sequences with Entrez's 3D-structure database.MMDB:使用Entrez的三维结构数据库注释蛋白质序列。
Nucleic Acids Res. 2007 Jan;35(Database issue):D298-300. doi: 10.1093/nar/gkl952. Epub 2006 Nov 29.
9
Impact of conformational flexibility on three-dimensional similarity searching using correlation vectors.构象灵活性对使用相关向量进行三维相似性搜索的影响。
J Chem Inf Model. 2006 Nov-Dec;46(6):2324-32. doi: 10.1021/ci050075s.
10
Comparative performance assessment of the conformational model generators omega and catalyst: a large-scale survey on the retrieval of protein-bound ligand conformations.构象模型生成器Omega和Catalyst的比较性能评估:关于蛋白质结合配体构象检索的大规模调查
J Chem Inf Model. 2006 Jul-Aug;46(4):1848-61. doi: 10.1021/ci060084g.