• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

警示用户:评估主要化学数据库之间的差异。

Caveat Usor: Assessing Differences between Major Chemistry Databases.

机构信息

IUPHAR/BPS Guide to PHARMACOLOGY, Deanery of Biomedical Sciences, University of Edinburgh, Edinburgh, EH8 9XD, UK.

出版信息

ChemMedChem. 2018 Mar 20;13(6):470-481. doi: 10.1002/cmdc.201700724. Epub 2018 Feb 23.

DOI:10.1002/cmdc.201700724
PMID:29451740
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5900829/
Abstract

The three databases of PubChem, ChemSpider, and UniChem capture the majority of open chemical structure records with February 2018 totals of 95, 63, and 154 million, respectively. Collectively, they constitute a massively enabling resource for cheminformatics, chemical biology, and drug discovery. As meta-portals, they subsume and link out to the major proportion of public bioactivity data extracted from the literature and screening center assay results. Therefore, they not only present three different entry points, but the many subsumed independent resources present a fourth entry point in the form of standalone databases. Because this creates a complex picture it is important for users to have at least some appreciation of differential content to enable utility judgments for the tasks at hand. This turns out to be challenging. By comparing the three resources in detail, this review assesses their differences, some of which are not obvious. This includes the fact that coverage is significantly different between the 587, 282, and 38 contributing sources, respectively. This not only presents the "who-has-what" question, but also the reason "why" any particular inclusion is considered valuable is rarely made explicit. Also confusing is that sources nominally in common (i.e., having the same submitter name) can have significantly different structure counts, not only in each of the three but also from their standalone instantiations. Assessing a series of examples indicates that differences in loading dates and structural standardization are the main causes of this inter-portal discordance.

摘要

PubChem、ChemSpider 和 UniChem 这三个数据库分别收录了 9500 万、630 万和 1.54 亿个开放的化学结构记录,截至 2018 年 2 月,这三个数据库占据了大多数的化学结构记录。它们共同构成了化学信息学、化学生物学和药物发现的一个大规模启用资源。作为元门户,它们包含并链接了从文献和筛选中心测定结果中提取的大部分公共生物活性数据。因此,它们不仅提供了三个不同的切入点,而且许多包含的独立资源以独立数据库的形式提供了第四个切入点。由于这造成了一个复杂的情况,用户至少需要对差异内容有一定的了解,以便对当前任务的实用性进行判断。事实证明,这具有一定的挑战性。通过详细比较这三个资源,本综述评估了它们之间的差异,其中一些差异并不明显。这包括以下事实:这三个资源的收录范围分别存在显著差异,分别为 587、282 和 38 个贡献源。这不仅提出了“谁有什么”的问题,而且为什么任何特定的收录都被认为是有价值的原因也很少被明确说明。同样令人困惑的是,名义上相同的来源(即具有相同的提交者名称)的结构数量可能存在显著差异,不仅在这三个数据库中如此,而且在它们的独立实例中也是如此。评估一系列示例表明,加载日期和结构标准化方面的差异是导致这种门户之间不一致的主要原因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/ae82beefdd0d/CMDC-13-470-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/368e92a02f90/CMDC-13-470-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/aff052491854/CMDC-13-470-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/943fb7e4da75/CMDC-13-470-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/48c5c44ec77b/CMDC-13-470-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/ef0e933b1f26/CMDC-13-470-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/778e560ddd8b/CMDC-13-470-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/97f5853320ca/CMDC-13-470-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/84370bdf3dfd/CMDC-13-470-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/ae82beefdd0d/CMDC-13-470-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/368e92a02f90/CMDC-13-470-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/aff052491854/CMDC-13-470-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/943fb7e4da75/CMDC-13-470-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/48c5c44ec77b/CMDC-13-470-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/ef0e933b1f26/CMDC-13-470-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/778e560ddd8b/CMDC-13-470-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/97f5853320ca/CMDC-13-470-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/84370bdf3dfd/CMDC-13-470-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1a8/5900829/ae82beefdd0d/CMDC-13-470-g010.jpg

相似文献

1
Caveat Usor: Assessing Differences between Major Chemistry Databases.警示用户:评估主要化学数据库之间的差异。
ChemMedChem. 2018 Mar 20;13(6):470-481. doi: 10.1002/cmdc.201700724. Epub 2018 Feb 23.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Bridging glycoinformatics and cheminformatics: integration efforts between GlyCosmos and PubChem.桥接糖生物信息学和化学信息学:GlyCosmos 和 PubChem 之间的整合工作。
Glycobiology. 2023 Jun 21;33(6):454-463. doi: 10.1093/glycob/cwad028.
4
Parallel worlds of public and commercial bioactive chemistry data.公共和商业生物活性化学数据的平行世界。
J Med Chem. 2015 Mar 12;58(5):2068-76. doi: 10.1021/jm5011308. Epub 2014 Dec 4.
5
Mapping between databases of compounds and protein targets.化合物数据库与蛋白质靶点之间的映射。
Methods Mol Biol. 2012;910:145-64. doi: 10.1007/978-1-61779-965-5_8.
6
Exploring Chemical Information in PubChem.探索 PubChem 中的化学信息。
Curr Protoc. 2021 Aug;1(8):e217. doi: 10.1002/cpz1.217.
7
Enalos Suite: New Cheminformatics Platform for Drug Discovery and Computational Toxicology.Enalos套件:用于药物发现和计算毒理学的新化学信息学平台。
Methods Mol Biol. 2018;1800:287-311. doi: 10.1007/978-1-4939-7899-1_14.
8
PubChem Protein, Gene, Pathway, and Taxonomy Data Collections: Bridging Biology and Chemistry through Target-Centric Views of PubChem Data.PubChem蛋白质、基因、通路和分类学数据集:通过以靶点为中心的PubChem数据视图连接生物学与化学。
J Mol Biol. 2022 Jun 15;434(11):167514. doi: 10.1016/j.jmb.2022.167514. Epub 2022 Feb 25.
9
PubChem applications in drug discovery: a bibliometric analysis.《PubChem在药物发现中的应用:文献计量分析》
Drug Discov Today. 2014 Nov;19(11):1751-1756. doi: 10.1016/j.drudis.2014.08.008. Epub 2014 Aug 27.
10
Making every SAR point count: the development of Chemistry Connect for the large-scale integration of structure and bioactivity data.充分利用每一个 SAR 点:为大规模整合结构和生物活性数据而开发的 Chemistry Connect。
Drug Discov Today. 2011 Dec;16(23-24):1019-30. doi: 10.1016/j.drudis.2011.10.005. Epub 2011 Oct 14.

引用本文的文献

1
Molecular Docking, Molecular Dynamics Simulations, and Free Energy Calculation Insights into the Binding Mechanism between VS-4718 and Focal Adhesion Kinase.分子对接、分子动力学模拟及自由能计算:深入了解VS-4718与粘着斑激酶之间的结合机制
ACS Omega. 2022 Aug 31;7(36):32442-32456. doi: 10.1021/acsomega.2c03951. eCollection 2022 Sep 13.
2
Will the chemical probes please stand up?化学探针请站起来好吗?
RSC Med Chem. 2021 Jul 16;12(8):1428-1441. doi: 10.1039/d1md00138h. eCollection 2021 Aug 18.
3
Selective Inhibition of HDAC1 by Macrocyclic Polypeptide for the Treatment of Glioblastoma: A Binding Mechanistic Analysis Based on Molecular Dynamics.

本文引用的文献

1
The IUPHAR/BPS Guide to PHARMACOLOGY in 2018: updates and expansion to encompass the new guide to IMMUNOPHARMACOLOGY.2018 年 IUPHAR/BPS 药理学指南:更新和扩展,以包含新的免疫药理学指南。
Nucleic Acids Res. 2018 Jan 4;46(D1):D1091-D1106. doi: 10.1093/nar/gkx1121.
2
Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics.治疗靶点数据库更新 2018:丰富资源,促进靶向治疗的基础到临床研究。
Nucleic Acids Res. 2018 Jan 4;46(D1):D1121-D1127. doi: 10.1093/nar/gkx1076.
3
SuperDRUG2: a one stop resource for approved/marketed drugs.
大环多肽对HDAC1的选择性抑制用于治疗胶质母细胞瘤:基于分子动力学的结合机制分析
Front Mol Biosci. 2020 Mar 11;7:41. doi: 10.3389/fmolb.2020.00041. eCollection 2020.
4
Assessing the public landscape of clinical-stage pharmaceuticals through freely available online databases.通过免费的在线数据库评估临床阶段药物的公共领域情况。
Drug Discov Today. 2019 Apr;24(4):1010-1016. doi: 10.1016/j.drudis.2019.01.010. Epub 2019 Jan 25.
5
Challenges of Connecting Chemistry to Pharmacology: Perspectives from Curating the IUPHAR/BPS Guide to PHARMACOLOGY.连接化学与药理学的挑战:来自编纂《IUPHAR/BPS药理学指南》的观点
ACS Omega. 2018 Jul 31;3(7):8408-8420. doi: 10.1021/acsomega.8b00884.
SuperDRUG2:已批准/上市药物的一站式资源。
Nucleic Acids Res. 2018 Jan 4;46(D1):D1137-D1143. doi: 10.1093/nar/gkx1088.
4
HMDB 4.0: the human metabolome database for 2018.HMDB 4.0:2018 年人类代谢组数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D608-D617. doi: 10.1093/nar/gkx1089.
5
WikiPathways: a multifaceted pathway database bridging metabolomics to other omics research.WikiPathways:一个将代谢组学与其他组学研究联系起来的多方面的途径数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D661-D667. doi: 10.1093/nar/gkx1064.
6
Automated assembly of species metabolomes through data submission into a public repository.通过将数据提交到公共存储库实现物种代谢组的自动化组装。
Gigascience. 2017 Aug 1;6(8):1-4. doi: 10.1093/gigascience/gix062.
7
PubChem BioAssay: A Decade's Development toward Open High-Throughput Screening Data Sharing.PubChem 生物测定:迈向开放高通量筛选数据共享的十年发展。
SLAS Discov. 2017 Jul;22(6):655-666. doi: 10.1177/2472555216685069. Epub 2017 Jan 13.
8
Recon 2.2: from reconstruction to model of human metabolism.Recon 2.2:从重建到人类代谢模型
Metabolomics. 2016;12:109. doi: 10.1007/s11306-016-1051-4. Epub 2016 Jun 7.
9
PubChem Substance and Compound databases.美国国立医学图书馆化学物质数据库和化合物数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1202-13. doi: 10.1093/nar/gkv951. Epub 2015 Sep 22.
10
Expanding opportunities for mining bioactive chemistry from patents.拓展从专利中挖掘生物活性化学物质的机会。
Drug Discov Today Technol. 2015 Jul;14:3-9. doi: 10.1016/j.ddtec.2014.12.001. Epub 2015 Feb 11.