• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ZINC-22─一个免费的、数十亿规模的有形化合物数据库,用于配体发现。

ZINC-22─A Free Multi-Billion-Scale Database of Tangible Compounds for Ligand Discovery.

机构信息

Department of Pharmaceutical Chemistry, University of California San Francisco, 1700 4th St, Mailcode 2550, San Francisco, California 94158-2330, United States.

Taras Shevchenko National University of Kyïv, 60 Volodymyrska Street, Kyïv 01601, Ukraine.

出版信息

J Chem Inf Model. 2023 Feb 27;63(4):1166-1176. doi: 10.1021/acs.jcim.2c01253. Epub 2023 Feb 15.

DOI:10.1021/acs.jcim.2c01253
PMID:36790087
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9976280/
Abstract

Purchasable chemical space has grown rapidly into the tens of billions of molecules, providing unprecedented opportunities for ligand discovery but straining the tools that might exploit these molecules at scale. We have therefore developed ZINC-22, a database of commercially accessible small molecules derived from multi-billion-scale make-on-demand libraries. The new database and tools enable analog searching in this vast new space via a facile GUI, CartBlanche, drawing on similarity methods that scale sublinearly in the number of molecules. The new library also uses data organization methods, enabling rapid lookup of molecules and their physical properties, including conformations, partial atomic charges,  Log  values, and solvation energies, all crucial for molecule docking, which had become slow with older database organizations in previous versions of ZINC. As the libraries have continued to grow, we have been interested in finding whether molecular diversity has suffered, for instance, because certain scaffolds have come to dominate via easy analoging. This has not occurred thus far, and chemical diversity continues to grow with database size, with a log increase in Bemis-Murcko scaffolds for every two-log unit increase in database size. Most new scaffolds come from compounds with the highest heavy atom count. Finally, we consider the implications for databases like ZINC as the libraries grow toward and beyond the trillion-molecule range. ZINC is freely available to everyone and may be accessed at cartblanche22.docking.org, via Globus, and in the Amazon AWS and Oracle OCI clouds.

摘要

可购买的化学空间已迅速扩展到数十亿个分子,为配体发现提供了前所未有的机会,但也给可能大规模利用这些分子的工具带来了压力。因此,我们开发了 ZINC-22,这是一个由数十亿规模按需制作的文库衍生的商业小分子数据库。这个新数据库和工具通过一个简单易用的 GUI,即 CartBlanche,利用在分子数量上呈次线性扩展的相似性方法,在这个巨大的新空间中实现了类似物搜索。新的文库还使用了数据组织方法,能够快速查找分子及其物理性质,包括构象、部分原子电荷、Log 值和溶剂化能,这些对于分子对接都至关重要,而在 ZINC 的早期版本中,由于数据库组织的原因,对接已经变得很慢。随着文库的不断增长,我们一直关注是否会出现分子多样性受到影响的情况,例如,由于某些骨架通过简单的模拟而占据主导地位。到目前为止,这种情况并未发生,而且随着数据库规模的增加,化学多样性仍在不断增加,Bemis-Murcko 骨架的对数增加与数据库规模的每两个对数单位增加成正比。大多数新骨架来自于具有最高重原子数的化合物。最后,我们考虑了随着文库规模增长到万亿分子范围甚至超过这个范围,像 ZINC 这样的数据库的影响。ZINC 对所有人都是免费开放的,可以在 cartblanche22.docking.org 上通过 Globus、亚马逊 AWS 和甲骨文 OCI 云访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/910ade3e42f9/ci2c01253_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/f54370553a7b/ci2c01253_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/4a72eb893aef/ci2c01253_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/d001c7673a91/ci2c01253_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/910ade3e42f9/ci2c01253_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/f54370553a7b/ci2c01253_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/4a72eb893aef/ci2c01253_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/d001c7673a91/ci2c01253_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544b/9976280/910ade3e42f9/ci2c01253_0005.jpg

相似文献

1
ZINC-22─A Free Multi-Billion-Scale Database of Tangible Compounds for Ligand Discovery.ZINC-22─一个免费的、数十亿规模的有形化合物数据库,用于配体发现。
J Chem Inf Model. 2023 Feb 27;63(4):1166-1176. doi: 10.1021/acs.jcim.2c01253. Epub 2023 Feb 15.
2
ZINC20-A Free Ultralarge-Scale Chemical Database for Ligand Discovery.ZINC20-A 免费超大尺度化学数据库,用于配体发现。
J Chem Inf Model. 2020 Dec 28;60(12):6065-6073. doi: 10.1021/acs.jcim.0c00675. Epub 2020 Oct 29.
3
ZINC--a free database of commercially available compounds for virtual screening.锌数据库——一个可用于虚拟筛选的商业可用化合物免费数据库。
J Chem Inf Model. 2005 Jan-Feb;45(1):177-82. doi: 10.1021/ci049714+.
4
Large-Scale Docking in the Cloud.大规模云端对接。
J Chem Inf Model. 2023 May 8;63(9):2735-2741. doi: 10.1021/acs.jcim.3c00031. Epub 2023 Apr 18.
5
ZINC 15--Ligand Discovery for Everyone.锌15——面向大众的配体发现平台。
J Chem Inf Model. 2015 Nov 23;55(11):2324-37. doi: 10.1021/acs.jcim.5b00559. Epub 2015 Nov 9.
6
Modeling the expansion of virtual screening libraries.建立虚拟筛选库的扩展模型。
Nat Chem Biol. 2023 Jun;19(6):712-718. doi: 10.1038/s41589-022-01234-w. Epub 2023 Jan 16.
7
Chemical space docking enables large-scale structure-based virtual screening to discover ROCK1 kinase inhibitors.化学空间对接使基于结构的大规模虚拟筛选能够发现 ROCK1 激酶抑制剂。
Nat Commun. 2022 Oct 28;13(1):6447. doi: 10.1038/s41467-022-33981-8.
8
Bioactivity-guided navigation of chemical space.基于生物活性的化学空间导航。
Acc Chem Res. 2010 Aug 17;43(8):1103-14. doi: 10.1021/ar100014h.
9
istar: a web platform for large-scale protein-ligand docking.istar:一个用于大规模蛋白质配体对接的网络平台。
PLoS One. 2014 Jan 24;9(1):e85678. doi: 10.1371/journal.pone.0085678. eCollection 2014.
10
Novel tumor necrosis factor-α (TNF-α) inhibitors from small molecule library screening for their therapeutic activity profiles against rheumatoid arthritis using target-driven approaches and binary QSAR models.基于靶向驱动方法和二元 QSAR 模型,从小分子文库中筛选新型肿瘤坏死因子-α(TNF-α)抑制剂,以评估其治疗类风湿关节炎的活性谱。
J Biomol Struct Dyn. 2019 Jun;37(9):2464-2476. doi: 10.1080/07391102.2018.1491423. Epub 2018 Dec 23.

引用本文的文献

1
Parallel synthesis of 5'-amino-5'-deoxy-adenosine derivatives for focused chemical space exploration and their application as methyltransferase inhibitors.用于聚焦化学空间探索的5'-氨基-5'-脱氧腺苷衍生物的平行合成及其作为甲基转移酶抑制剂的应用。
RSC Med Chem. 2025 Aug 12. doi: 10.1039/d5md00376h.
2
A structure-oriented kinetics dataset of enzyme-substrate interactions.一个面向结构的酶-底物相互作用动力学数据集。
Sci Data. 2025 Aug 26;12(1):1489. doi: 10.1038/s41597-025-05829-5.
3
A Benchmark Set of Bioactive Molecules for Diversity Analysis of Compound Libraries and Combinatorial Chemical Spaces.

本文引用的文献

1
Modeling the expansion of virtual screening libraries.建立虚拟筛选库的扩展模型。
Nat Chem Biol. 2023 Jun;19(6):712-718. doi: 10.1038/s41589-022-01234-w. Epub 2023 Jan 16.
2
Chemical Space Exploration with Active Learning and Alchemical Free Energies.主动学习和非热力学自由能在化学空间探索中的应用。
J Chem Theory Comput. 2022 Oct 11;18(10):6259-6270. doi: 10.1021/acs.jctc.2c00752. Epub 2022 Sep 23.
3
Magnet for the Needle in Haystack: "Crystal Structure First" Fragment Hits Unlock Active Chemical Matter Using Targeted Exploration of Vast Chemical Spaces.
用于化合物库和组合化学空间多样性分析的生物活性分子基准集。
J Chem Inf Model. 2025 Sep 8;65(17):9097-9124. doi: 10.1021/acs.jcim.5c00719. Epub 2025 Aug 20.
4
Drug screens using the nematode Caenorhabditis elegans.使用线虫秀丽隐杆线虫进行药物筛选。
Genetics. 2025 Sep 3;231(1). doi: 10.1093/genetics/iyaf141.
5
STELLA provides a drug design framework enabling extensive fragment-level chemical space exploration and balanced multi-parameter optimization.STELLA提供了一个药物设计框架,能够进行广泛的片段级化学空间探索和平衡的多参数优化。
Sci Rep. 2025 Aug 1;15(1):28135. doi: 10.1038/s41598-025-12685-1.
6
Protein-ligand data at scale to support machine learning.大规模蛋白质-配体数据以支持机器学习。
Nat Rev Chem. 2025 Jul 23. doi: 10.1038/s41570-025-00737-z.
7
Simpatico: accurate and ultra-fast virtual drug screening with atomic embeddings.辛帕提科:利用原子嵌入进行准确且超快速的虚拟药物筛选。
bioRxiv. 2025 Jun 8:2025.06.08.658499. doi: 10.1101/2025.06.08.658499.
8
Harnessing AlphaFold to reveal hERG channel conformational state secrets.利用AlphaFold揭示人乙醚-去极化激活钾离子通道(hERG)构象状态的秘密。
Elife. 2025 Jul 14;13:RP104901. doi: 10.7554/eLife.104901.
9
Uncovering key markers and therapeutic targets for renal fibrosis in diabetic kidney disease through bulk and single-cell RNA sequencing.通过批量和单细胞RNA测序揭示糖尿病肾病肾纤维化的关键标志物和治疗靶点。
J Transl Med. 2025 Jul 4;23(1):742. doi: 10.1186/s12967-025-06554-8.
10
An open-source family of large encoder-decoder foundation models for chemistry.一个用于化学领域的开源大型编码器-解码器基础模型系列。
Commun Chem. 2025 Jul 1;8(1):193. doi: 10.1038/s42004-025-01585-0.
大海捞针的磁体:“晶体结构优先”片段命中物通过对广阔化学空间的靶向探索解锁活性化学物质。
J Med Chem. 2022 Dec 8;65(23):15663-15678. doi: 10.1021/acs.jmedchem.2c00813. Epub 2022 Sep 7.
4
Coverage Score: A Model Agnostic Method to Efficiently Explore Chemical Space.覆盖率得分:一种高效探索化学空间的模型不可知方法。
J Chem Inf Model. 2022 Sep 26;62(18):4391-4402. doi: 10.1021/acs.jcim.2c00258. Epub 2022 Jul 22.
5
Quantitative and Qualitative Analysis of the Anti-Proliferative Potential of the Pyrazole Scaffold in the Design of Anticancer Agents.定量和定性分析吡唑骨架在设计抗癌药物中的抗增殖潜力。
Molecules. 2022 May 20;27(10):3300. doi: 10.3390/molecules27103300.
6
Artificial intelligence in virtual screening: Models versus experiments.人工智能在虚拟筛选中的应用:模型与实验的对比。
Drug Discov Today. 2022 Jul;27(7):1913-1923. doi: 10.1016/j.drudis.2022.05.013. Epub 2022 May 18.
7
Exploration of Ultralarge Compound Collections for Drug Discovery.探索用于药物发现的超大型化合物库。
J Chem Inf Model. 2022 May 9;62(9):2021-2034. doi: 10.1021/acs.jcim.2c00224. Epub 2022 Apr 14.
8
Comparison of Combinatorial Fragment Spaces and Its Application to Ultralarge Make-on-Demand Compound Catalogs.组合片段空间的比较及其在超大型按需合成化合物库中的应用
J Chem Inf Model. 2022 Feb 14;62(3):553-566. doi: 10.1021/acs.jcim.1c01378. Epub 2022 Jan 20.
9
Virtual Screening with Gnina 1.0.Gnina 1.0 虚拟筛选。
Molecules. 2021 Dec 4;26(23):7369. doi: 10.3390/molecules26237369.
10
Structures of the σ receptor enable docking for bioactive ligand discovery.σ受体的结构有助于对接以发现生物活性配体。
Nature. 2021 Dec;600(7890):759-764. doi: 10.1038/s41586-021-04175-x. Epub 2021 Dec 8.