• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过分子语言处理生成的 6700 万种类似天然产物的化合物数据库。

67 million natural product-like compound database generated via molecular language processing.

机构信息

Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science, Technology and Research (A*STAR), 8 Biomedical Grove, #07-01 Neuros Building, Singapore, 138665, Republic of Singapore.

Hwa Chong Institution, 661 Bukit Timah Road, Singapore, 269734, Republic of Singapore.

出版信息

Sci Data. 2023 May 19;10(1):296. doi: 10.1038/s41597-023-02207-x.

DOI:10.1038/s41597-023-02207-x
PMID:37208372
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10199072/
Abstract

Natural products are a rich resource of bioactive compounds for valuable applications across multiple fields such as food, agriculture, and medicine. For natural product discovery, high throughput in silico screening offers a cost-effective alternative to traditional resource-heavy assay-guided exploration of structurally novel chemical space. In this data descriptor, we report a characterized database of 67,064,204 natural product-like molecules generated using a recurrent neural network trained on known natural products, demonstrating a significant 165-fold expansion in library size over the approximately 400,000 known natural products. This study highlights the potential of using deep generative models to explore novel natural product chemical space for high throughput in silico discovery.

摘要

天然产物是生物活性化合物的丰富资源,可应用于食品、农业和医学等多个领域。对于天然产物的发现,高通量的计算筛选为传统的资源密集型、基于结构的新型化学空间的检测指导探索提供了一种具有成本效益的替代方法。在本数据描述中,我们报告了一个经过特征描述的数据库,其中包含了 67064204 个类似天然产物的分子,这些分子是使用基于已知天然产物的递归神经网络训练得到的,与大约 40 万个已知天然产物相比,库的大小显著扩大了 165 倍。这项研究强调了使用深度生成模型来探索新型天然产物化学空间以进行高通量计算发现的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1119/10199072/d4631e794a40/41597_2023_2207_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1119/10199072/75b9a539eaae/41597_2023_2207_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1119/10199072/9a4bd3bd1c0d/41597_2023_2207_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1119/10199072/d4631e794a40/41597_2023_2207_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1119/10199072/75b9a539eaae/41597_2023_2207_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1119/10199072/9a4bd3bd1c0d/41597_2023_2207_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1119/10199072/d4631e794a40/41597_2023_2207_Fig3_HTML.jpg

相似文献

1
67 million natural product-like compound database generated via molecular language processing.通过分子语言处理生成的 6700 万种类似天然产物的化合物数据库。
Sci Data. 2023 May 19;10(1):296. doi: 10.1038/s41597-023-02207-x.
2
NCI Program for Natural Product Discovery: A Publicly-Accessible Library of Natural Product Fractions for High-Throughput Screening.NCI 天然产物发现计划:用于高通量筛选的公开天然产物馏分文库。
ACS Chem Biol. 2018 Sep 21;13(9):2484-2497. doi: 10.1021/acschembio.8b00389. Epub 2018 Jun 13.
3
Data Resources for the Computer-Guided Discovery of Bioactive Natural Products.用于计算机辅助发现生物活性天然产物的数据资源。
J Chem Inf Model. 2017 Sep 25;57(9):2099-2111. doi: 10.1021/acs.jcim.7b00341. Epub 2017 Aug 30.
4
Progress and Impact of Latin American Natural Product Databases.拉丁美洲天然产物数据库的进展与影响。
Biomolecules. 2022 Aug 30;12(9):1202. doi: 10.3390/biom12091202.
5
Profiling Chemobiological Connection between Natural Product and Target Space Based on Systematic Analysis.基于系统分析的天然产物与靶标空间化学生物学关系剖析。
Int J Mol Sci. 2023 Jul 10;24(14):11265. doi: 10.3390/ijms241411265.
6
Computer-Assisted Drug Virtual Screening Based on the Natural Product Databases.基于天然产物数据库的计算机辅助药物虚拟筛选
Curr Pharm Biotechnol. 2019;20(4):293-301. doi: 10.2174/1389201020666190328115411.
7
Novel chemical space exploration via natural products.通过天然产物进行新型化学空间探索。
J Med Chem. 2009 Apr 9;52(7):1953-62. doi: 10.1021/jm801514w.
8
High-throughput screening for bioactive components from traditional Chinese medicine.中药生物活性成分的高通量筛选
Comb Chem High Throughput Screen. 2010 Dec;13(10):837-48. doi: 10.2174/138620710793360257.
9
Cheminformatics Explorations of Natural Products.天然产物的化学信息学探索
Prog Chem Org Nat Prod. 2019;110:1-35. doi: 10.1007/978-3-030-14632-0_1.
10
Global profiling and rapid matching of natural products using diagnostic product ion network and in silico analogue database: Gastrodia elata as a case study.利用诊断性生产网络和计算机模拟类似物数据库对天然产物进行全球分析和快速匹配:以天麻为例
J Chromatogr A. 2016 Jul 22;1456:187-95. doi: 10.1016/j.chroma.2016.06.013. Epub 2016 Jun 4.

引用本文的文献

1
The Extraordinary Benefit of Nature's Chemistry to Health, Society, and the Economy.大自然的化学对健康、社会和经济的非凡益处。
J Nat Prod. 2025 Jun 27;88(6):1541-1548. doi: 10.1021/acs.jnatprod.5c00554. Epub 2025 Jun 12.
2
Artificial Intelligence in Natural Product Drug Discovery: Current Applications and Future Perspectives.天然产物药物发现中的人工智能:当前应用与未来展望。
J Med Chem. 2025 Feb 27;68(4):3948-3969. doi: 10.1021/acs.jmedchem.4c01257. Epub 2025 Feb 6.
3
The Natural Products Magnetic Resonance Database (NP-MRD) for 2025.

本文引用的文献

1
Auto3D: Automatic Generation of the Low-Energy 3D Structures with ANI Neural Network Potentials.Auto3D:使用ANI 神经网络势自动生成低能 3D 结构。
J Chem Inf Model. 2022 Nov 28;62(22):5373-5382. doi: 10.1021/acs.jcim.2c00817. Epub 2022 Sep 16.
2
Generative machine learning for de novo drug discovery: A systematic review.生成式机器学习在从头药物发现中的应用:系统评价。
Comput Biol Med. 2022 Jun;145:105403. doi: 10.1016/j.compbiomed.2022.105403. Epub 2022 Mar 13.
3
Sustainable manufacturing with synthetic biology.合成生物学助力可持续制造。
2025年天然产物磁共振数据库(NP-MRD)。
Nucleic Acids Res. 2025 Jan 6;53(D1):D700-D708. doi: 10.1093/nar/gkae1067.
4
Deep learning pipeline for accelerating virtual screening in drug discovery.深度学习在药物发现中的虚拟筛选加速流水线。
Sci Rep. 2024 Nov 16;14(1):28321. doi: 10.1038/s41598-024-79799-w.
5
Chemoinformatic Characterization of NAPROC-13: A Database for Natural Product C NMR Dereplication.NAPROC-13 的化学信息学特征:天然产物 C NMR 去重复数据库。
J Nat Prod. 2024 Sep 27;87(9):2216-2229. doi: 10.1021/acs.jnatprod.4c00530. Epub 2024 Sep 13.
6
Synthetic Biology of Natural Products Engineering: Recent Advances Across the Discover-Design-Build-Test-Learn Cycle.天然产物工程的合成生物学:贯穿发现-设计-构建-测试-学习周期的最新进展。
ACS Synth Biol. 2024 Sep 20;13(9):2684-2692. doi: 10.1021/acssynbio.4c00391. Epub 2024 Aug 20.
7
Synthesis of Metabolites and Metabolite-like Compounds Using Biocatalytic Systems.利用生物催化系统合成代谢物及类代谢物化合物
Metabolites. 2023 Oct 19;13(10):1097. doi: 10.3390/metabo13101097.
8
Prioritised identification of structural classes of natural products from higher plants in the expedition of antimalarial drug discovery.在抗疟药物发现探索中对高等植物天然产物结构类别的优先识别。
Nat Prod Bioprospect. 2023 Oct 12;13(1):37. doi: 10.1007/s13659-023-00402-2.
Nat Biotechnol. 2022 Mar;40(3):304-307. doi: 10.1038/s41587-022-01248-8.
4
Using deep neural networks to explore chemical space.使用深度神经网络探索化学空间。
Expert Opin Drug Discov. 2022 Mar;17(3):297-304. doi: 10.1080/17460441.2022.2019704. Epub 2021 Dec 29.
5
Molecular generation by Fast Assembly of (Deep)SMILES fragments.通过(深度)SMILES片段的快速组装进行分子生成。
J Cheminform. 2021 Nov 14;13(1):88. doi: 10.1186/s13321-021-00566-4.
6
NPClassifier: A Deep Neural Network-Based Structural Classification Tool for Natural Products.NPClassifier:一种基于深度神经网络的天然产物结构分类工具。
J Nat Prod. 2021 Nov 26;84(11):2795-2807. doi: 10.1021/acs.jnatprod.1c00399. Epub 2021 Oct 18.
7
Generative Adversarial Networks for De Novo Molecular Design.生成对抗网络用于从头分子设计。
Mol Inform. 2021 Oct;40(10):e2100045. doi: 10.1002/minf.202100045. Epub 2021 Jul 6.
8
Cheminformatic analysis of natural product-based drugs and chemical probes.天然产物类药物和化学探针的 cheminformatic 分析。
Nat Prod Rep. 2022 Jan 26;39(1):20-32. doi: 10.1039/d1np00039j.
9
NP Navigator: A New Look at the Natural Product Chemical Space.NP 导航器:探索天然产物化学空间的新视角。
Mol Inform. 2021 Sep;40(9):e2100068. doi: 10.1002/minf.202100068. Epub 2021 Jun 25.
10
Applications of Natural Products in Food.天然产物在食品中的应用。
Foods. 2021 Feb 2;10(2):300. doi: 10.3390/foods10020300.