• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

界定植物化学空间的界限:挑战与评估

Defining the limits of plant chemical space: challenges and estimations.

作者信息

Hart Chloe Engler, Gadiya Yojana, Kind Tobias, Krettler Christoph A, Gaetz Matthew, Misra Biswapriya B, Healey David, Allen August, Colluru Viswa, Domingo-Fernández Daniel

机构信息

Enveda, Boulder, CO 80301, United States.

出版信息

Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf033.

DOI:10.1093/gigascience/giaf033
PMID:40184432
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11970369/
Abstract

The plant kingdom, encompassing nearly 400,000 known species, produces an immense diversity of metabolites, including primary compounds essential for survival and secondary metabolites specialized for ecological interactions. These metabolites constitute a vast and complex phytochemical space with significant potential applications in medicine, agriculture, and biotechnology. However, much of this chemical diversity remains unexplored, as only a fraction of plant species has been studied comprehensively. In this work, we estimate the size of the plant chemical space by leveraging large-scale metabolomics and literature datasets. We begin by examining the known chemical space, which, while containing at most several hundred thousand unique compounds, remains sparsely covered. Using data from over 1,000 plant species, we apply various mass spectrometry-based approaches-a formula prediction model, a de novo prediction model, a combination of library search and de novo prediction, and MS2 clustering-to estimate the number of unique structures. Our methods suggest that the number of unique compounds in the metabolomics dataset alone may already surpass existing estimates of plant chemical diversity. Finally, we project these findings across the entire plant kingdom, estimating that the total plant chemical space likely spans millions, if not more, with most still unexplored.

摘要

植物王国包含近40万种已知物种,产生了种类繁多的代谢产物,包括生存所必需的初级化合物和专门用于生态相互作用的次级代谢产物。这些代谢产物构成了一个广阔而复杂的植物化学空间,在医学、农业和生物技术领域具有巨大的潜在应用价值。然而,这种化学多样性的大部分仍未被探索,因为只有一小部分植物物种得到了全面研究。在这项工作中,我们通过利用大规模代谢组学和文献数据集来估计植物化学空间的大小。我们首先研究已知的化学空间,尽管其中最多包含几十万种独特的化合物,但仍然覆盖稀疏。利用来自1000多种植物物种的数据,我们应用了各种基于质谱的方法——分子式预测模型、从头预测模型、库检索和从头预测相结合的方法以及MS2聚类——来估计独特结构的数量。我们的方法表明,仅代谢组学数据集中独特化合物的数量可能已经超过了现有的植物化学多样性估计值。最后,我们将这些发现推广到整个植物王国,估计整个植物化学空间可能涵盖数百万种化合物,甚至更多,其中大部分仍未被探索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/f26dd796d4a3/giaf033fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/920cf57cceb6/giaf033fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/37d51c780208/giaf033fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/828a05b9876b/giaf033fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/54866c9c94ae/giaf033fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/f26dd796d4a3/giaf033fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/920cf57cceb6/giaf033fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/37d51c780208/giaf033fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/828a05b9876b/giaf033fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/54866c9c94ae/giaf033fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5fa/11970369/f26dd796d4a3/giaf033fig5.jpg

相似文献

1
Defining the limits of plant chemical space: challenges and estimations.界定植物化学空间的界限:挑战与评估
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf033.
2
Metabolomics and complementary techniques to investigate the plant phytochemical cosmos.代谢组学和互补技术研究植物化学生物圈。
Nat Prod Rep. 2021 Oct 20;38(10):1729-1759. doi: 10.1039/d1np00014d.
3
A liquid chromatography-mass spectrometry-based metabolomics strategy to explore plant metabolic diversity.一种基于液相色谱-质谱联用的代谢组学策略,用于探索植物代谢多样性。
Methods Enzymol. 2023;680:247-273. doi: 10.1016/bs.mie.2022.08.029. Epub 2022 Sep 24.
4
Studying Plant Specialized Metabolites Using Computational Metabolomics Strategies.利用计算代谢组学策略研究植物特异性代谢物。
Methods Mol Biol. 2024;2788:97-136. doi: 10.1007/978-1-0716-3782-1_7.
5
Spatial and evolutionary predictability of phytochemical diversity.植物化学多样性的空间和进化可预测性。
Proc Natl Acad Sci U S A. 2021 Jan 19;118(3). doi: 10.1073/pnas.2013344118.
6
The WEIZMASS spectral library for high-confidence metabolite identification.高可信度代谢物鉴定的 WEIZMASS 光谱库。
Nat Commun. 2016 Aug 30;7:12423. doi: 10.1038/ncomms12423.
7
[A novel method for efficient screening and annotation of important pathway-associated metabolites based on the modified metabolome and probe molecules].一种基于改良代谢组和探针分子的重要通路相关代谢物高效筛选与注释新方法
Se Pu. 2022 Sep;40(9):788-796. doi: 10.3724/SP.J.1123.2022.03025.
8
PSC-db: A Structured and Searchable 3D-Database for Plant Secondary Compounds.PSC-db:一个用于植物次生化合物的结构化和可搜索的 3D 数据库。
Molecules. 2021 Feb 20;26(4):1124. doi: 10.3390/molecules26041124.
9
Exploring the known chemical space of the plant kingdom: insights into taxonomic patterns, knowledge gaps, and bioactive regions.探索植物王国的已知化学空间:洞察分类模式、知识空白和生物活性区域。
J Cheminform. 2023 Nov 10;15(1):107. doi: 10.1186/s13321-023-00778-w.
10
Web-based resources for mass-spectrometry-based metabolomics: a user's guide.基于质谱的代谢组学的网络资源:用户指南
Phytochemistry. 2009 Mar;70(4):450-6. doi: 10.1016/j.phytochem.2009.02.004. Epub 2009 Mar 13.

引用本文的文献

1
Untargeted Metabolomics Reveals Organ-Specific and Extraction-Dependent Metabolite Profiles in Endemic Tajik Species Korovin.非靶向代谢组学揭示了塔吉克特有物种科罗温体内器官特异性和提取物依赖性代谢物谱。
bioRxiv. 2025 Aug 27:2025.08.24.671964. doi: 10.1101/2025.08.24.671964.

本文引用的文献

1
COCONUT 2.0: a comprehensive overhaul and curation of the collection of open natural products database.COCONUT 2.0:开放天然产物数据库集合的全面修订与管理
Nucleic Acids Res. 2025 Jan 6;53(D1):D634-D643. doi: 10.1093/nar/gkae1063.
2
Plant Metabolic Network 16: expansion of underrepresented plant groups and experimentally supported enzyme data.植物代谢网络16:代表性不足的植物类群扩展及实验支持的酶数据。
Nucleic Acids Res. 2025 Jan 6;53(D1):D1606-D1613. doi: 10.1093/nar/gkae991.
3
NCBI Taxonomy: enhanced access via NCBI Datasets.
NCBI分类法:通过NCBI数据集实现更便捷的访问。
Nucleic Acids Res. 2025 Jan 6;53(D1):D1711-D1715. doi: 10.1093/nar/gkae967.
4
Evaluating the generalizability of graph neural networks for predicting collision cross section.评估图神经网络在预测碰撞横截面方面的通用性。
J Cheminform. 2024 Aug 29;16(1):105. doi: 10.1186/s13321-024-00899-w.
5
Effects of solvent extraction on the phytoconstituents and in vitro antioxidant activity properties of leaf extracts of the two selected medicinal plants from Malawi.溶剂萃取对马拉维两种药用植物叶提取物的植物化学成分和体外抗氧化活性的影响。
BMC Complement Med Ther. 2024 Aug 27;24(1):317. doi: 10.1186/s12906-024-04619-7.
6
Reproducible MS/MS library cleaning pipeline in matchms.Matchms中可重现的串联质谱(MS/MS)库清理流程。
J Cheminform. 2024 Jul 29;16(1):88. doi: 10.1186/s13321-024-00878-1.
7
A Sample-Centric and Knowledge-Driven Computational Framework for Natural Products Drug Discovery.一种以样本为中心且知识驱动的天然产物药物发现计算框架。
ACS Cent Sci. 2024 Feb 20;10(3):494-510. doi: 10.1021/acscentsci.3c00800. eCollection 2024 Mar 27.
8
Exploring the known chemical space of the plant kingdom: insights into taxonomic patterns, knowledge gaps, and bioactive regions.探索植物王国的已知化学空间:洞察分类模式、知识空白和生物活性区域。
J Cheminform. 2023 Nov 10;15(1):107. doi: 10.1186/s13321-023-00778-w.
9
Open and reusable annotated mass spectrometry dataset of a chemodiverse collection of 1,600 plant extracts.1600 种植物提取物的化学多样性混合物的开放且可重复使用的带注释的质谱数据集。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giac124. Epub 2023 Jan 18.
10
The critical role that spectral libraries play in capturing the metabolomics community knowledge.光谱库在捕获代谢组学领域知识方面的关键作用。
Metabolomics. 2022 Nov 19;18(12):94. doi: 10.1007/s11306-022-01947-y.