• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

理解和分类代谢物空间和代谢物相似性。

Understanding and classifying metabolite space and metabolite-likeness.

机构信息

TNO Research Group Quality and Safety, Zeist, The Netherlands.

出版信息

PLoS One. 2011;6(12):e28966. doi: 10.1371/journal.pone.0028966. Epub 2011 Dec 14.

DOI:10.1371/journal.pone.0028966
PMID:22194963
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3237584/
Abstract

While the entirety of 'Chemical Space' is huge (and assumed to contain between 10(63) and 10(200) 'small molecules'), distinct subsets of this space can nonetheless be defined according to certain structural parameters. An example of such a subspace is the chemical space spanned by endogenous metabolites, defined as 'naturally occurring' products of an organisms' metabolism. In order to understand this part of chemical space in more detail, we analyzed the chemical space populated by human metabolites in two ways. Firstly, in order to understand metabolite space better, we performed Principal Component Analysis (PCA), hierarchical clustering and scaffold analysis of metabolites and non-metabolites in order to analyze which chemical features are characteristic for both classes of compounds. Here we found that heteroatom (both oxygen and nitrogen) content, as well as the presence of particular ring systems was able to distinguish both groups of compounds. Secondly, we established which molecular descriptors and classifiers are capable of distinguishing metabolites from non-metabolites, by assigning a 'metabolite-likeness' score. It was found that the combination of MDL Public Keys and Random Forest exhibited best overall classification performance with an AUC value of 99.13%, a specificity of 99.84% and a selectivity of 88.79%. This performance is slightly better than previous classifiers; and interestingly we found that drugs occupy two distinct areas of metabolite-likeness, the one being more 'synthetic' and the other being more 'metabolite-like'. Also, on a truly prospective dataset of 457 compounds, 95.84% correct classification was achieved. Overall, we are confident that we contributed to the tasks of classifying metabolites, as well as to understanding metabolite chemical space better. This knowledge can now be used in the development of new drugs that need to resemble metabolites, and in our work particularly for assessing the metabolite-likeness of candidate molecules during metabolite identification in the metabolomics field.

摘要

虽然“化学空间”的整体范围非常大(据估计包含 10(63)到 10(200)个“小分子”),但根据某些结构参数,仍然可以定义这个空间的不同子集。这个空间的一个子空间是内源性代谢物所占据的化学空间,定义为生物体代谢的“天然”产物。为了更详细地了解这部分化学空间,我们以两种方式分析了人类代谢物所占据的化学空间。首先,为了更好地了解代谢物空间,我们对代谢物和非代谢物进行了主成分分析(PCA)、层次聚类和支架分析,以分析哪些化学特征是这两类化合物所共有的。在这里,我们发现杂原子(氧和氮)含量以及特定环系统的存在能够区分这两类化合物。其次,我们通过分配“代谢物相似性”评分来确定哪些分子描述符和分类器能够区分代谢物和非代谢物。结果发现,MDL 公钥和随机森林的组合表现出最佳的整体分类性能,AUC 值为 99.13%,特异性为 99.84%,选择性为 88.79%。这种性能略优于以前的分类器;有趣的是,我们发现药物占据了代谢物相似性的两个不同区域,一个区域更“合成”,另一个区域更“代谢物样”。此外,在一个真正的 457 个化合物前瞻性数据集上,实现了 95.84%的正确分类。总的来说,我们有信心我们为分类代谢物以及更好地理解代谢物化学空间做出了贡献。现在,这些知识可以用于开发需要类似于代谢物的新药,特别是在代谢组学领域中用于评估候选分子在代谢物鉴定过程中的代谢物相似性。

相似文献

1
Understanding and classifying metabolite space and metabolite-likeness.理解和分类代谢物空间和代谢物相似性。
PLoS One. 2011;6(12):e28966. doi: 10.1371/journal.pone.0028966. Epub 2011 Dec 14.
2
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
3
Drug repositioning for enzyme modulator based on human metabolite-likeness.基于人类代谢物相似性的酶调节剂药物重新定位。
BMC Bioinformatics. 2017 May 31;18(Suppl 7):226. doi: 10.1186/s12859-017-1637-5.
4
Comparing the chemical spaces of metabolites and available chemicals: models of metabolite-likeness.比较代谢物与可用化学品的化学空间:类代谢物模型。
Mol Divers. 2007 Feb;11(1):23-36. doi: 10.1007/s11030-006-9054-0. Epub 2007 Feb 16.
5
'Metabolite-likeness' as a criterion in the design and selection of pharmaceutical drug libraries.“代谢物相似性”作为药物文库设计与筛选的一项标准。
Drug Discov Today. 2009 Jan;14(1-2):31-40. doi: 10.1016/j.drudis.2008.10.011. Epub 2008 Dec 26.
6
Structural diversity of biologically interesting datasets: a scaffold analysis approach.具有生物学意义的数据集的结构多样性:支架分析方法。
J Cheminform. 2011 Aug 8;3:30. doi: 10.1186/1758-2946-3-30.
7
Physiochemical property space distribution among human metabolites, drugs and toxins.人体代谢物、药物和毒素的物理化学性质空间分布。
BMC Bioinformatics. 2009 Dec 3;10 Suppl 15(Suppl 15):S10. doi: 10.1186/1471-2105-10-S15-S10.
8
BioSM: metabolomics tool for identifying endogenous mammalian biochemical structures in chemical structure space.BioSM:用于在化学结构空间中识别内源性哺乳动物生化结构的代谢组学工具。
J Chem Inf Model. 2013 Mar 25;53(3):601-12. doi: 10.1021/ci300512q. Epub 2013 Feb 27.
9
MetExpert: An expert system to enhance gas chromatography‒mass spectrometry-based metabolite identifications.MetExpert:一种用于增强基于气相色谱-质谱联用的代谢物鉴定的专家系统。
Anal Chim Acta. 2018 Dec 11;1037:316-326. doi: 10.1016/j.aca.2018.03.052. Epub 2018 Apr 6.
10
How diverse are diversity assessment methods? A comparative analysis and benchmarking of molecular descriptor space.多样性评估方法有哪些差异?分子描述符空间的比较分析和基准测试。
J Chem Inf Model. 2014 Jan 27;54(1):230-42. doi: 10.1021/ci400469u. Epub 2013 Dec 13.

引用本文的文献

1
Assessing How Chemical Exposures Affect Human Health.评估化学物质暴露如何影响人类健康。
LC GC Eur. 2023 Jun;36(Suppl):7-10.
2
Microbial metabolites in the marine carbon cycle.海洋碳循环中的微生物代谢产物。
Nat Microbiol. 2022 Apr;7(4):508-523. doi: 10.1038/s41564-022-01090-3. Epub 2022 Apr 1.
3
Natural product drug discovery in the artificial intelligence era.人工智能时代的天然产物药物发现

本文引用的文献

1
How similar are those molecules after all? Use two descriptors and you will have three different answers.这些分子到底有多相似?使用两个描述符,你将得到三个不同的答案。
Expert Opin Drug Discov. 2010 Dec;5(12):1141-51. doi: 10.1517/17460441.2010.517832. Epub 2010 Sep 16.
2
Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry.使用气相色谱和液相色谱-质谱联用技术进行大规模血清和血浆代谢物分析的程序。
Nat Protoc. 2011 Jun 30;6(7):1060-83. doi: 10.1038/nprot.2011.335.
3
Automated workflows for accurate mass-based putative metabolite identification in LC/MS-derived metabolomic datasets.
Chem Sci. 2021 Dec 13;13(6):1526-1546. doi: 10.1039/d1sc04471k. eCollection 2022 Feb 9.
4
Defining Blood Plasma and Serum Metabolome by GC-MS.通过气相色谱-质谱联用技术定义血浆和血清代谢组
Metabolites. 2021 Dec 24;12(1):15. doi: 10.3390/metabo12010015.
5
Secretomics: a biochemical footprinting tool for developing microalgal cultivation strategies.分泌组学:一种用于开发微藻培养策略的生化足迹工具。
World J Microbiol Biotechnol. 2021 Sep 28;37(11):182. doi: 10.1007/s11274-021-03148-6.
6
Generation of a Small Library of Natural Products Designed to Cover Chemical Space Inexpensively.构建一个旨在以低成本覆盖化学空间的天然产物小型文库。
Pharm Front. 2019;1(1):e190005. doi: 10.20900/pf20190005. Epub 2019 Aug 9.
7
Metabolomics reveals elevated urinary excretion of collagen degradation and epithelial cell turnover products in irritable bowel syndrome patients.代谢组学揭示了肠易激综合征患者尿液中胶原降解和上皮细胞更替产物的排泄增加。
Metabolomics. 2019 May 20;15(6):82. doi: 10.1007/s11306-019-1543-0.
8
Propagating annotations of molecular networks using in silico fragmentation.利用计算机模拟片段化技术传播分子网络的注释。
PLoS Comput Biol. 2018 Apr 18;14(4):e1006089. doi: 10.1371/journal.pcbi.1006089. eCollection 2018 Apr.
9
ChemDistiller: an engine for metabolite annotation in mass spectrometry.ChemDistiller:用于质谱代谢物注释的引擎。
Bioinformatics. 2018 Jun 15;34(12):2096-2102. doi: 10.1093/bioinformatics/bty080.
10
The octet rule in chemical space: generating virtual molecules.化学空间的八隅体规则:生成虚拟分子。
Mol Divers. 2017 Nov;21(4):769-778. doi: 10.1007/s11030-017-9775-2. Epub 2017 Aug 3.
基于 LC/MS 衍生代谢组学数据集的准确基于质量的假定代谢物鉴定的自动化工作流程。
Bioinformatics. 2011 Apr 15;27(8):1108-12. doi: 10.1093/bioinformatics/btr079. Epub 2011 Feb 16.
4
Advances in structure elucidation of small molecules using mass spectrometry.利用质谱法进行小分子结构解析的进展
Bioanal Rev. 2010 Dec;2(1-4):23-60. doi: 10.1007/s12566-010-0015-9. Epub 2010 Aug 21.
5
Automated strategies to identify compounds on the basis of GC/EI-MS and calculated properties.基于 GC/EI-MS 和计算性质自动识别化合物的策略。
Anal Chem. 2011 Feb 1;83(3):903-12. doi: 10.1021/ac102574h. Epub 2011 Jan 12.
6
Trust, but verify: on the importance of chemical structure curation in cheminformatics and QSAR modeling research.信任,但要验证:关于化学结构 curated 在 cheminformatics 和 QSAR 建模研究中的重要性。
J Chem Inf Model. 2010 Jul 26;50(7):1189-204. doi: 10.1021/ci100176x.
7
Dealing with the unknown: metabolomics and metabolite atlases.应对未知挑战:代谢组学与代谢物图谱。
J Am Soc Mass Spectrom. 2010 Sep;21(9):1471-6. doi: 10.1016/j.jasms.2010.04.003. Epub 2010 Apr 12.
8
Extended-connectivity fingerprints.扩展连接指纹。
J Chem Inf Model. 2010 May 24;50(5):742-54. doi: 10.1021/ci100050t.
9
Mass-spectrometry-based metabolomics: limitations and recommendations for future progress with particular focus on nutrition research.基于质谱的代谢组学:局限性及对未来进展的建议,特别关注营养研究
Metabolomics. 2009 Dec;5(4):435-458. doi: 10.1007/s11306-009-0168-0. Epub 2009 Jun 12.
10
KEGG for representation and analysis of molecular networks involving diseases and drugs.KEGG 用于表示和分析涉及疾病和药物的分子网络。
Nucleic Acids Res. 2010 Jan;38(Database issue):D355-60. doi: 10.1093/nar/gkp896. Epub 2009 Oct 30.