Introducing 'identification probability' for automated and transferable assessment of metabolite identification confidence in metabolomics and related studies.

Authors

Metz Thomas O, Chang Christine H, Gautam Vasuk, Anjum Afia, Tian Siyang, Wang Fei, Colby Sean M, Nunez Jamie R, Blumer Madison R, Edison Arthur S, Fiehn Oliver, Jones Dean P, Li Shuzhao, Morgan Edward T, Patti Gary J, Ross Dylan H, Shapiro Madelyn R, Williams Antony J, Wishart David S

Affiliations

Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.

Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada.

Publication information

bioRxiv. 2024 Jul 31:2024.07.30.605945. doi: 10.1101/2024.07.30.605945.

DOI:10.1101/2024.07.30.605945

PMID:39131324

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11312557/

Abstract

Methods for assessing compound identification confidence in metabolomics and related studies have been debated and actively researched for the past two decades. The earliest effort, in 2007, focused primarily on mass spectrometry and nuclear magnetic resonance spectroscopy and resulted in four recommended levels of metabolite identification confidence: the Metabolomics Standards Initiative (MSI) Levels. In 2014, the original MSI Levels were expanded to five levels (including two sublevels) to facilitate communication of compound identification confidence in high-resolution mass spectrometry studies. Further refinements of identification levels have occurred, for example to accommodate the use of ion mobility spectrometry in metabolomics workflows, and alternative approaches to communicating compound identification confidence have also been developed based on identification points schema. However, neither qualitative levels of identification confidence nor quantitative scoring systems address the degree of ambiguity in compound identifications in the context of the chemical space being considered, are easily automated, or are transferable between analytical platforms. In this perspective, we propose that the metabolomics and related communities consider identification probability as an approach for automated and transferable assessment of compound identification and ambiguity in metabolomics and related studies. Identification probability is defined simply as 1/N, where N is the number of compounds in a reference library or chemical space that match an experimentally measured molecule within user-defined measurement precision(s), for example mass measurement accuracy or retention time accuracy. We demonstrate the utility of identification probability in an analysis of multi-property reference libraries constructed from the Human Metabolome Database and computational property predictions, provide guidance to the community on transparent implementation of the concept, and invite the community to further evaluate this concept in parallel with their current preferred methods for assessing metabolite identification confidence.
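The 1/N definition in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The `LibraryEntry` type, the function name `identification_probability`, the toy library values, the choice of a ppm mass tolerance computed relative to the measured mass, and the convention of returning 0.0 when no library entry matches are all assumptions made here for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LibraryEntry:
    """One reference compound (hypothetical structure, illustrative only)."""
    name: str
    mass: float  # monoisotopic mass in Da
    rt: float    # retention time in minutes


def identification_probability(library, mass, rt, ppm_tol, rt_tol):
    """Return (probability, matches) for one measured feature.

    N is the number of library entries whose mass lies within ppm_tol
    (relative to the measured mass) and whose retention time lies within
    rt_tol minutes. The probability is 1/N. Returning 0.0 for N == 0 is
    an assumption of this sketch; the abstract does not define that case.
    """
    matches = [
        entry for entry in library
        if abs(entry.mass - mass) / mass * 1e6 <= ppm_tol
        and abs(entry.rt - rt) <= rt_tol
    ]
    n = len(matches)
    return (1.0 / n if n else 0.0), matches


# Toy reference library with made-up values, not real measurements.
TOY_LIBRARY = [
    LibraryEntry("compound A", 131.0946, 5.10),
    LibraryEntry("compound B", 131.0946, 5.40),
    LibraryEntry("compound C", 131.0950, 5.15),
    LibraryEntry("compound D", 180.0634, 2.00),
]

if __name__ == "__main__":
    # The same measured feature evaluated under three tolerance settings.
    for ppm_tol, rt_tol in [(5.0, 0.5), (5.0, 0.1), (1.0, 0.1)]:
        prob, hits = identification_probability(
            TOY_LIBRARY, mass=131.0946, rt=5.12, ppm_tol=ppm_tol, rt_tol=rt_tol
        )
        names = ", ".join(entry.name for entry in hits)
        print(f"ppm<={ppm_tol}, rt<={rt_tol} min: N={len(hits)}, P={prob:.3f} ({names})")
```

In this toy setting, tightening the tolerances shrinks N from 3 to 2 to 1, so the probability rises from 1/3 to 1/2 to 1. The value depends on the library searched and the precision settings used, which is why both need to be reported alongside it.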

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1445/11312557/6fa6f4246b73/nihpp-2024.07.30.605945v1-f0001.jpg
