Suppr超能文献

引入“识别概率”用于代谢组学及相关研究中代谢物识别可信度的自动化和可转移评估。

Introducing 'identification probability' for automated and transferable assessment of metabolite identification confidence in metabolomics and related studies.

作者信息

Metz Thomas O, Chang Christine H, Gautam Vasuk, Anjum Afia, Tian Siyang, Wang Fei, Colby Sean M, Nunez Jamie R, Blumer Madison R, Edison Arthur S, Fiehn Oliver, Jones Dean P, Li Shuzhao, Morgan Edward T, Patti Gary J, Ross Dylan H, Shapiro Madelyn R, Williams Antony J, Wishart David S

机构信息

Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.

Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada.

出版信息

bioRxiv. 2024 Jul 31:2024.07.30.605945. doi: 10.1101/2024.07.30.605945.

Abstract

Methods for assessing compound identification confidence in metabolomics and related studies have been debated and actively researched for the past two decades. The earliest effort in 2007 focused primarily on mass spectrometry and nuclear magnetic resonance spectroscopy and resulted in four recommended levels of metabolite identification confidence - the Metabolite Standards Initiative (MSI) Levels. In 2014, the original MSI Levels were expanded to five levels (including two sublevels) to facilitate communication of compound identification confidence in high resolution mass spectrometry studies. Further refinement in identification levels have occurred, for example to accommodate use of ion mobility spectrometry in metabolomics workflows, and alternate approaches to communicate compound identification confidence also have been developed based on identification points schema. However, neither qualitative levels of identification confidence nor quantitative scoring systems address the degree of ambiguity in compound identifications in context of the chemical space being considered, are easily automated, or are transferable between analytical platforms. In this perspective, we propose that the metabolomics and related communities consider identification probability as an approach for automated and transferable assessment of compound identification and ambiguity in metabolomics and related studies. Identification probability is defined simply as 1/N, where N is the number of compounds in a reference library or chemical space that match to an experimentally measured molecule within user-defined measurement precision(s), for example mass measurement or retention time accuracy, etc. We demonstrate the utility of identification probability in an analysis of multi-property reference libraries constructed from the Human Metabolome Database and computational property predictions, provide guidance to the community in transparent implementation of the concept, and invite the community to further evaluate this concept in parallel with their current preferred methods for assessing metabolite identification confidence.

摘要

在过去二十年中,代谢组学及相关研究中评估化合物鉴定可信度的方法一直存在争议并受到积极研究。2007年的早期工作主要集中在质谱和核磁共振光谱上,结果产生了四个推荐的代谢物鉴定可信度水平——代谢物标准倡议(MSI)水平。2014年,最初的MSI水平扩展到五个水平(包括两个子水平),以促进在高分辨率质谱研究中交流化合物鉴定可信度。鉴定水平进一步细化,例如以适应代谢组学工作流程中离子淌度光谱的使用,并且基于鉴定点模式也开发了交流化合物鉴定可信度的替代方法。然而,无论是鉴定可信度的定性水平还是定量评分系统,都没有解决在所考虑的化学空间背景下化合物鉴定中的模糊程度问题,不易自动化,也不能在分析平台之间转移。从这个角度来看,我们建议代谢组学及相关领域将鉴定概率作为一种在代谢组学及相关研究中对化合物鉴定和模糊性进行自动化和可转移评估的方法。鉴定概率简单定义为1/N,其中N是参考库或化学空间中与在用户定义的测量精度(例如质量测量或保留时间准确性等)内实验测量的分子匹配的化合物数量。我们在对由人类代谢组数据库构建的多属性参考库和计算属性预测的分析中展示了鉴定概率的实用性,为该领域透明实施这一概念提供指导,并邀请该领域与他们当前评估代谢物鉴定可信度的首选方法并行进一步评估这一概念。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1445/11312557/6fa6f4246b73/nihpp-2024.07.30.605945v1-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验