• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于谱图匹配的大规模代谢组学注释的显著性估计。

Significance estimation for large scale metabolomics annotations by spectral matching.

机构信息

Chair for Bioinformatics, Friedrich Schiller University Jena, Jena, 07743, Germany.

RNA Bioinformatics and High Throughput Analysis, Friedrich Schiller University Jena, Jena, 07743, Germany.

出版信息

Nat Commun. 2017 Nov 14;8(1):1494. doi: 10.1038/s41467-017-01318-5.

DOI:10.1038/s41467-017-01318-5
PMID:29133785
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5684233/
Abstract

The annotation of small molecules in untargeted mass spectrometry relies on the matching of fragment spectra to reference library spectra. While various spectrum-spectrum match scores exist, the field lacks statistical methods for estimating the false discovery rates (FDR) of these annotations. We present empirical Bayes and target-decoy based methods to estimate the false discovery rate (FDR) for 70 public metabolomics data sets. We show that the spectral matching settings need to be adjusted for each project. By adjusting the scoring parameters and thresholds, the number of annotations rose, on average, by +139% (ranging from -92 up to +5705%) when compared with a default parameter set available at GNPS. The FDR estimation methods presented will enable a user to assess the scoring criteria for large scale analysis of mass spectrometry based metabolomics data that has been essential in the advancement of proteomics, transcriptomics, and genomics science.

摘要

在非靶向质谱中,小分子的注释依赖于碎片谱与参考文库谱的匹配。虽然存在各种谱-谱匹配评分,但该领域缺乏估计这些注释的假发现率 (FDR) 的统计方法。我们提出了经验贝叶斯和基于靶标诱饵的方法来估计 70 个公共代谢组学数据集的假发现率 (FDR)。我们表明,需要针对每个项目调整光谱匹配设置。通过调整评分参数和阈值,与 GNPS 提供的默认参数集相比,注释数量平均增加了 +139%(范围从-92 到+5705%)。所提出的 FDR 估计方法将使用户能够评估基于质谱的代谢组学数据的大规模分析的评分标准,这对于蛋白质组学、转录组学和基因组学科学的发展至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/bfd9d2f419a4/41467_2017_1318_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/86f8decc80e6/41467_2017_1318_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/3a4d841b0177/41467_2017_1318_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/e70f01c541b1/41467_2017_1318_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/bfd9d2f419a4/41467_2017_1318_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/86f8decc80e6/41467_2017_1318_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/3a4d841b0177/41467_2017_1318_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/e70f01c541b1/41467_2017_1318_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff5/5684233/bfd9d2f419a4/41467_2017_1318_Fig4_HTML.jpg

相似文献

1
Significance estimation for large scale metabolomics annotations by spectral matching.基于谱图匹配的大规模代谢组学注释的显著性估计。
Nat Commun. 2017 Nov 14;8(1):1494. doi: 10.1038/s41467-017-01318-5.
2
Ion entropy and accurate entropy-based FDR estimation in metabolomics.代谢组学中的离子熵和基于准确熵的 FDR 估计。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae056.
3
Artificial decoy spectral libraries for false discovery rate estimation in spectral library searching in proteomics.用于蛋白质组学中基于光谱库搜索的错误发现率估计的人工诱饵光谱库。
J Proteome Res. 2010 Jan;9(1):605-10. doi: 10.1021/pr900947u.
4
Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.用于鸟枪法蛋白质组学的改进型错误发现率估计程序
J Proteome Res. 2015 Aug 7;14(8):3148-61. doi: 10.1021/acs.jproteome.5b00081. Epub 2015 Jul 27.
5
False discovery rate estimation using candidate peptides for each spectrum.使用每个谱图的候选肽进行错误发现率估计。
BMC Bioinformatics. 2022 Nov 1;23(1):454. doi: 10.1186/s12859-022-05002-4.
6
Spectral entropy outperforms MS/MS dot product similarity for small-molecule compound identification.光谱熵在小分子化合物鉴定方面优于 MS/MS 点积相似度。
Nat Methods. 2021 Dec;18(12):1524-1531. doi: 10.1038/s41592-021-01331-z. Epub 2021 Dec 2.
7
Modeling Lower-Order Statistics to Enable Decoy-Free FDR Estimation in Proteomics.对低阶统计量进行建模以实现蛋白质组学中无诱饵的错误发现率估计。
J Proteome Res. 2023 Apr 7;22(4):1159-1171. doi: 10.1021/acs.jproteome.2c00604. Epub 2023 Mar 24.
8
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships.Spec2Vec:通过学习结构关系提高质谱相似性评分。
PLoS Comput Biol. 2021 Feb 16;17(2):e1008724. doi: 10.1371/journal.pcbi.1008724. eCollection 2021 Feb.
9
Decoy methods for assessing false positives and false discovery rates in shotgun proteomics.用于评估鸟枪法蛋白质组学中假阳性和错误发现率的诱饵方法。
Anal Chem. 2009 Jan 1;81(1):146-59. doi: 10.1021/ac801664q.
10
Target-Decoy-Based False Discovery Rate Estimation for Large-Scale Metabolite Identification.基于靶标-诱饵的大规模代谢物识别假发现率估计。
J Proteome Res. 2018 Jul 6;17(7):2328-2334. doi: 10.1021/acs.jproteome.8b00019. Epub 2018 May 29.

引用本文的文献

1
Knowledge and data-driven two-layer networking for accurate metabolite annotation in untargeted metabolomics.用于非靶向代谢组学中准确代谢物注释的知识和数据驱动的双层网络
Nat Commun. 2025 Aug 30;16(1):8118. doi: 10.1038/s41467-025-63536-6.
2
Machine Learning for Enhanced Identification Probability in RPLC/HRMS Nontargeted Workflows.用于提高反相液相色谱/高分辨率质谱非靶向工作流程中鉴定概率的机器学习
Anal Chem. 2025 Aug 26;97(33):18028-18035. doi: 10.1021/acs.analchem.5c01873. Epub 2025 Aug 12.
3
A substrate-multiplexed platform for profiling enzymatic potential of plant family 1 glycosyltransferases.

本文引用的文献

1
Mass Spectrometry Based Molecular 3D-Cartography of Plant Metabolites.基于质谱法的植物代谢物分子三维图谱
Front Plant Sci. 2017 Mar 29;8:429. doi: 10.3389/fpls.2017.00429. eCollection 2017.
2
Current and Future Perspectives on the Structural Identification of Small Molecules in Biological Systems.生物系统中小分子结构鉴定的现状与未来展望
Metabolites. 2016 Dec 15;6(4):46. doi: 10.3390/metabo6040046.
3
Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach.基于目标-诱饵法的鸟枪法蛋白质组学无偏错误发现率估计
用于分析植物1型糖基转移酶酶促潜力的底物多重平台。
Nat Commun. 2025 Jul 10;16(1):6366. doi: 10.1038/s41467-025-61530-6.
4
Dereplication of secondary metabolites from Sophora flavescens using an LC-MS/MS-based molecular networking strategy.基于液相色谱-串联质谱的分子网络策略对苦参次生代谢产物进行去重复化研究。
Sci Rep. 2025 Mar 24;15(1):10148. doi: 10.1038/s41598-025-94958-3.
5
A guide to reverse metabolomics-a framework for big data discovery strategy.反向代谢组学指南——大数据发现策略框架
Nat Protoc. 2025 Feb 28. doi: 10.1038/s41596-024-01136-2.
6
Deep Learning-Based Molecular Fingerprint Prediction for Metabolite Annotation.基于深度学习的代谢物注释分子指纹预测
Metabolites. 2025 Feb 14;15(2):132. doi: 10.3390/metabo15020132.
7
Introducing "Identification Probability" for Automated and Transferable Assessment of Metabolite Identification Confidence in Metabolomics and Related Studies.介绍用于代谢组学及相关研究中代谢物鉴定置信度的自动化和可转移评估的“鉴定概率”。
Anal Chem. 2025 Jan 14;97(1):1-11. doi: 10.1021/acs.analchem.4c04060. Epub 2024 Dec 19.
8
Data Processing of Product Ion Spectra: Methods to Control False Discovery Rate in Compound Search Results for Untargeted Metabolomics.产物离子光谱的数据处理:在非靶向代谢组学的化合物搜索结果中控制错误发现率的方法
Mass Spectrom (Tokyo). 2024;13(1):A0155. doi: 10.5702/massspectrometry.A0155. Epub 2024 Nov 12.
9
In-Depth Chemical Analysis of the Brain Extracellular Space Using Microdialysis with Liquid Chromatography-Tandem Mass Spectrometry.利用液相色谱-串联质谱法对脑细胞外液进行深入的化学分析。
Anal Chem. 2024 Oct 15;96(41):16387-16396. doi: 10.1021/acs.analchem.4c03806. Epub 2024 Oct 3.
10
Structural insights into the mechanism and dynamics of proteorhodopsin biogenesis and retinal scavenging.结构洞察视紫红质生物发生和视网膜清除的机制和动力学。
Nat Commun. 2024 Aug 13;15(1):6950. doi: 10.1038/s41467-024-50960-3.
J Proteome Res. 2017 Feb 3;16(2):393-397. doi: 10.1021/acs.jproteome.6b00144. Epub 2016 Dec 13.
4
Molecular Networking As a Drug Discovery, Drug Metabolism, and Precision Medicine Strategy.分子网络作为一种药物发现、药物代谢和精准医疗策略。
Trends Pharmacol Sci. 2017 Feb;38(2):143-154. doi: 10.1016/j.tips.2016.10.011. Epub 2016 Nov 11.
5
SPLASH, a hashed identifier for mass spectra.SPLASH,一种用于质谱的哈希标识符。
Nat Biotechnol. 2016 Nov 8;34(11):1099-1101. doi: 10.1038/nbt.3689.
6
Dereplication of peptidic natural products through database search of mass spectra.通过质谱数据库搜索对肽类天然产物进行去重复化。
Nat Chem Biol. 2017 Jan;13(1):30-37. doi: 10.1038/nchembio.2219. Epub 2016 Oct 31.
7
Mass Spectrometry-Based Visualization of Molecules Associated with Human Habitats.基于质谱的与人类栖息地相关分子的可视化。
Anal Chem. 2016 Nov 15;88(22):10775-10784. doi: 10.1021/acs.analchem.6b03456. Epub 2016 Oct 24.
8
Metabolomics enables precision medicine: "A White Paper, Community Perspective".代谢组学助力精准医学:“白皮书,社区视角”
Metabolomics. 2016;12(10):149. doi: 10.1007/s11306-016-1094-6. Epub 2016 Sep 2.
9
Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking.通过全球天然产物社会分子网络共享和社区管理质谱数据。
Nat Biotechnol. 2016 Aug 9;34(8):828-837. doi: 10.1038/nbt.3597.
10
How to talk about protein-level false discovery rates in shotgun proteomics.如何在鸟枪法蛋白质组学中探讨蛋白质水平的错误发现率。
Proteomics. 2016 Sep;16(18):2461-9. doi: 10.1002/pmic.201500431.