Suppr超能文献

光谱熵在小分子化合物鉴定方面优于 MS/MS 点积相似度。

Spectral entropy outperforms MS/MS dot product similarity for small-molecule compound identification.

机构信息

West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA, USA.

Olobion, Parc Científic de Barcelona, Barcelona, Spain.

出版信息

Nat Methods. 2021 Dec;18(12):1524-1531. doi: 10.1038/s41592-021-01331-z. Epub 2021 Dec 2.

Abstract

Compound identification in small-molecule research, such as untargeted metabolomics or exposome research, relies on matching tandem mass spectrometry (MS/MS) spectra against experimental or in silico mass spectral libraries. Most software programs use dot product similarity scores. Here we introduce the concept of MS/MS spectral entropy to improve scoring results in MS/MS similarity searches via library matching. Entropy similarity outperformed 42 alternative similarity algorithms, including dot product similarity, when searching 434,287 spectra against the high-quality NIST20 library. Entropy similarity scores proved to be highly robust even when we added different levels of noise ions. When we applied entropy levels to 37,299 experimental spectra of natural products, false discovery rates of less than 10% were observed at entropy similarity score 0.75. Experimental human gut metabolome data were used to confirm that entropy similarity largely improved the accuracy of MS-based annotations in small-molecule research to false discovery rates below 10%, annotated new compounds and provided the basis to automatically flag poor-quality, noisy spectra.

摘要

在小分子研究(如非靶向代谢组学或暴露组学研究)中,化合物鉴定依赖于将串联质谱(MS/MS)谱与实验或计算质谱谱库进行匹配。大多数软件程序使用点积相似度得分。在这里,我们引入 MS/MS 光谱熵的概念,通过库匹配来提高 MS/MS 相似度搜索中的评分结果。在对高质量 NIST20 库进行 434,287 次光谱搜索时,熵相似度优于包括点积相似度在内的 42 种替代相似度算法。即使在添加不同水平的噪声离子时,熵相似度得分也被证明具有高度的稳健性。当我们将熵水平应用于 37,299 种天然产物的实验光谱时,在熵相似度得分 0.75 时,假发现率低于 10%。我们使用实验性人类肠道代谢组学数据来证实,熵相似度极大地提高了基于 MS 的小分子研究中注释的准确性,假发现率低于 10%,注释了新的化合物,并为自动标记低质量、噪声光谱提供了基础。

相似文献

6
Methods to Calculate Spectrum Similarity.计算光谱相似度的方法。
Methods Mol Biol. 2017;1549:75-100. doi: 10.1007/978-1-4939-6740-7_7.
7
Flash entropy search to query all mass spectral libraries in real time.实时查询所有质谱文库的 Flash 熵搜索。
Nat Methods. 2023 Oct;20(10):1475-1478. doi: 10.1038/s41592-023-02012-9. Epub 2023 Sep 21.

引用本文的文献

本文引用的文献

3
"Lipidomics": Mass spectrometric and chemometric analyses of lipids.脂质组学:脂质的质谱分析和化学计量学分析。
Adv Drug Deliv Rev. 2020;159:294-307. doi: 10.1016/j.addr.2020.06.009. Epub 2020 Jun 14.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验