Substructure-based annotation of high-resolution multistage MS(n) spectral trees.

Affiliation

Netherlands eScience Center, Science Park 140, 1098 XG, Amsterdam, The Netherlands.

Publication information

Rapid Commun Mass Spectrom. 2012 Oct 30;26(20):2461-71. doi: 10.1002/rcm.6364.

DOI:10.1002/rcm.6364

PMID:22976213

Abstract

RATIONALE

High-resolution multistage MS(n) data contains detailed information that can be used for structural elucidation of compounds observed in metabolomics studies. However, full exploitation of this complex data requires significant analysis efforts by human experts. In silico methods currently used to support data annotation by assigning substructures of candidate molecules are limited to a single level of MS fragmentation.

METHODS

We present an extended substructure-based approach which allows annotation of hierarchical spectral trees obtained from high-resolution multistage MS(n) experiments. The algorithm yields a hierarchical tree of substructures of a candidate molecule to explain the fragment peaks observed at consecutive levels of the multistage MS(n) spectral tree. A matching score is calculated that indicates how well the candidate structure can explain the observed hierarchical fragmentation pattern.
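The hierarchical constraint described above can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that candidate substructures are already enumerated as sets of atom indices with toy m/z values, and it uses a simple stand-in score (the fraction of peaks explained) rather than the matching score of the paper. All names (`Peak`, `explained_peaks`, `hierarchical_score`) are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Peak:
    """One node of an MS(n) spectral tree: a fragment peak and its sub-fragments."""
    mz: float
    children: list = field(default_factory=list)


def within_ppm(observed, theoretical, ppm):
    """True if the observed m/z lies within a relative tolerance given in ppm."""
    return abs(observed - theoretical) <= theoretical * ppm * 1e-6


def count_peaks(peak):
    """Total number of peaks in the subtree rooted at this peak."""
    return 1 + sum(count_peaks(c) for c in peak.children)


def explained_peaks(peak, parent_atoms, fragments, ppm):
    """Largest number of peaks in this subtree that can be explained.

    A peak counts as explained only if some substructure matches its m/z AND
    that substructure is contained in the one assigned to the parent peak.
    Children of an unexplained peak are not counted (a simplifying assumption).
    """
    best = 0
    for atoms, mz in fragments.items():
        if atoms <= parent_atoms and within_ppm(peak.mz, mz, ppm):
            n = 1 + sum(explained_peaks(c, atoms, fragments, ppm)
                        for c in peak.children)
            best = max(best, n)
    return best


def hierarchical_score(tree, all_atoms, fragments, ppm=5.0):
    """Stand-in score: fraction of tree peaks explained hierarchically."""
    return explained_peaks(tree, all_atoms, fragments, ppm) / count_peaks(tree)


# Toy spectral tree: precursor 100 -> fragment 60 -> sub-fragment 25.
tree = Peak(100.0, [Peak(60.0, [Peak(25.0)])])
all_atoms = frozenset({0, 1, 2, 3})

# Both hypothetical candidates contain substructures matching all three m/z
# values, but in candidate B the 25 substructure is not part of the 60 one.
candidate_a = {all_atoms: 100.0, frozenset({0, 1}): 60.0, frozenset({0}): 25.0}
candidate_b = {all_atoms: 100.0, frozenset({0, 1}): 60.0, frozenset({2}): 25.0}

score_a = hierarchical_score(tree, all_atoms, candidate_a)
score_b = hierarchical_score(tree, all_atoms, candidate_b)
```

A flat comparison of masses alone would rate both toy candidates equally, since each offers a substructure for every peak. Only the containment check between consecutive levels separates them, which is the kind of information the spectral tree adds.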

RESULTS

The method is applied to MS(n) spectral trees of a set of compounds representing important chemical classes in metabolomics. Based on the calculated score, the correct molecules were successfully prioritized among extensive sets of candidate structures retrieved from the PubChem database.

CONCLUSIONS

The results indicate that the inclusion of subsequent levels of fragmentation in the automatic annotation of MS(n) data improves the identification of the correct compounds. We show that, especially in the case of lower mass accuracy, this improvement is not only due to the inclusion of additional fragment ions in the analysis, but also to the specific hierarchical information present in the MS(n) spectral trees. This method may significantly reduce the time required by MS experts to analyze complex MS(n) data.

Similar articles

Metabolite identification using automated comparison of high-resolution multistage mass spectral trees.

Anal Chem. 2012 Jul 3;84(13):5524-34. doi: 10.1021/ac2034216. Epub 2012 Jun 22.

Automatic chemical structure annotation of an LC-MS(n) based metabolic profile from green tea.

Anal Chem. 2013 Jun 18;85(12):6033-40. doi: 10.1021/ac400861a. Epub 2013 May 31.

Database supported candidate search for metabolite identification.

J Integr Bioinform. 2011 Jul 7;8(2):157. doi: 10.2390/biecoll-jib-2011-157.

Polyphenol identification based on systematic and robust high-resolution accurate mass spectrometry fragmentation.

Anal Chem. 2011 Jan 1;83(1):409-16. doi: 10.1021/ac102546x. Epub 2010 Dec 9.

Automated pipeline for de novo metabolite identification using mass-spectrometry-based metabolomics.

Anal Chem. 2013 Apr 2;85(7):3576-83. doi: 10.1021/ac303218u. Epub 2013 Mar 21.

Simple data-reduction method for high-resolution LC-MS data in metabolomics.

Bioanalysis. 2009 Dec;1(9):1551-7. doi: 10.4155/bio.09.146.

De novo analysis of electron impact mass spectra using fragmentation trees.

Anal Chim Acta. 2012 Aug 20;739:67-76. doi: 10.1016/j.aca.2012.06.021. Epub 2012 Jun 27.

Time alignment algorithms based on selected mass traces for complex LC-MS data.

J Proteome Res. 2010 Mar 5;9(3):1483-95. doi: 10.1021/pr9010124.

Identification of triacylglycerol using automated annotation of high resolution multistage mass spectral trees.

Anal Chim Acta. 2016 Oct 12;940:84-91. doi: 10.1016/j.aca.2016.07.036. Epub 2016 Jul 28.

Cited by

mineMS2: annotation of spectral libraries with exact fragmentation patterns.

J Cheminform. 2025 Jul 24;17(1):111. doi: 10.1186/s13321-025-01051-y.

Self-supervised learning of molecular representations from millions of tandem mass spectra using DreaMS.

Nat Biotechnol. 2025 May 23. doi: 10.1038/s41587-025-02663-3.

Knowledge-based in silico fragmentation and annotation of mass spectra for natural products with MassKG.

Comput Struct Biotechnol J. 2024 Sep 7;23:3327-3341. doi: 10.1016/j.csbj.2024.09.001. eCollection 2024 Dec.

ModiFinder: Tandem Mass Spectral Alignment Enables Structural Modification Site Localization.

J Am Soc Mass Spectrom. 2024 Nov 6;35(11):2564-2578. doi: 10.1021/jasms.4c00061. Epub 2024 Jun 3.

Microbial Metabolites Annotation by Mass Spectrometry-Based Metabolomics.

Adv Exp Med Biol. 2023;1439:225-248. doi: 10.1007/978-3-031-41741-2_9.

Characterizing azobenzene disperse dyes and related compounds in house dust and their correlations with other organic contaminant classes.

Environ Pollut. 2023 Nov 15;337:122491. doi: 10.1016/j.envpol.2023.122491. Epub 2023 Sep 12.

Evaluating LC-HRMS metabolomics data processing software using FAIR principles for research software.

Metabolomics. 2023 Feb 6;19(2):11. doi: 10.1007/s11306-023-01974-3.

Critical assessment of chromatographic metadata in publicly available metabolomics data repositories.

Metabolomics. 2022 Nov 27;18(12):97. doi: 10.1007/s11306-022-01956-x.

Compound Identification Strategies in Mass Spectrometry-Based Metabolomics and Pharmacometabolomics.

Handb Exp Pharmacol. 2023;277:43-71. doi: 10.1007/164_2022_617.

Strategies for structure elucidation of small molecules based on LC-MS/MS data from complex biological samples.

Comput Struct Biotechnol J. 2022 Sep 7;20:5085-5097. doi: 10.1016/j.csbj.2022.09.004. eCollection 2022.
