• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过母离子子分子式图确定质谱中的母离子和相关碎片分子式。

Determining the parent and associated fragment formulae in mass spectrometry via the parent subformula graph.

作者信息

Li Sean, Bohman Björn, Flematti Gavin R, Jayatilaka Dylan

机构信息

School of Molecular Sciences, The University of Western Australia, 35 Stirling Highway, Crawley, 6009, Australia.

Department of Plant Protection Biology, Swedish University of Agricultural Sciences, Box 190, 23422, Lomma, Sweden.

出版信息

J Cheminform. 2023 Nov 7;15(1):104. doi: 10.1186/s13321-023-00776-y.

DOI:10.1186/s13321-023-00776-y
PMID:37936244
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10631010/
Abstract

BACKGROUND

Identifying the molecular formula and fragmentation reactions of an unknown compound from its mass spectrum is crucial in areas such as natural product chemistry and metabolomics. We propose a method for identifying the correct candidate formula of an unidentified natural product from its mass spectrum. The method involves scoring the plausibility of parent candidate formulae based on a parent subformula graph (PSG), and two possible metrics relating to the number of edges in the PSG. This method is applicable to both electron-impact mass spectrometry (EI-MS) and tandem mass spectrometry (MS/MS) data. Additionally, this work introduces the two-dimensional fragmentation plot (2DFP) for visualizing PSGs.

RESULTS

Our results suggest that incorporating information regarding the edges of the PSG results in enhanced performance in correctly identifying parent formulae, in comparison to the more well-accepted "MS/MS score", on the 2016 Computational Assessment of Small Molecule Identification (CASMI 2016) data set (76.3 vs 58.9% correct formula identification) and the Research Centre for Toxic Compounds in the Environment (RECETOX) data set (66.2% vs 59.4% correct formula identification). In the extension of our method to identify the correct candidate formula from complex EI-MS data of semiochemicals, our method again performed better (correct formula appearing in the top 4 candidates in 20/23 vs 7/23 cases) than the MS/MS score, and enables the rapid identification of both the correct parent ion mass and the correct parent formula with minimal expert intervention.

CONCLUSION

Our method reliably identifies the correct parent formula even when the mass information is ambiguous. Furthermore, should parent formula identification be successful, the majority of associated fragment formulae can also be correctly identified. Our method can also identify the parent ion and its associated fragments in EI-MS spectra where the identity of the parent ion is unclear due to low quantities and overlapping compounds. Finally, our method does not inherently require empirical fitting of parameters or statistical learning, meaning it is easy to implement and extend upon.

SCIENTIFIC CONTRIBUTION

Developed, implemented and tested new metrics for assessing plausibility of candidate molecular formulae obtained from HR-MS data.

摘要

背景

从质谱图中识别未知化合物的分子式和裂解反应在天然产物化学和代谢组学等领域至关重要。我们提出了一种从质谱图中识别未知天然产物正确候选分子式的方法。该方法包括基于母离子子式图(PSG)对母离子候选分子式的合理性进行评分,以及与PSG中边的数量相关的两种可能的度量标准。此方法适用于电子轰击质谱(EI-MS)和串联质谱(MS/MS)数据。此外,这项工作引入了二维裂解图(2DFP)来可视化PSG。

结果

我们的结果表明,与更被广泛接受的“MS/MS评分”相比,纳入有关PSG边的信息在正确识别母离子分子式方面具有更高的性能,在2016年小分子识别计算评估(CASMI 2016)数据集(正确分子式识别率为76.3%对58.9%)和环境中毒性化合物研究中心(RECETOX)数据集(正确分子式识别率为66.2%对59.4%)上均如此。在将我们的方法扩展以从信息素的复杂EI-MS数据中识别正确候选分子式时,我们的方法再次表现优于MS/MS评分(20/23例中正确分子式出现在前4个候选中,而MS/MS评分为7/23例),并且能够在最少专家干预的情况下快速识别正确的母离子质量和正确的母离子分子式。

结论

即使质量信息不明确,我们的方法也能可靠地识别正确的母离子分子式。此外,如果母离子分子式识别成功,大多数相关的碎片分子式也能被正确识别。我们的方法还可以在EI-MS谱图中识别母离子及其相关碎片,其中由于含量低和化合物重叠导致母离子的身份不明确。最后,我们的方法本质上不需要对参数进行经验拟合或统计学习,这意味着它易于实现和扩展。

科学贡献

开发、实施并测试了用于评估从高分辨质谱(HR-MS)数据获得的候选分子式合理性的新度量标准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/aa70a8bd2943/13321_2023_776_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/1510f0d9b05a/13321_2023_776_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/e31d9ce17171/13321_2023_776_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/61105f73fc59/13321_2023_776_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/05151fc8d510/13321_2023_776_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/2d7105594d81/13321_2023_776_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/aa70a8bd2943/13321_2023_776_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/1510f0d9b05a/13321_2023_776_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/e31d9ce17171/13321_2023_776_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/61105f73fc59/13321_2023_776_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/05151fc8d510/13321_2023_776_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/2d7105594d81/13321_2023_776_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc71/10631010/aa70a8bd2943/13321_2023_776_Fig6_HTML.jpg

相似文献

1
Determining the parent and associated fragment formulae in mass spectrometry via the parent subformula graph.通过母离子子分子式图确定质谱中的母离子和相关碎片分子式。
J Cheminform. 2023 Nov 7;15(1):104. doi: 10.1186/s13321-023-00776-y.
2
Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons.电子电离高分辨率质谱的自动碎片化学式注释:应用于卤代烃的大气测量
J Cheminform. 2021 Oct 4;13(1):78. doi: 10.1186/s13321-021-00544-w.
3
The Compound Characteristics Comparison (CCC) approach: a tool for improving confidence in natural compound identification.化合物特征比较(CCC)方法:一种提高天然化合物鉴定可信度的工具。
Food Addit Contam Part A Chem Anal Control Expo Risk Assess. 2018 Nov;35(11):2145-2157. doi: 10.1080/19440049.2018.1523572. Epub 2018 Oct 23.
4
Molecular formula analysis by an MS/MS/MS technique to expedite dereplication of natural products.采用串联质谱技术进行分子式分析以加速天然产物的去重复化。
Anal Chem. 2007 Feb 1;79(3):1187-97. doi: 10.1021/ac061391o.
5
Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software.氢重排规则:使用 MS-FINDER 软件进行计算 MS/MS 碎裂和结构解析。
Anal Chem. 2016 Aug 16;88(16):7946-58. doi: 10.1021/acs.analchem.6b00770. Epub 2016 Aug 4.
6
Fragment formula calculator (FFC): determination of chemical formulas for fragment ions in mass spectrometric data.碎片公式计算器 (FFC):用于确定质谱数据中碎片离子的化学式。
Anal Chem. 2014 Feb 18;86(4):2221-8. doi: 10.1021/ac403879d. Epub 2014 Feb 5.
7
Improved molecular level identification of organic compounds using comprehensive two-dimensional chromatography, dual ionization energies and high resolution mass spectrometry.使用全二维色谱、双离子化能和高分辨质谱提高有机化合物的分子水平鉴定。
Analyst. 2017 Jul 7;142(13):2395-2403. doi: 10.1039/c7an00625j. Epub 2017 May 30.
8
Automatic Compound Annotation from Mass Spectrometry Data Using MAGMa.使用MAGMa从质谱数据进行自动化合物注释
Mass Spectrom (Tokyo). 2014;3(Spec Iss 2):S0033. doi: 10.5702/massspectrometry.S0033. Epub 2014 Jul 2.
9
Towards de novo identification of metabolites by analyzing tandem mass spectra.通过分析串联质谱进行代谢物的从头鉴定。
Bioinformatics. 2008 Aug 15;24(16):i49-i55. doi: 10.1093/bioinformatics/btn270.
10
MassGenie: A Transformer-Based Deep Learning Method for Identifying Small Molecules from Their Mass Spectra.MassGenie:一种基于 Transformer 的深度学习方法,用于从其质谱中识别小分子。
Biomolecules. 2021 Nov 30;11(12):1793. doi: 10.3390/biom11121793.

本文引用的文献

1
BUDDY: molecular formula discovery via bottom-up MS/MS interrogation.通过自下而上的 MS/MS 询问发现分子公式。
Nat Methods. 2023 Jun;20(6):881-890. doi: 10.1038/s41592-023-01850-x. Epub 2023 Apr 13.
2
Automated fragment formula annotation for electron ionisation, high resolution mass spectrometry: application to atmospheric measurements of halocarbons.电子电离高分辨率质谱的自动碎片化学式注释:应用于卤代烃的大气测量
J Cheminform. 2021 Oct 4;13(1):78. doi: 10.1186/s13321-021-00544-w.
3
Open, High-Resolution EI+ Spectral Library of Anthropogenic Compounds.
人为化合物开放、高分辨率 EI+ 光谱库。
Front Public Health. 2021 Mar 9;9:622558. doi: 10.3389/fpubh.2021.622558. eCollection 2021.
4
MFAssignR: Molecular formula assignment software for ultrahigh resolution mass spectrometry analysis of environmental complex mixtures.MFAssignR:用于环境复杂混合物超高分辨率质谱分析的分子式分配软件。
Environ Res. 2020 Dec;191:110114. doi: 10.1016/j.envres.2020.110114. Epub 2020 Aug 28.
5
Structural Characterization of Dissolved Organic Matter at the Chemical Formula Level Using TIMS-FT-ICR MS/MS.采用 TIMS-FT-ICR MS/MS 技术在化学式水平上对溶解有机物进行结构特征分析。
Anal Chem. 2020 Sep 1;92(17):11960-11966. doi: 10.1021/acs.analchem.0c02347. Epub 2020 Aug 21.
6
A general approach to calculating isotopic distributions for mass spectrometry.一种用于计算质谱同位素分布的通用方法。
J Mass Spectrom. 2020 Aug;55(8):e4498. doi: 10.1002/jms.4498. Epub 2020 May 4.
7
A Specific Blend of Drakolide and Hydroxymethylpyrazines: An Unusual Pollinator Sexual Attractant Used by the Endangered Orchid Drakaea micrantha.特定配比的 Drakolide 和羟甲基吡嗪:一种不寻常的传粉者性引诱剂,被濒危兰花 Drakaea micrantha 使用。
Angew Chem Int Ed Engl. 2020 Jan 13;59(3):1124-1128. doi: 10.1002/anie.201911636. Epub 2019 Dec 12.
8
2-(Tetrahydrofuran-2-yl)acetic Acid and Ester Derivatives as Long-Range Pollinator Attractants in the Sexually Deceptive Orchid Cryptostylis ovata.2-(四氢呋喃-2-基)乙酸及其酯衍生物作为性欺骗兰花Cryptostylis ovata 的远距离传粉者引诱剂。
J Nat Prod. 2019 May 24;82(5):1107-1113. doi: 10.1021/acs.jnatprod.8b00772. Epub 2019 Mar 28.
9
SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information.SIRIUS 4:一种快速将串联质谱转化为代谢物结构信息的工具。
Nat Methods. 2019 Apr;16(4):299-302. doi: 10.1038/s41592-019-0344-8. Epub 2019 Mar 18.
10
UltraMassExplorer: a browser-based application for the evaluation of high-resolution mass spectrometric data.超质量浏览器:一款基于浏览器的用于评估高分辨率质谱数据的应用程序。
Rapid Commun Mass Spectrom. 2019 Jan 30;33(2):193-202. doi: 10.1002/rcm.8315.