• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于分子指纹的代谢物质谱识别的贝叶斯网络。

Bayesian networks for mass spectrometric metabolite identification via molecular fingerprints.

机构信息

Chair for Bioinformatics, Friedrich-Schiller-University, Jena, Germany.

出版信息

Bioinformatics. 2018 Jul 1;34(13):i333-i340. doi: 10.1093/bioinformatics/bty245.

DOI:10.1093/bioinformatics/bty245
PMID:29949965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6022630/
Abstract

MOTIVATION

Metabolites, small molecules that are involved in cellular reactions, provide a direct functional signature of cellular state. Untargeted metabolomics experiments usually rely on tandem mass spectrometry to identify the thousands of compounds in a biological sample. Recently, we presented CSI:FingerID for searching in molecular structure databases using tandem mass spectrometry data. CSI:FingerID predicts a molecular fingerprint that encodes the structure of the query compound, then uses this to search a molecular structure database such as PubChem. Scoring of the predicted query fingerprint and deterministic target fingerprints is carried out assuming independence between the molecular properties constituting the fingerprint.

RESULTS

We present a scoring that takes into account dependencies between molecular properties. As before, we predict posterior probabilities of molecular properties using machine learning. Dependencies between molecular properties are modeled as a Bayesian tree network; the tree structure is estimated on the fly from the instance data. For each edge, we also estimate the expected covariance between the two random variables. For fixed marginal probabilities, we then estimate conditional probabilities using the known covariance. Now, the corrected posterior probability of each candidate can be computed, and candidates are ranked by this score. Modeling dependencies improves identification rates of CSI:FingerID by 2.85 percentage points.

AVAILABILITY AND IMPLEMENTATION

The new scoring Bayesian (fixed tree) is integrated into SIRIUS 4.0 (https://bio.informatik.uni-jena.de/software/sirius/).

摘要

动机

代谢物是参与细胞反应的小分子,提供了细胞状态的直接功能特征。非靶向代谢组学实验通常依赖串联质谱来鉴定生物样本中的数千种化合物。最近,我们提出了 CSI:FingerID,用于使用串联质谱数据搜索分子结构数据库。CSI:FingerID 预测一个分子指纹,该指纹编码查询化合物的结构,然后使用该指纹在 PubChem 等分子结构数据库中进行搜索。假设指纹中的分子性质之间相互独立,对预测查询指纹和确定性目标指纹进行评分。

结果

我们提出了一种考虑分子性质之间相关性的评分方法。与之前一样,我们使用机器学习来预测分子性质的后验概率。将分子性质之间的相关性建模为贝叶斯树网络;树结构是从实例数据中实时估计的。对于每条边,我们还估计两个随机变量之间的预期协方差。对于固定的边际概率,然后使用已知的协方差来估计条件概率。现在,可以计算每个候选者的修正后验概率,并根据该得分对候选者进行排名。通过建模相关性,CSI:FingerID 的识别率提高了 2.85 个百分点。

可用性和实现

新的评分贝叶斯(固定树)已集成到 SIRIUS 4.0(https://bio.informatik.uni-jena.de/software/sirius/)中。

相似文献

1
Bayesian networks for mass spectrometric metabolite identification via molecular fingerprints.基于分子指纹的代谢物质谱识别的贝叶斯网络。
Bioinformatics. 2018 Jul 1;34(13):i333-i340. doi: 10.1093/bioinformatics/bty245.
2
Searching molecular structure databases with tandem mass spectra using CSI:FingerID.使用CSI:FingerID通过串联质谱搜索分子结构数据库。
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12580-5. doi: 10.1073/pnas.1509788112. Epub 2015 Sep 21.
3
Deep kernel learning improves molecular fingerprint prediction from tandem mass spectra.深度学习提高串联质谱分子指纹预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i342-i349. doi: 10.1093/bioinformatics/btac260.
4
SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information.SIRIUS 4:一种快速将串联质谱转化为代谢物结构信息的工具。
Nat Methods. 2019 Apr;16(4):299-302. doi: 10.1038/s41592-019-0344-8. Epub 2019 Mar 18.
5
Metabolite identification through multiple kernel learning on fragmentation trees.基于碎裂树的多核学习进行代谢产物鉴定。
Bioinformatics. 2014 Jun 15;30(12):i157-64. doi: 10.1093/bioinformatics/btu275.
6
Searching molecular structure databases using tandem MS data: are we there yet?使用串联质谱数据搜索分子结构数据库:我们做到了吗?
Curr Opin Chem Biol. 2017 Feb;36:1-6. doi: 10.1016/j.cbpa.2016.12.010. Epub 2016 Dec 22.
7
Metabolite identification and molecular fingerprint prediction through machine learning.通过机器学习进行代谢产物鉴定和分子指纹预测。
Bioinformatics. 2012 Sep 15;28(18):2333-41. doi: 10.1093/bioinformatics/bts437. Epub 2012 Jul 18.
8
Combining Experimental with Computational Infrared and Mass Spectra for High-Throughput Nontargeted Chemical Structure Identification.结合实验与计算红外光谱和质谱用于高通量非靶向化学结构鉴定
Anal Chem. 2023 Aug 15;95(32):11901-11907. doi: 10.1021/acs.analchem.3c00937. Epub 2023 Aug 4.
9
MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation.MetFID:基于人工神经网络的化合物指纹预测代谢物注释。
Metabolomics. 2020 Sep 30;16(10):104. doi: 10.1007/s11306-020-01726-7.
10
Identification of metabolites from tandem mass spectra with a machine learning approach utilizing structural features.利用机器学习方法结合结构特征鉴定串联质谱中的代谢物。
Bioinformatics. 2020 Feb 15;36(4):1213-1218. doi: 10.1093/bioinformatics/btz736.

引用本文的文献

1
Microbial Community Metabolism of Coral Reef Exometabolomes Broadens the Chemodiversity of Labile Dissolved Organic Matter.珊瑚礁胞外代谢组的微生物群落代谢拓宽了不稳定溶解有机物的化学多样性。
Environ Microbiol. 2025 Mar;27(3):e70064. doi: 10.1111/1462-2920.70064.
2
Deep Learning-Based Molecular Fingerprint Prediction for Metabolite Annotation.基于深度学习的代谢物注释分子指纹预测
Metabolites. 2025 Feb 14;15(2):132. doi: 10.3390/metabo15020132.
3
Natural variation in root exudate composition in the genetically structured Arabidopsis thaliana in the Iberian Peninsula.

本文引用的文献

1
Critical Assessment of Small Molecule Identification 2016: automated methods.2016年小分子鉴定的批判性评估:自动化方法
J Cheminform. 2017 Mar 27;9(1):22. doi: 10.1186/s13321-017-0207-1.
2
The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching.化学开发工具包(CDK)v2.0:原子类型标注、描绘、分子式及子结构搜索。
J Cheminform. 2017 Jun 6;9(1):33. doi: 10.1186/s13321-017-0220-4.
3
Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking.
伊比利亚半岛基因结构的拟南芥根系分泌物组成的自然变异
New Phytol. 2025 Feb;245(4):1437-1449. doi: 10.1111/nph.20314. Epub 2024 Dec 10.
4
Identification of Granatane Alkaloids from (Solanaceae) using Molecular Networking and Semisynthesis.利用分子网络和半合成技术鉴定 (茄科)中的格兰塔烷生物碱。
J Nat Prod. 2024 Aug 23;87(8):1914-1920. doi: 10.1021/acs.jnatprod.4c00304. Epub 2024 Jul 22.
5
Broadcasters, receivers, functional groups of metabolites, and the link to heart failure by revealing metabolomic network connectivity.广播器、接收器、代谢物的功能基团,以及通过揭示代谢组学网络连通性与心力衰竭的联系。
Metabolomics. 2024 Jul 7;20(4):71. doi: 10.1007/s11306-024-02141-y.
6
Predicting RP-LC retention indices of structurally unknown chemicals from mass spectrometry data.从质谱数据预测结构未知化学品的反相液相色谱保留指数
J Cheminform. 2023 Feb 24;15(1):28. doi: 10.1186/s13321-023-00699-8.
7
Probabilistic edge inference of gene networks with markov random field-based bayesian learning.基于马尔可夫随机场贝叶斯学习的基因网络概率边推断
Front Genet. 2022 Nov 10;13:1034946. doi: 10.3389/fgene.2022.1034946. eCollection 2022.
8
PubChem 2023 update.PubChem 2023 更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1373-D1380. doi: 10.1093/nar/gkac956.
9
Deep kernel learning improves molecular fingerprint prediction from tandem mass spectra.深度学习提高串联质谱分子指纹预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i342-i349. doi: 10.1093/bioinformatics/btac260.
10
MSNovelist: de novo structure generation from mass spectra.MSNovelist:从头开始从质谱生成结构。
Nat Methods. 2022 Jul;19(7):865-870. doi: 10.1038/s41592-022-01486-3. Epub 2022 May 30.
通过全球天然产物社会分子网络共享和社区管理质谱数据。
Nat Biotechnol. 2016 Aug 9;34(8):828-837. doi: 10.1038/nbt.3597.
4
Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software.氢重排规则:使用 MS-FINDER 软件进行计算 MS/MS 碎裂和结构解析。
Anal Chem. 2016 Aug 16;88(16):7946-58. doi: 10.1021/acs.analchem.6b00770. Epub 2016 Aug 4.
5
Computational Prediction of Electron Ionization Mass Spectra to Assist in GC/MS Compound Identification.计算预测电子离子化质谱,以辅助 GC/MS 化合物鉴定。
Anal Chem. 2016 Aug 2;88(15):7689-97. doi: 10.1021/acs.analchem.6b01622. Epub 2016 Jul 21.
6
Fast metabolite identification with Input Output Kernel Regression.使用输入输出核回归进行快速代谢物鉴定。
Bioinformatics. 2016 Jun 15;32(12):i28-i36. doi: 10.1093/bioinformatics/btw246.
7
Fragmentation trees reloaded.碎片树重新加载。
J Cheminform. 2016 Feb 1;8:5. doi: 10.1186/s13321-016-0116-8. eCollection 2016.
8
MetFrag relaunched: incorporating strategies beyond in silico fragmentation.MetFrag重新推出:纳入计算机辅助碎片化之外的策略。
J Cheminform. 2016 Jan 29;8:3. doi: 10.1186/s13321-016-0115-9. eCollection 2016.
9
Mining molecular structure databases: Identification of small molecules based on fragmentation mass spectrometry data.分子结构数据库挖掘:基于碎片质谱数据的小分子鉴定。
Mass Spectrom Rev. 2017 Sep;36(5):624-633. doi: 10.1002/mas.21489. Epub 2016 Jan 13.
10
KEGG as a reference resource for gene and protein annotation.KEGG作为基因和蛋白质注释的参考资源。
Nucleic Acids Res. 2016 Jan 4;44(D1):D457-62. doi: 10.1093/nar/gkv1070. Epub 2015 Oct 17.