• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

小分子鉴定中质谱和保留时间信息集成的概率框架。

Probabilistic framework for integration of mass spectrum and retention time information in small molecule identification.

机构信息

Department of Computer Science, School of Science, Aalto University, Espoo, Finland.

School of Computing Science, University of Glasgow, Glasgow, UK.

出版信息

Bioinformatics. 2021 Jul 19;37(12):1724-1731. doi: 10.1093/bioinformatics/btaa998.

DOI:10.1093/bioinformatics/btaa998
PMID:33244585
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8289373/
Abstract

MOTIVATION

Identification of small molecules in a biological sample remains a major bottleneck in molecular biology, despite a decade of rapid development of computational approaches for predicting molecular structures using mass spectrometry (MS) data. Recently, there has been increasing interest in utilizing other information sources, such as liquid chromatography (LC) retention time (RT), to improve identifications solely based on MS information, such as precursor mass-per-charge and tandem mass spectrometry (MS2).

RESULTS

We put forward a probabilistic modelling framework to integrate MS and RT data of multiple features in an LC-MS experiment. We model the MS measurements and all pairwise retention order information as a Markov random field and use efficient approximate inference for scoring and ranking potential molecular structures. Our experiments show improved identification accuracy by combining MS2 data and retention orders using our approach, thereby outperforming state-of-the-art methods. Furthermore, we demonstrate the benefit of our model when only a subset of LC-MS features has MS2 measurements available besides MS1.

AVAILABILITY AND IMPLEMENTATION

Software and data are freely available at https://github.com/aalto-ics-kepaco/msms_rt_score_integration.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

尽管在使用质谱 (MS) 数据预测分子结构的计算方法方面已经取得了十年的快速发展,但在生物样本中识别小分子仍然是分子生物学的主要瓶颈。最近,人们越来越感兴趣地利用其他信息源,例如液相色谱 (LC) 保留时间 (RT),来改进仅基于 MS 信息(如前体质量电荷比和串联质谱 (MS2))的鉴定。

结果

我们提出了一个概率建模框架,用于整合 LC-MS 实验中多个特征的 MS 和 RT 数据。我们将 MS 测量值和所有成对的保留顺序信息建模为马尔可夫随机场,并使用有效的近似推理来对潜在分子结构进行评分和排序。我们的实验表明,通过结合使用我们的方法的 MS2 数据和保留顺序,可以提高鉴定准确性,从而优于最先进的方法。此外,我们还证明了当除了 MS1 之外,LC-MS 特征的子集具有 MS2 测量值时,我们的模型的益处。

可用性和实现

软件和数据可在 https://github.com/aalto-ics-kepaco/msms_rt_score_integration 上免费获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15a1/8289373/67e8ed7a4944/btaa998f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15a1/8289373/2c6b902e6dc2/btaa998f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15a1/8289373/f3a31ad6c898/btaa998f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15a1/8289373/67e8ed7a4944/btaa998f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15a1/8289373/2c6b902e6dc2/btaa998f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15a1/8289373/f3a31ad6c898/btaa998f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15a1/8289373/67e8ed7a4944/btaa998f3.jpg

相似文献

1
Probabilistic framework for integration of mass spectrum and retention time information in small molecule identification.小分子鉴定中质谱和保留时间信息集成的概率框架。
Bioinformatics. 2021 Jul 19;37(12):1724-1731. doi: 10.1093/bioinformatics/btaa998.
2
Liquid-chromatography retention order prediction for metabolite identification.用于代谢物鉴定的液相色谱保留顺序预测。
Bioinformatics. 2018 Sep 1;34(17):i875-i883. doi: 10.1093/bioinformatics/bty590.
3
Relative Retention Time Estimation Improves N-Glycopeptide Identifications by LC-MS/MS.相对保留时间估计通过液相色谱-串联质谱法改善N-糖肽鉴定
J Proteome Res. 2020 May 1;19(5):2113-2121. doi: 10.1021/acs.jproteome.0c00051. Epub 2020 Apr 10.
4
Shape-based feature matching improves protein identification via LC-MS and tandem MS.基于形状的特征匹配通过液相色谱-质谱联用和串联质谱提高蛋白质鉴定水平。
J Comput Biol. 2011 Apr;18(4):547-57. doi: 10.1089/cmb.2010.0155. Epub 2011 Mar 21.
5
LIQUID: an-open source software for identifying lipids in LC-MS/MS-based lipidomics data.LIQUID:一款用于在基于液相色谱-串联质谱的脂质组学数据中鉴定脂质的开源软件。
Bioinformatics. 2017 Jun 1;33(11):1744-1746. doi: 10.1093/bioinformatics/btx046.
6
CliqueMS: a computational tool for annotating in-source metabolite ions from LC-MS untargeted metabolomics data based on a coelution similarity network.CliqueMS:一种基于共流出相似性网络的用于从 LC-MS 非靶向代谢组学数据中注释源内代谢物离子的计算工具。
Bioinformatics. 2019 Oct 15;35(20):4089-4097. doi: 10.1093/bioinformatics/btz207.
7
SimExTargId: a comprehensive package for real-time LC-MS data acquisition and analysis.SimExTargId:一个用于实时 LC-MS 数据采集和分析的综合软件包。
Bioinformatics. 2018 Oct 15;34(20):3589-3590. doi: 10.1093/bioinformatics/bty218.
8
MS2AI: automated repurposing of public peptide LC-MS data for machine learning applications.MS2AI:用于机器学习应用的公共肽段液相色谱-质谱数据的自动重新利用。
Bioinformatics. 2022 Jan 12;38(3):875-877. doi: 10.1093/bioinformatics/btab701.
9
Improved identification and quantification of peptides in mass spectrometry data via chemical and random additive noise elimination (CRANE).通过化学和随机加性噪声消除(CRANE)提高质谱数据中肽的鉴定和定量。
Bioinformatics. 2021 Dec 11;37(24):4719-4726. doi: 10.1093/bioinformatics/btab563.
10
Targeted realignment of LC-MS profiles by neighbor-wise compound-specific graphical time warping with misalignment detection.基于邻接法的化合物特异性图形时间扭曲与错配检测实现 LC-MS 图谱的靶向重排。
Bioinformatics. 2020 May 1;36(9):2862-2871. doi: 10.1093/bioinformatics/btaa037.

引用本文的文献

1
On Selecting Robust Approaches for Learning Predictive Biomarkers in Metabolomics Data Sets.关于选择稳健方法以在代谢组学数据集中学习预测性生物标志物
Anal Chem. 2025 Jun 24;97(24):12669-12678. doi: 10.1021/acs.analchem.5c01049. Epub 2025 Jun 12.
2
ROASMI: accelerating small molecule identification by repurposing retention data.ROASMI:通过重新利用保留数据加速小分子鉴定
J Cheminform. 2025 Feb 14;17(1):20. doi: 10.1186/s13321-025-00968-8.
3
MAD HATTER Correctly Annotates 98% of Small Molecule Tandem Mass Spectra Searching in PubChem.

本文引用的文献

1
Current status of retention time prediction in metabolite identification.代谢物鉴定中保留时间预测的现状。
J Sep Sci. 2020 May;43(9-10):1746-1754. doi: 10.1002/jssc.202000060. Epub 2020 Apr 6.
2
The METLIN small molecule dataset for machine learning-based retention time prediction.基于机器学习的保留时间预测的 METLIN 小分子数据集。
Nat Commun. 2019 Dec 20;10(1):5811. doi: 10.1038/s41467-019-13680-7.
3
Taxonomically Informed Scoring Enhances Confidence in Natural Products Annotation.基于分类学的评分提高了天然产物注释的可信度。
疯帽匠在PubChem中搜索时能正确注释98%的小分子串联质谱图。
Metabolites. 2023 Feb 21;13(3):314. doi: 10.3390/metabo13030314.
4
Strategies for structure elucidation of small molecules based on LC-MS/MS data from complex biological samples.基于复杂生物样品的液相色谱-串联质谱数据解析小分子结构的策略。
Comput Struct Biotechnol J. 2022 Sep 7;20:5085-5097. doi: 10.1016/j.csbj.2022.09.004. eCollection 2022.
5
Probabilistic metabolite annotation using retention time prediction and meta-learned projections.利用保留时间预测和元学习投影进行概率性代谢物注释。
J Cheminform. 2022 Jun 7;14(1):33. doi: 10.1186/s13321-022-00613-8.
Front Plant Sci. 2019 Oct 25;10:1329. doi: 10.3389/fpls.2019.01329. eCollection 2019.
4
ADAPTIVE: leArning DAta-dePendenT, concIse molecular VEctors for fast, accurate metabolite identification from tandem mass spectra.ADAPTIVE:从串联质谱中快速、准确识别代谢物的学习数据依赖、简洁的分子向量。
Bioinformatics. 2019 Jul 15;35(14):i164-i172. doi: 10.1093/bioinformatics/btz319.
5
Integrated Probabilistic Annotation: A Bayesian-Based Annotation Method for Metabolomic Profiles Integrating Biochemical Connections, Isotope Patterns, and Adduct Relationships.集成概率标注:一种基于贝叶斯的代谢组学谱注释方法,整合了生化联系、同位素模式和加合物关系。
Anal Chem. 2019 Oct 15;91(20):12799-12807. doi: 10.1021/acs.analchem.9b02354. Epub 2019 Sep 30.
6
Improved Small Molecule Identification through Learning Combinations of Kernel Regression Models.通过学习核回归模型的组合改进小分子识别
Metabolites. 2019 Aug 1;9(8):160. doi: 10.3390/metabo9080160.
7
Quantitative Structure-Retention Relationships with Non-Linear Programming for Prediction of Chromatographic Elution Order.采用非线性规划进行定量结构保留关系预测色谱洗脱顺序。
Int J Mol Sci. 2019 Jul 12;20(14):3443. doi: 10.3390/ijms20143443.
8
Improving MetFrag with statistical learning of fragment annotations.利用片段注释的统计学习改进 MetFrag。
BMC Bioinformatics. 2019 Jul 5;20(1):376. doi: 10.1186/s12859-019-2954-7.
9
Predicting Ion Mobility Collision Cross-Sections Using a Deep Neural Network: DeepCCS.使用深度神经网络预测离子淌度碰撞截面:DeepCCS。
Anal Chem. 2019 Apr 16;91(8):5191-5199. doi: 10.1021/acs.analchem.8b05821. Epub 2019 Apr 1.
10
SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information.SIRIUS 4:一种快速将串联质谱转化为代谢物结构信息的工具。
Nat Methods. 2019 Apr;16(4):299-302. doi: 10.1038/s41592-019-0344-8. Epub 2019 Mar 18.