• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基于机器学习的保留时间预测增强液相色谱-高分辨质谱数据中的脂质鉴定

Enhancing lipid identification in LC-HRMS data through machine learning-based retention time prediction.

作者信息

Noreldeen Hamada A A

机构信息

National Institute of Oceanography and Fisheries, NIOF, Cairo, Egypt.

出版信息

J Chromatogr A. 2025 Feb 8;1742:465650. doi: 10.1016/j.chroma.2024.465650. Epub 2025 Jan 3.

DOI:10.1016/j.chroma.2024.465650
PMID:39798479
Abstract

The comprehensive identification of peaks in untargeted lipidomics using LC-MS/MS remains a significant challenge. Confidence in lipid annotation can be greatly improved by integrating a highly accurate machine learning-based retention time prediction model. Such an approach enables the identification of lipids for understanding pathogenic mechanisms, biomarker discovery, and drug screening. In this study, we developed a machine learning model to predict retention times and facilitate lipid peak annotations in LC-MS-based untargeted lipidomics. Our model achieved high correlation coefficients of 0.998 and 0.990, with mean absolute errors (MAE) of 0.107 min and 0.240 min for the training and test sets, respectively. External validation showed similarly strong performance, with correlations of 0.991 and 0.978, and MAE values of 0.241 min and 0.270 min. We also compared the impact of molecular descriptors and molecular fingerprints on the model's performance, finding that molecular descriptors outperformed molecular fingerprints across all datasets when using Random Forest (RF) for model construction. Notably, this retention time calibration model demonstrates robust performance across chromatographic systems with comparable gradients and flow rates. Overall, this machine learning model enhances lipid annotation accuracy and reduces errors in untargeted lipidomics, improving data analysis across multiple datasets.

摘要

使用液相色谱-串联质谱(LC-MS/MS)对非靶向脂质组学中的峰进行全面鉴定仍然是一项重大挑战。通过整合基于高精度机器学习的保留时间预测模型,可以大大提高脂质注释的可信度。这种方法能够鉴定脂质,以了解致病机制、发现生物标志物和进行药物筛选。在本研究中,我们开发了一种机器学习模型来预测保留时间,并促进基于LC-MS的非靶向脂质组学中的脂质峰注释。我们的模型在训练集和测试集上分别取得了0.998和0.990的高相关系数,平均绝对误差(MAE)分别为0.107分钟和0.240分钟。外部验证显示出同样强大的性能,相关系数分别为0.991和0.978,MAE值分别为0.241分钟和0.270分钟。我们还比较了分子描述符和分子指纹对模型性能的影响,发现在使用随机森林(RF)构建模型时,分子描述符在所有数据集中的表现均优于分子指纹。值得注意的是,这种保留时间校准模型在具有可比梯度和流速的色谱系统中表现出稳健的性能。总体而言,这种机器学习模型提高了非靶向脂质组学中脂质注释的准确性,减少了误差,改善了多个数据集的数据分析。

相似文献

1
Enhancing lipid identification in LC-HRMS data through machine learning-based retention time prediction.通过基于机器学习的保留时间预测增强液相色谱-高分辨质谱数据中的脂质鉴定
J Chromatogr A. 2025 Feb 8;1742:465650. doi: 10.1016/j.chroma.2024.465650. Epub 2025 Jan 3.
2
Retip: Retention Time Prediction for Compound Annotation in Untargeted Metabolomics.提示:用于无靶标代谢组学中化合物注释的保留时间预测。
Anal Chem. 2020 Jun 2;92(11):7515-7522. doi: 10.1021/acs.analchem.9b05765. Epub 2020 May 21.
3
Mobile Phase Contaminants Affect Neutral Lipid Analysis in LC-MS-Based Lipidomics Studies.流动相污染物影响基于液相色谱-质谱联用的脂质组学研究中的中性脂质分析。
J Am Soc Mass Spectrom. 2024 Dec 4;35(12):3095-3106. doi: 10.1021/jasms.4c00320. Epub 2024 Oct 7.
4
Optimization of a liquid chromatography-ion mobility-high resolution mass spectrometry platform for untargeted lipidomics and application to HepaRG cell extracts.优化液相色谱-离子淌度-高分辨质谱联用平台进行非靶向脂质组学分析及其在 HepaRG 细胞提取物中的应用。
Talanta. 2021 Dec 1;235:122808. doi: 10.1016/j.talanta.2021.122808. Epub 2021 Aug 17.
5
Using Data-Dependent and -Independent Hybrid Acquisitions for Fast Liquid Chromatography-Based Untargeted Lipidomics.基于数据依赖和独立混合采集的快速液相色谱非靶向脂质组学分析。
Anal Chem. 2024 Jan 23;96(3):991-996. doi: 10.1021/acs.analchem.3c04400. Epub 2024 Jan 11.
6
Recommendations for Accurate Lipid Annotation and Semi-absolute Quantification from LC-MS/MS Datasets.推荐使用 LC-MS/MS 数据集进行准确的脂质注释和半绝对定量。
Methods Mol Biol. 2025;2855:269-287. doi: 10.1007/978-1-0716-4116-3_16.
7
High-Resolution Liquid Chromatography-Mass Spectrometry for Lipidomics.用于脂质组学的高分辨率液相色谱-质谱联用技术
Methods Mol Biol. 2023;2625:57-63. doi: 10.1007/978-1-0716-2966-6_4.
8
LipidMS: An R Package for Lipid Annotation in Untargeted Liquid Chromatography-Data Independent Acquisition-Mass Spectrometry Lipidomics.脂质 MS:用于非靶向液相色谱-数据非依赖性采集-质谱脂质组学中脂质注释的 R 包。
Anal Chem. 2019 Jan 2;91(1):836-845. doi: 10.1021/acs.analchem.8b03409. Epub 2018 Dec 13.
9
Retention Time Prediction Improves Identification in Nontargeted Lipidomics Approaches.保留时间预测可改善非靶向脂质组学方法中的鉴定效果。
Anal Chem. 2015 Aug 4;87(15):7698-704. doi: 10.1021/acs.analchem.5b01139. Epub 2015 Jul 22.
10
Enhanced lipidomics workflows for plasma and extracellular vesicles through advanced liquid chromatography-tandem mass spectrometry integrated.通过先进的液相色谱-串联质谱联用技术实现的针对血浆和细胞外囊泡的增强脂质组学工作流程。
Talanta. 2025 Aug 15;291:127847. doi: 10.1016/j.talanta.2025.127847. Epub 2025 Mar 1.