用于小分子保留时间预测的深度图卷积网络。

Deep graph convolutional network for small-molecule retention time prediction.

机构信息

School of Engineering, Westlake University, Hangzhou, Zhejiang, 310024, China.

School of Computer Science and Engineering, Southeast University, Nanjing, Jiangsu, 210096, China.

出版信息

J Chromatogr A. 2023 Nov 22;1711:464439. doi: 10.1016/j.chroma.2023.464439. Epub 2023 Oct 13.

DOI:10.1016/j.chroma.2023.464439

PMID:37865024

Abstract

The retention time (RT) is a crucial source of data for liquid chromatography-mass spectrometry (LCMS). A model that can accurately predict the RT for each molecule would empower filtering candidates with similar spectra but differing RT in LCMS-based molecule identification. Recent research shows that graph neural networks (GNNs) outperform traditional machine learning algorithms in RT prediction. However, all of these models use relatively shallow GNNs. This study for the first time investigates how depth affects GNNs' performance on RT prediction. The results demonstrate that a notable improvement can be achieved by pushing the depth of GNNs to 16 layers by the adoption of residual connection. Additionally, we also find that graph convolutional network (GCN) model benefits from the edge information. The developed deep graph convolutional network, DeepGCN-RT, significantly outperforms the previous state-of-the-art method and achieves the lowest mean absolute percentage error (MAPE) of 3.3% and the lowest mean absolute error (MAE) of 26.55 s on the SMRT test set. We also finetune DeepGCN-RT on seven datasets with various chromatographic conditions. The mean MAE of the seven datasets largely decreases 30% compared to previous state-of-the-art method. On the RIKEN-PlaSMA dataset, we also test the effectiveness of DeepGCN-RT in assisting molecular structure identification. By 30% lessening the number of potential structures, DeepGCN-RT is able to improve top-1 accuracy by about 11%.

摘要

保留时间 (RT) 是液相色谱-质谱 (LCMS) 的重要数据来源。如果有一种模型能够准确预测每个分子的 RT，那么在基于 LCMS 的分子识别中，就可以对具有相似光谱但 RT 不同的候选物进行过滤。最近的研究表明，图神经网络 (GNN) 在 RT 预测方面优于传统的机器学习算法。然而，所有这些模型都使用相对较浅的 GNN。本研究首次探讨了深度如何影响 GNN 在 RT 预测中的性能。结果表明，通过采用残差连接将 GNN 的深度推至 16 层，可以显著提高性能。此外，我们还发现图卷积网络 (GCN) 模型受益于边缘信息。所开发的深度图卷积网络 DeepGCN-RT 显著优于先前的最先进方法，在 SMRT 测试集上实现了最低的平均绝对百分比误差 (MAPE) 3.3%和最低的平均绝对误差 (MAE) 26.55 秒。我们还在具有各种色谱条件的七个数据集上微调了 DeepGCN-RT。与先前的最先进方法相比，这七个数据集的平均 MAE 大大降低了 30%。在 RIKEN-PlaSMA 数据集上，我们还测试了 DeepGCN-RT 在辅助分子结构识别方面的有效性。通过将潜在结构的数量减少 30%，DeepGCN-RT 能够将准确率提高约 11%。

相似文献

Deep graph convolutional network for small-molecule retention time prediction.用于小分子保留时间预测的深度图卷积网络。

J Chromatogr A. 2023 Nov 22;1711:464439. doi: 10.1016/j.chroma.2023.464439. Epub 2023 Oct 13.

Retention time prediction in hydrophilic interaction liquid chromatography with graph neural network and transfer learning.基于图神经网络和迁移学习的亲水相互作用液相色谱保留时间预测。

J Chromatogr A. 2021 Oct 25;1656:462536. doi: 10.1016/j.chroma.2021.462536. Epub 2021 Sep 7.

Prediction of Liquid Chromatographic Retention Time with Graph Neural Networks to Assist in Small Molecule Identification.利用图神经网络预测液相色谱保留时间以辅助小分子鉴定

Anal Chem. 2021 Feb 2;93(4):2200-2206. doi: 10.1021/acs.analchem.0c04071. Epub 2021 Jan 7.

Dual-channel deep graph convolutional neural networks.双通道深度图卷积神经网络

Front Artif Intell. 2024 Apr 4;7:1290491. doi: 10.3389/frai.2024.1290491. eCollection 2024.

Deep Neural Network Pretrained by Weighted Autoencoders and Transfer Learning for Retention Time Prediction of Small Molecules.基于加权自动编码器和迁移学习的深度神经网络用于小分子保留时间预测。

Anal Chem. 2021 Nov 30;93(47):15651-15658. doi: 10.1021/acs.analchem.1c03250. Epub 2021 Nov 15.

RT-Transformer: retention time prediction for metabolite annotation to assist in metabolite identification.RT-Transformer：用于代谢物注释的保留时间预测，以辅助代谢物鉴定。

Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae084.

Deep learning for retention time prediction in reversed-phase liquid chromatography.基于深度学习的反相液相色谱保留时间预测。

J Chromatogr A. 2022 Feb 8;1664:462792. doi: 10.1016/j.chroma.2021.462792. Epub 2021 Dec 30.

Retention time prediction for small samples based on integrating molecular representations and adaptive network.基于分子表示和自适应网络集成的小样本保留时间预测。

J Chromatogr B Analyt Technol Biomed Life Sci. 2023 Feb 15;1217:123624. doi: 10.1016/j.jchromb.2023.123624. Epub 2023 Feb 4.

Costless Performance Improvement in Machine Learning for Graph-Based Molecular Analysis.基于图的分子分析机器学习中的无成本性能改进。

J Chem Inf Model. 2020 Mar 23;60(3):1137-1145. doi: 10.1021/acs.jcim.9b00816. Epub 2020 Jan 28.

A Comprehensive Survey on Graph Neural Networks.图神经网络综述。

IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.

引用本文的文献

Development of an Efficient and Generalized MTSCAM Model to Predict Liquid Chromatography Retention Times of Organic Compounds.开发一种高效通用的MTSCAM模型以预测有机化合物的液相色谱保留时间。

Research (Wash D C). 2025 Feb 7;8:0607. doi: 10.34133/research.0607. eCollection 2025.

Performance and robustness of small molecule retention time prediction with molecular graph neural networks in industrial drug discovery campaigns.小分子保留时间预测的分子图神经网络在工业药物发现中的性能和稳健性。

Sci Rep. 2024 Apr 16;14(1):8733. doi: 10.1038/s41598-024-59620-4.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于小分子保留时间预测的深度图卷积网络。

Deep graph convolutional network for small-molecule retention time prediction.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献