• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估图神经网络在预测碰撞横截面方面的通用性。

Evaluating the generalizability of graph neural networks for predicting collision cross section.

作者信息

Engler Hart Chloe, Preto António José, Chanana Shaurya, Healey David, Kind Tobias, Domingo-Fernández Daniel

机构信息

Enveda Biosciences, Inc., 5700 Flatiron Pkwy, Boulder, CO, 80301, USA.

出版信息

J Cheminform. 2024 Aug 29;16(1):105. doi: 10.1186/s13321-024-00899-w.

DOI:10.1186/s13321-024-00899-w
PMID:39210378
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11363525/
Abstract

Ion Mobility coupled with Mass Spectrometry (IM-MS) is a promising analytical technique that enhances molecular characterization by measuring collision cross-section (CCS) values, which are indicative of the molecular size and shape. However, the effective application of CCS values in structural analysis is still constrained by the limited availability of experimental data, necessitating the development of accurate machine learning (ML) models for in silico predictions. In this study, we evaluated state-of-the-art Graph Neural Networks (GNNs), trained to predict CCS values using the largest publicly available dataset to date. Although our results confirm the high accuracy of these models within chemical spaces similar to their training environments, their performance significantly declines when applied to structurally novel regions. This discrepancy raises concerns about the reliability of in silico CCS predictions and underscores the need for releasing further publicly available CCS datasets. To mitigate this, we introduce Mol2CCS which demonstrates how generalization can be partially improved by extending models to account for additional features such as molecular fingerprints, descriptors, and the molecule types. Lastly, we also show how confidence models can support by enhancing the reliability of the CCS estimates.Scientific contributionWe have benchmarked state-of-the-art graph neural networks for predicting collision cross section. Our work highlights the accuracy of these models when trained and predicted in similar chemical spaces, but also how their accuracy drops when evaluated in structurally novel regions. Lastly, we conclude by presenting potential approaches to mitigate this issue.

摘要

离子淌度与质谱联用(IM-MS)是一种很有前景的分析技术,它通过测量碰撞截面(CCS)值来增强分子表征,而碰撞截面值能反映分子的大小和形状。然而,CCS值在结构分析中的有效应用仍受到实验数据有限的限制,因此有必要开发准确的机器学习(ML)模型用于计算机模拟预测。在本研究中,我们评估了先进的图神经网络(GNN),这些网络使用迄今为止最大的公开可用数据集进行训练以预测CCS值。尽管我们的结果证实了这些模型在与其训练环境相似的化学空间内具有较高的准确性,但当应用于结构新颖的区域时,它们的性能会显著下降。这种差异引发了对计算机模拟CCS预测可靠性的担忧,并强调了发布更多公开可用CCS数据集的必要性。为了缓解这一问题,我们引入了Mol2CCS,它展示了如何通过扩展模型以纳入分子指纹、描述符和分子类型等附加特征来部分提高泛化能力。最后,我们还展示了置信模型如何通过提高CCS估计的可靠性来提供支持。

科学贡献

我们对用于预测碰撞截面的先进图神经网络进行了基准测试。我们的工作突出了这些模型在相似化学空间中训练和预测时的准确性,但也展示了在结构新颖的区域进行评估时其准确性是如何下降的。最后,我们通过提出缓解此问题的潜在方法来得出结论。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/3564b26803b4/13321_2024_899_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/20a2fe5e1fe4/13321_2024_899_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/8e7c1d560129/13321_2024_899_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/24b6952e5677/13321_2024_899_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/5705ccd2c493/13321_2024_899_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/3564b26803b4/13321_2024_899_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/20a2fe5e1fe4/13321_2024_899_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/8e7c1d560129/13321_2024_899_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/24b6952e5677/13321_2024_899_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/5705ccd2c493/13321_2024_899_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e92/11363525/3564b26803b4/13321_2024_899_Fig5_HTML.jpg

相似文献

1
Evaluating the generalizability of graph neural networks for predicting collision cross section.评估图神经网络在预测碰撞横截面方面的通用性。
J Cheminform. 2024 Aug 29;16(1):105. doi: 10.1186/s13321-024-00899-w.
2
Prediction of collision cross section and retention time for broad scope screening in gradient reversed-phase liquid chromatography-ion mobility-high resolution accurate mass spectrometry.梯度反相液相色谱-离子淌度-高分辨率精确质谱法中用于广泛范围筛查的碰撞截面和保留时间预测
J Chromatogr A. 2018 Mar 23;1542:82-88. doi: 10.1016/j.chroma.2018.02.025. Epub 2018 Feb 15.
3
Highly accurate and large-scale collision cross sections prediction with graph neural networks.利用图神经网络进行高精度大规模碰撞截面预测。
Commun Chem. 2023 Jul 4;6(1):139. doi: 10.1038/s42004-023-00939-w.
4
Predicting Collision Cross-Section Values for Small Molecules through Chemical Class-Based Multimodal Graph Attention Network.通过基于化学类别的多模态图注意网络预测小分子的碰撞截面值。
J Chem Inf Model. 2024 Aug 26;64(16):6305-6315. doi: 10.1021/acs.jcim.3c01934. Epub 2024 Jul 3.
5
Collision Cross Section Calculations to Aid Metabolite Annotation.用于辅助代谢物注释的碰撞截面计算。
J Am Soc Mass Spectrom. 2022 May 4;33(5):750-759. doi: 10.1021/jasms.1c00315. Epub 2022 Apr 4.
6
Predicting the Predicted: A Comparison of Machine Learning-Based Collision Cross-Section Prediction Models for Small Molecules.预测中的预测:小分子的基于机器学习的碰撞截面预测模型的比较。
Anal Chem. 2024 Jun 4;96(22):9088-9096. doi: 10.1021/acs.analchem.4c00630. Epub 2024 May 24.
7
Accurate Prediction of Ion Mobility Collision Cross-Section Using Ion's Polarizability and Molecular Mass with Limited Data.利用离子极化率和分子量并基于有限数据准确预测离子迁移率碰撞截面
J Chem Inf Model. 2024 Mar 11;64(5):1533-1542. doi: 10.1021/acs.jcim.3c01491. Epub 2024 Feb 23.
8
Breaking Down Structural Diversity for Comprehensive Prediction of Ion-Neutral Collision Cross Sections.打破结构多样性,全面预测离子-中性碰撞截面。
Anal Chem. 2020 Mar 17;92(6):4548-4557. doi: 10.1021/acs.analchem.9b05772. Epub 2020 Mar 6.
9
[Applications of ion mobility-mass spectrometry in the chemical analysis in traditional Chinese medicines].离子淌度-质谱联用技术在中药化学分析中的应用
Se Pu. 2022 Sep;40(9):782-787. doi: 10.3724/SP.J.1123.2022.01028.
10
Large-Scale Prediction of Collision Cross-Section Values for Metabolites in Ion Mobility-Mass Spectrometry.大规模预测代谢物在离子淌度-质谱中的碰撞截面值。
Anal Chem. 2016 Nov 15;88(22):11084-11091. doi: 10.1021/acs.analchem.6b03091. Epub 2016 Nov 1.

引用本文的文献

1
Defining the limits of plant chemical space: challenges and estimations.界定植物化学空间的界限:挑战与评估
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf033.

本文引用的文献

1
METLIN-CCS Lipid Database: An authentic standards resource for lipid classification and identification.METLIN-CCS脂质数据库:用于脂质分类和鉴定的权威标准资源。
Nat Metab. 2024 Jun;6(6):981-982. doi: 10.1038/s42255-024-01058-z.
2
RT-Transformer: retention time prediction for metabolite annotation to assist in metabolite identification.RT-Transformer:用于代谢物注释的保留时间预测,以辅助代谢物鉴定。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae084.
3
METLIN-CCS: an ion mobility spectrometry collision cross section database.
METLIN-CCS:一个离子淌度谱碰撞截面数据库。
Nat Methods. 2023 Dec;20(12):1836-1837. doi: 10.1038/s41592-023-02078-5.
4
AllCCS2: Curation of Ion Mobility Collision Cross-Section Atlas for Small Molecules Using Comprehensive Molecular Representations.所有 CCS2:使用综合分子表示对小分子的离子淌度碰撞截面图谱进行编目。
Anal Chem. 2023 Sep 19;95(37):13913-13921. doi: 10.1021/acs.analchem.3c02267. Epub 2023 Sep 4.
5
Highly accurate and large-scale collision cross sections prediction with graph neural networks.利用图神经网络进行高精度大规模碰撞截面预测。
Commun Chem. 2023 Jul 4;6(1):139. doi: 10.1038/s42004-023-00939-w.
6
Collision Cross Section Prediction Based on Machine Learning.基于机器学习的碰撞截面预测。
Molecules. 2023 May 12;28(10):4050. doi: 10.3390/molecules28104050.
7
CCS Predictor 2.0: An Open-Source Jupyter Notebook Tool for Filtering Out False Positives in Metabolomics.CCS 预测器 2.0:用于代谢组学中过滤假阳性的开源 Jupyter 笔记本工具。
Anal Chem. 2022 Dec 20;94(50):17456-17466. doi: 10.1021/acs.analchem.2c03491. Epub 2022 Dec 6.
8
DrugTax: package for drug taxonomy identification and explainable feature extraction.DrugTax:用于药物分类识别和可解释特征提取的软件包。
J Cheminform. 2022 Oct 27;14(1):73. doi: 10.1186/s13321-022-00649-w.
9
Collision Cross Section Calculations to Aid Metabolite Annotation.用于辅助代谢物注释的碰撞截面计算。
J Am Soc Mass Spectrom. 2022 May 4;33(5):750-759. doi: 10.1021/jasms.1c00315. Epub 2022 Apr 4.
10
Adduct annotation in liquid chromatography/high-resolution mass spectrometry to enhance compound identification.在液相色谱/高分辨率质谱中进行加合物注释以增强化合物鉴定。
Anal Bioanal Chem. 2021 Jan;413(2):503-517. doi: 10.1007/s00216-020-03019-3. Epub 2020 Oct 29.