• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

改进液相色谱-质谱联用中化合物适用性的预测以增强非靶向分析。

Improving predictions of compound amenability for liquid chromatography-mass spectrometry to enhance non-targeted analysis.

作者信息

Charest Nathaniel, Lowe Charles N, Ramsland Christian, Meyer Brian, Samano Vicente, Williams Antony J

机构信息

Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA.

Oakridge Associated Universities, Oak Ridge, Tennessee, 37830, USA.

出版信息

Anal Bioanal Chem. 2024 Apr;416(10):2565-2579. doi: 10.1007/s00216-024-05229-5. Epub 2024 Mar 26.

DOI:10.1007/s00216-024-05229-5
PMID:38530399
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11228616/
Abstract

Mass-spectrometry-based non-targeted analysis (NTA), in which mass spectrometric signals are assigned chemical identities based on a systematic collation of evidence, is a growing area of interest for toxicological risk assessment. Successful NTA results in better identification of potentially hazardous pollutants within the environment, facilitating the development of targeted analytical strategies to best characterize risks to human and ecological health. A supporting component of the NTA process involves assessing whether suspected chemicals are amenable to the mass spectrometric method, which is necessary in order to assign an observed signal to the chemical structure. Prior work from this group involved the development of a random forest model for predicting the amenability of 5517 unique chemical structures to liquid chromatography-mass spectrometry (LC-MS). This work improves the interpretability of the group's prior model of the same endpoint, as well as integrating 1348 more data points across negative and positive ionization modes. We enhance interpretability by feature engineering, a machine learning practice that reduces the input dimensionality while attempting to preserve performance statistics. We emphasize the importance of interpretable machine learning models within the context of building confidence in NTA identification. The novel data were curated by the labeling of compounds as amenable or unamenable by expert curators, resulting in an enhanced set of chemical compounds to expand the applicability domain of the prior model. The balanced accuracy benchmark of the newly developed model is comparable to performance previously reported (mean CV BA is 0.84 vs. 0.82 in positive mode, and 0.85 vs. 0.82 in negative mode), while on a novel external set, derived from this work's data, the Matthews correlation coefficients (MCC) for the novel models are 0.66 and 0.68 for positive and negative mode, respectively. Our group's prior published models scored MCC of 0.55 and 0.54 on the same external sets. This demonstrates appreciable improvement over the chemical space captured by the expanded dataset. This work forms part of our ongoing efforts to develop models with higher interpretability and higher performance to support NTA efforts.

摘要

基于质谱的非靶向分析(NTA)是毒理学风险评估中一个日益受到关注的领域,在该分析中,质谱信号会根据系统整理的证据被赋予化学身份。成功的NTA有助于更好地识别环境中潜在的有害污染物,推动制定有针对性的分析策略,以最佳方式描述对人类和生态健康的风险。NTA过程的一个辅助环节是评估可疑化学品是否适用于质谱方法,这对于将观察到的信号与化学结构进行关联是必要的。该团队之前的工作涉及开发一个随机森林模型,用于预测5517种独特化学结构对液相色谱 - 质谱联用(LC - MS)的适用性。这项工作提高了该团队之前相同终点模型的可解释性,同时在正负离子模式下整合了另外1348个数据点。我们通过特征工程提高可解释性,特征工程是一种机器学习实践,在试图保留性能统计数据的同时降低输入维度。我们强调在建立对NTA识别的信心的背景下,可解释机器学习模型的重要性。新数据由专家策展人将化合物标记为适用或不适用来整理,从而得到一组增强的化合物,以扩大先前模型的适用范围。新开发模型的平衡准确率基准与之前报告的性能相当(正模式下平均交叉验证平衡准确率为0.84对0.82,负模式下为0.85对0.82),而在源自这项工作数据的新外部数据集上,新模型的马修斯相关系数(MCC)在正模式和负模式下分别为0.66和0.68。该团队之前发表的模型在相同外部数据集上的MCC得分分别为0.55和0.54。这表明在扩展数据集所涵盖的化学空间方面有显著改进。这项工作是我们持续努力开发具有更高可解释性和更高性能的模型以支持NTA工作的一部分。

相似文献

1
Improving predictions of compound amenability for liquid chromatography-mass spectrometry to enhance non-targeted analysis.改进液相色谱-质谱联用中化合物适用性的预测以增强非靶向分析。
Anal Bioanal Chem. 2024 Apr;416(10):2565-2579. doi: 10.1007/s00216-024-05229-5. Epub 2024 Mar 26.
2
Predicting compound amenability with liquid chromatography-mass spectrometry to improve non-targeted analysis.利用液相色谱-质谱联用技术预测化合物的可分析性,以改进非靶向分析。
Anal Bioanal Chem. 2021 Dec;413(30):7495-7508. doi: 10.1007/s00216-021-03713-w. Epub 2021 Oct 14.
3
Filling the Gaps in PFAS Detection: Integrating GC-MS Non-Targeted Analysis for Comprehensive Environmental Monitoring and Exposure Assessment.填补全氟和多氟烷基物质(PFAS)检测的空白:整合气相色谱-质谱联用(GC-MS)非靶向分析用于全面的环境监测和暴露评估。
Environ Sci Technol Lett. 2025 Jan 23;12(2):1-9. doi: 10.1021/acs.estlett.4c00930.
4
Non-targeted analysis (NTA) and suspect screening analysis (SSA): a review of examining the chemical exposome.非靶向分析(NTA)和可疑筛选分析(SSA):考察化学暴露组的研究综述。
J Expo Sci Environ Epidemiol. 2023 Jul;33(4):524-536. doi: 10.1038/s41370-023-00574-6. Epub 2023 Jun 28.
5
Exploring chemical space in non-targeted analysis: a proposed ChemSpace tool.探索非靶向分析中的化学空间:一个提议的 ChemSpace 工具。
Anal Bioanal Chem. 2023 Jan;415(1):35-44. doi: 10.1007/s00216-022-04434-4. Epub 2022 Nov 26.
6
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
7
Applications of Machine Learning to In Silico Quantification of Chemicals without Analytical Standards.机器学习在无需分析标准品的化学品计算机定量分析中的应用。
J Chem Inf Model. 2020 Jun 22;60(6):2718-2727. doi: 10.1021/acs.jcim.9b01096. Epub 2020 May 20.
8
A comparison of three liquid chromatography (LC) retention time prediction models.三种液相色谱(LC)保留时间预测模型的比较。
Talanta. 2018 May 15;182:371-379. doi: 10.1016/j.talanta.2018.01.022. Epub 2018 Jan 11.
9
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
10
Identifying xenobiotic metabolites with prediction tools and LCMS suspect screening analysis.使用预测工具和液相色谱-质谱联用可疑物筛查分析来鉴定外源性代谢物。
Front Toxicol. 2023 Jan 18;5:1051483. doi: 10.3389/ftox.2023.1051483. eCollection 2023.

引用本文的文献

1
Practical Guidance on Selecting Analytical Methods for PFAS in Semiconductor Manufacturing Wastewater.半导体制造废水中全氟和多氟烷基物质(PFAS)分析方法选择的实用指南。
ACS Meas Sci Au. 2025 Jun 20;5(4):399-423. doi: 10.1021/acsmeasuresciau.5c00021. eCollection 2025 Aug 20.
2
Reference library for suspect screening of environmental toxicants using ion mobility spectrometry-mass spectrometry.使用离子迁移谱-质谱联用技术进行环境毒物可疑筛查的参考文库
Commun Chem. 2025 Aug 1;8(1):224. doi: 10.1038/s42004-025-01619-7.
3
Filling the Gaps in PFAS Detection: Integrating GC-MS Non-Targeted Analysis for Comprehensive Environmental Monitoring and Exposure Assessment.填补全氟和多氟烷基物质(PFAS)检测的空白:整合气相色谱-质谱联用(GC-MS)非靶向分析用于全面的环境监测和暴露评估。
Environ Sci Technol Lett. 2025 Jan 23;12(2):1-9. doi: 10.1021/acs.estlett.4c00930.
4
Reference Library for Suspect Non-targeted Screening of Environmental Toxicants Using Ion Mobility Spectrometry-Mass Spectrometry.使用离子迁移谱-质谱法进行可疑环境毒物非靶向筛查的参考库
bioRxiv. 2025 Feb 27:2025.02.22.639656. doi: 10.1101/2025.02.22.639656.
5
Analytical Quality Evaluation of the Tox21 Compound Library.Tox21化合物库的分析质量评估。
Chem Res Toxicol. 2025 Jan 20;38(1):15-41. doi: 10.1021/acs.chemrestox.4c00330. Epub 2024 Dec 31.

本文引用的文献

1
Transparency in Modeling through Careful Application of OECD's QSAR/QSPR Principles via a Curated Water Solubility Data Set.通过精心应用经合组织的 QSAR/QSPR 原则并通过精心制作的水溶性数据集实现建模透明度。
Chem Res Toxicol. 2023 Mar 20;36(3):465-478. doi: 10.1021/acs.chemrestox.2c00379. Epub 2023 Mar 6.
2
Predicting compound amenability with liquid chromatography-mass spectrometry to improve non-targeted analysis.利用液相色谱-质谱联用技术预测化合物的可分析性,以改进非靶向分析。
Anal Bioanal Chem. 2021 Dec;413(30):7495-7508. doi: 10.1007/s00216-021-03713-w. Epub 2021 Oct 14.
3
A Framework for Utilizing High-Resolution Mass Spectrometry and Nontargeted Analysis in Rapid Response and Emergency Situations.利用高分辨率质谱和非靶向分析在快速反应和紧急情况下的框架。
Environ Toxicol Chem. 2022 May;41(5):1117-1130. doi: 10.1002/etc.5196. Epub 2021 Nov 18.
4
30 Years of research on ESI/MS response: Trends, contradictions and applications.ESI/MS 响应研究 30 年:趋势、矛盾与应用。
Anal Chim Acta. 2021 Apr 1;1152:238117. doi: 10.1016/j.aca.2020.11.049. Epub 2020 Dec 1.
5
The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation.在二分类混淆矩阵评估中,马修斯相关系数(MCC)比平衡准确率、庄家知情度和标记度更可靠。
BioData Min. 2021 Feb 4;14(1):13. doi: 10.1186/s13040-021-00244-z.
6
The Tox21 10K Compound Library: Collaborative Chemistry Advancing Toxicology.Tox21 十库化合物库:协作化学推动毒理学发展。
Chem Res Toxicol. 2021 Feb 15;34(2):189-216. doi: 10.1021/acs.chemrestox.0c00264. Epub 2020 Nov 3.
7
Revisiting Five Years of CASMI Contests with EPA Identification Tools.利用EPA识别工具回顾五年的CASMI竞赛
Metabolites. 2020 Jun 23;10(6):260. doi: 10.3390/metabo10060260.
8
Examining NTA performance and potential using fortified and reference house dust as part of EPA's Non-Targeted Analysis Collaborative Trial (ENTACT).使用强化和参考室内灰尘作为 EPA 的非靶向分析协作试验 (ENTACT) 的一部分来检验 NTA 的性能和潜力。
Anal Bioanal Chem. 2020 Jul;412(18):4221-4233. doi: 10.1007/s00216-020-02658-w. Epub 2020 Apr 25.
9
SciPy 1.0: fundamental algorithms for scientific computing in Python.SciPy 1.0:Python 中的科学计算基础算法。
Nat Methods. 2020 Mar;17(3):261-272. doi: 10.1038/s41592-019-0686-2. Epub 2020 Feb 3.
10
Using prepared mixtures of ToxCast chemicals to evaluate non-targeted analysis (NTA) method performance.使用 ToxCast 化学物质的预混物评估非靶向分析(NTA)方法的性能。
Anal Bioanal Chem. 2019 Feb;411(4):835-851. doi: 10.1007/s00216-018-1526-4. Epub 2019 Jan 5.