• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习在无需分析标准品的化学品计算机定量分析中的应用。

Applications of Machine Learning to In Silico Quantification of Chemicals without Analytical Standards.

作者信息

Abrahamsson Dimitri, Park June-Soo, Singh Randolph R, Sirota Marina, Woodruff Tracey J

机构信息

Program on Reproductive Health and the Environment, Department of Obstetrics and Gynecology, University of California, San Francisco, California 94158, United States.

Environmental Chemistry Laboratory, California Department of Toxic Substances Control, Berkeley, California 94710, United States.

出版信息

J Chem Inf Model. 2020 Jun 22;60(6):2718-2727. doi: 10.1021/acs.jcim.9b01096. Epub 2020 May 20.

DOI:10.1021/acs.jcim.9b01096
PMID:32379974
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7328371/
Abstract

Non-targeted analysis provides a comprehensive approach to analyze environmental and biological samples for nearly all chemicals present. One of the main shortcomings of current analytical methods and workflows is that they are unable to provide any quantitative information constituting an important obstacle in understanding environmental fate and human exposure. Herein, we present an in silico quantification method using mahine-learning for chemicals analyzed using electrospray ionization (ESI). We considered three data sets from different instrumental setups: (i) capillary electrophoresis electrospray ionization-mass spectrometry (CE-MS) in positive ionization mode (ESI+), (ii) liquid chromatography quadrupole time-of-flight mass spectrometry (LC-QTOF/MS) in ESI+ and (iii) LC-QTOF/MS in negative ionization mode (ESI-). We developed and applied two different machine-learning algorithms: a random forest (RF) and an artificial neural network (ANN) to predict the relative response factors (RRFs) of different chemicals based on their physicochemical properties. Chemical concentrations can then be calculated by dividing the measured abundance of a chemical, as peak area or peak height, by its corresponding RRF. We evaluated our models and tested their predictive power using 5-fold cross-validation (CV) and  randomization. Both the RF and the ANN models showed great promise in predicting RRFs. However, the accuracy of the predictions was dependent on the data set composition and the experimental setup. For the CE-MS ESI+ data set, the best model predicted measured RRFs with a mean absolute error (MAE) of 0.19 log units and a cross-validation coefficient of determination () of 0.84 for the testing set. For the LC-QTOF/MS ESI+ data set, the best model predicted measured RRFs with an MAE of 0.32 and a of 0.40. For the LC-QTOF/MS ESI- data set, the best model predicted measured RRFs with a MAE of 0.50 and a of 0.20. Our findings suggest that machine-learning algorithms can be used for predicting concentrations of nontargeted chemicals with reasonable uncertainties, especially in ESI+, while the application on ESI- remains a more challenging problem.

摘要

非靶向分析提供了一种全面的方法,可用于分析环境和生物样品中几乎所有存在的化学物质。当前分析方法和工作流程的主要缺点之一是它们无法提供任何定量信息,这构成了理解环境归宿和人类暴露情况的一个重要障碍。在此,我们提出了一种基于机器学习的计算机模拟定量方法,用于对采用电喷雾电离(ESI)分析的化学物质进行定量。我们考虑了来自不同仪器设置的三个数据集:(i)正离子模式(ESI+)下的毛细管电泳电喷雾电离质谱(CE-MS),(ii)ESI+模式下的液相色谱四极杆飞行时间质谱(LC-QTOF/MS),以及(iii)负离子模式(ESI-)下的LC-QTOF/MS。我们开发并应用了两种不同的机器学习算法:随机森林(RF)和人工神经网络(ANN),以根据不同化学物质的物理化学性质预测其相对响应因子(RRF)。然后,可以通过将化学物质的测量丰度(以峰面积或峰高表示)除以其相应的RRF来计算化学物质的浓度。我们使用五折交叉验证(CV)和随机化评估了我们的模型并测试了它们的预测能力。RF模型和ANN模型在预测RRF方面都显示出了很大的潜力。然而,预测的准确性取决于数据集的组成和实验设置。对于CE-MS ESI+数据集,最佳模型预测的测量RRF的平均绝对误差(MAE)为0.19对数单位,测试集的交叉验证决定系数()为0.84。对于LC-QTOF/MS ESI+数据集,最佳模型预测的测量RRF的MAE为0.32,为0.40。对于LC-QTOF/MS ESI-数据集,最佳模型预测的测量RRF的MAE为0.50,为0.20。我们的研究结果表明,机器学习算法可用于以合理的不确定性预测非靶向化学物质的浓度,尤其是在ESI+模式下,而在ESI-模式下的应用仍然是一个更具挑战性的问题。

相似文献

1
Applications of Machine Learning to In Silico Quantification of Chemicals without Analytical Standards.机器学习在无需分析标准品的化学品计算机定量分析中的应用。
J Chem Inf Model. 2020 Jun 22;60(6):2718-2727. doi: 10.1021/acs.jcim.9b01096. Epub 2020 May 20.
2
Virtual quantification of metabolites by capillary electrophoresis-electrospray ionization-mass spectrometry: predicting ionization efficiency without chemical standards.毛细管电泳-电喷雾电离质谱法对代谢物的虚拟定量:无需化学标准品预测电离效率
Anal Chem. 2009 Apr 1;81(7):2506-15. doi: 10.1021/ac802272u.
3
Capillary electrophoresis-mass spectrometry determination of morphine and its isobaric glucuronide metabolites.毛细管电泳-质谱法测定吗啡及其等压葡萄糖醛酸代谢物
J Chromatogr B Analyt Technol Biomed Life Sci. 2015 Feb 1;980:65-71. doi: 10.1016/j.jchromb.2014.11.035. Epub 2014 Dec 19.
4
Modeling the relative response factor of small molecules in positive electrospray ionization.正电喷雾电离中小分子相对响应因子的建模
RSC Adv. 2024 Nov 22;14(50):37470-37482. doi: 10.1039/d4ra06695b. eCollection 2024 Nov 19.
5
Electrospray Ionization Efficiency Predictions and Analytical Standard Free Quantification for SFC/ESI/HRMS.电喷雾电离效率预测及 SFC/ESI/HRMS 的分析标准无内标定量。
J Am Soc Mass Spectrom. 2023 Jul 5;34(7):1511-1518. doi: 10.1021/jasms.3c00156. Epub 2023 Jun 26.
6
Forced degradation and impurity profiling: recent trends in analytical perspectives.强制降解和杂质剖析:分析视角的最新趋势。
J Pharm Biomed Anal. 2013 Dec;86:11-35. doi: 10.1016/j.jpba.2013.07.013. Epub 2013 Jul 31.
7
Qualitative and quantitative analysis of poly(amidoamine) dendrimers in an aqueous matrix by liquid chromatography-electrospray ionization-hybrid quadrupole/time-of-flight mass spectrometry (LC-ESI-QTOF-MS).采用液相色谱-电喷雾电离-混合四极杆/飞行时间质谱法(LC-ESI-QTOF-MS)对水基中的聚(酰胺-胺)树突进行定性和定量分析。
Anal Bioanal Chem. 2013 Jul;405(18):5901-14. doi: 10.1007/s00216-012-6673-4. Epub 2013 Jan 16.
8
Application of a simultaneous screening method for the detection of new psychoactive substances in various matrix samples using liquid chromatography/electrospray ionization tandem mass spectrometry and liquid chromatography/quadrupole time-of-flight mass spectrometry.应用液相色谱/电喷雾串联质谱和液相色谱/四极杆飞行时间质谱同时检测各种基质中新精神活性物质的筛选方法。
Rapid Commun Mass Spectrom. 2021 May 30;35(10):e9067. doi: 10.1002/rcm.9067.
9
Optimization and evaluation of a sheathless capillary electrophoresis-electrospray ionization mass spectrometry platform for peptide analysis: comparison to liquid chromatography-electrospray ionization mass spectrometry.鞘液免进毛细管电泳-电喷雾电离质谱平台用于肽分析的优化与评估:与液相色谱-电喷雾电离质谱的比较。
Anal Chem. 2011 Oct 1;83(19):7297-305. doi: 10.1021/ac2010372. Epub 2011 Sep 1.
10
Coupling of gas chromatography and electrospray ionization high resolution mass spectrometry for the analysis of anabolic steroids as trimethylsilyl derivatives in human urine.采用气相色谱-电喷雾电离高分辨质谱联用技术分析人尿液中作为三甲基硅烷衍生物的合成代谢类固醇。
Anal Chim Acta. 2017 Apr 29;964:123-133. doi: 10.1016/j.aca.2017.01.058. Epub 2017 Feb 1.

引用本文的文献

1
Toward Machine Learning Electrospray Ionization Sensitivity Prediction for Semiquantitative Lipidomics in Stem Cells.迈向用于干细胞半定量脂质组学的机器学习电喷雾电离灵敏度预测
J Chem Inf Model. 2025 Feb 24;65(4):1826-1836. doi: 10.1021/acs.jcim.4c02040. Epub 2025 Feb 5.
2
Modeling the relative response factor of small molecules in positive electrospray ionization.正电喷雾电离中小分子相对响应因子的建模
RSC Adv. 2024 Nov 22;14(50):37470-37482. doi: 10.1039/d4ra06695b. eCollection 2024 Nov 19.
3
A Semi-Quantitative Approach to Nontarget Compositional Analysis of Complex Samples.一种用于复杂样品非目标成分分析的半定量方法。
Anal Chem. 2024 Nov 19;96(46):18349-18358. doi: 10.1021/acs.analchem.4c00819. Epub 2024 Nov 7.
4
Quantification Approaches in Non-Target LC/ESI/HRMS Analysis: An Interlaboratory Comparison.非靶向液相色谱/电喷雾电离/高分辨质谱分析中的定量方法:一项实验室间比较
Anal Chem. 2024 Oct 15;96(41):16215-16226. doi: 10.1021/acs.analchem.4c02902. Epub 2024 Oct 1.
5
Mass Balance in Pharmaceutical Stress Testing: A Review of Principles and Practical Applications.药物强化试验中的质量平衡:原理与实际应用述评。
AAPS J. 2024 Aug 22;26(5):96. doi: 10.1208/s12248-024-00961-3.
6
Progress, applications, and challenges in high-throughput effect-directed analysis for toxicity driver identification - is it time for HT-EDA?用于毒性驱动因素识别的高通量效应导向分析的进展、应用及挑战——是时候采用高通量效应导向分析了吗?
Anal Bioanal Chem. 2025 Jan;417(3):451-472. doi: 10.1007/s00216-024-05424-4. Epub 2024 Jul 12.
7
(Non)targeted Chemical Analysis and Risk Assessment of Organic Contaminants in Darkibor Kale Grown at Rural and Urban Farms.(非)靶向化学分析和城乡农场种植的 Darkibor Kale 中有机污染物的风险评估。
Environ Sci Technol. 2024 Feb 27;58(8):3690-3701. doi: 10.1021/acs.est.3c09106. Epub 2024 Feb 13.
8
Quantification of chemicals in non-targeted analysis without analytical standards - Understanding the mechanism of electrospray ionization and making predictions.无分析标准品情况下非靶向分析中化学物质的定量——理解电喷雾电离机制并进行预测
Curr Opin Environ Sci Health. 2024 Feb;37. doi: 10.1016/j.coesh.2023.100529. Epub 2023 Dec 19.
9
Quantitative Nontarget Analysis of CECs in Environmental Samples Can Be Improved by Considering All Mass Adducts.通过考虑所有质量加合物可改进环境样品中循环内皮细胞的定量非靶向分析。
Anal Chem. 2024 Jan 9;96(1):229-237. doi: 10.1021/acs.analchem.3c03791. Epub 2023 Dec 21.
10
Bypassing the Identification: MS2Quant for Concentration Estimations of Chemicals Detected with Nontarget LC-HRMS from MS Data.绕过鉴定:从 MS 数据中利用非靶向 LC-HRMS 检测到的化学品进行浓度估计的 MS2Quant。
Anal Chem. 2023 Aug 22;95(33):12329-12338. doi: 10.1021/acs.analchem.3c01744. Epub 2023 Aug 7.

本文引用的文献

1
Strategies for Drawing Quantitative Conclusions from Nontargeted Liquid Chromatography-High-Resolution Mass Spectrometry Analysis.从非靶向液相色谱-高分辨率质谱分析中得出定量结论的策略。
Anal Chem. 2020 Apr 7;92(7):4691-4699. doi: 10.1021/acs.analchem.9b03481. Epub 2020 Mar 23.
2
Bayesian machine learning for quantum molecular dynamics.用于量子分子动力学的贝叶斯机器学习
Phys Chem Chem Phys. 2019 Jun 26;21(25):13392-13410. doi: 10.1039/c9cp01883b.
3
Using prepared mixtures of ToxCast chemicals to evaluate non-targeted analysis (NTA) method performance.使用 ToxCast 化学物质的预混物评估非靶向分析(NTA)方法的性能。
Anal Bioanal Chem. 2019 Feb;411(4):835-851. doi: 10.1007/s00216-018-1526-4. Epub 2019 Jan 5.
4
Machine learning in chemoinformatics and drug discovery.机器学习在化学生信学和药物发现中的应用。
Drug Discov Today. 2018 Aug;23(8):1538-1546. doi: 10.1016/j.drudis.2018.05.010. Epub 2018 May 8.
5
Household Dust as a Repository of Chemical Accumulation: New Insights from a Comprehensive High-Resolution Mass Spectrometric Study.家庭灰尘作为化学物质积累的储存库:一项全面的高分辨率质谱研究的新见解。
Environ Sci Technol. 2018 Mar 6;52(5):2878-2887. doi: 10.1021/acs.est.7b05767. Epub 2018 Feb 13.
6
Mordred: a molecular descriptor calculator.莫德雷德:一种分子描述符计算器。
J Cheminform. 2018 Feb 6;10(1):4. doi: 10.1186/s13321-018-0258-y.
7
Suspect Screening Analysis of Chemicals in Consumer Products.消费品中化学物质的可疑筛选分析。
Environ Sci Technol. 2018 Mar 6;52(5):3125-3135. doi: 10.1021/acs.est.7b04781. Epub 2018 Feb 26.
8
A framework to estimate concentrations of potentially unknown substances by semi-quantification in liquid chromatography electrospray ionization mass spectrometry.一种通过液相色谱-电喷雾电离质谱半定量估算潜在未知物质浓度的框架。
Anal Chim Acta. 2017 Jul 4;975:30-41. doi: 10.1016/j.aca.2017.03.054. Epub 2017 Apr 10.
9
Linking high resolution mass spectrometry data with exposure and toxicity forecasts to advance high-throughput environmental monitoring.将高分辨率质谱数据与暴露和毒性预测相联系,以推进高通量环境监测。
Environ Int. 2016 Mar;88:269-280. doi: 10.1016/j.envint.2015.12.008. Epub 2016 Jan 23.
10
Organic Carbon/Water and Dissolved Organic Carbon/Water Partitioning of Cyclic Volatile Methylsiloxanes: Measurements and Polyparameter Linear Free Energy Relationships.环状挥发性甲基硅氧烷的有机碳/水和溶解有机碳/水分配:测量和多参数线性自由能关系。
Environ Sci Technol. 2015 Oct 20;49(20):12161-8. doi: 10.1021/acs.est.5b02483. Epub 2015 Sep 28.