• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的保留时间预测在 LC-HRMS 中农药和农药转化产物可疑筛查中的评估与应用。

Evaluation and application of machine learning-based retention time prediction for suspect screening of pesticides and pesticide transformation products in LC-HRMS.

机构信息

Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China.

Shanghai Changning Center for Disease Control and Prevention, Shanghai, 200051, China.

出版信息

Chemosphere. 2021 May;271:129447. doi: 10.1016/j.chemosphere.2020.129447. Epub 2020 Dec 27.

DOI:10.1016/j.chemosphere.2020.129447
PMID:33476874
Abstract

Computational QSAR models have gradually been preferred for retention time prediction in data mining of emerging environmental contaminants using liquid chromatography coupled with mass spectrometry. Generally, the model performance relies on the components such as machine learning algorithms, chemical features, and example data. In this study, we evaluated the performances of four algorithms on three feature sets, using 321 and 77 pesticides as the training and validation sets, respectively. The results were varied with different combinations of algorithms on distinct feature sets. Two strategies including enhancing the complexity of chemical features and enlarging the size of the training set were proved to improve the results. XGBoost, Random Forest, and lightGBM algorithms exhibited the best results when built on a large-scale chemical descriptors, while the Keras algorithm preferred fingerprints. These four models have comparable prediction accuracies that at least 90% of pesticides in validation set can be successfully predicted with ΔRT <1.0 min. Meanwhile, a blended prediction strategy using average results from four models presented a better result than any single model. This strategy was used for assisting identification of pesticides and pesticide transformation products in 120 strawberry samples from a national survey of food contamination. Twenty pesticides and twelve pesticide transformation products were tentatively identified, where all pesticides and two pesticide transformation products (bifenazate diazene and spirotetramat-enol) were confirmed by standard materials. The outcome of this study suggested that retention time prediction is a valuable approach in compound identification when integrated with in silico MS spectra and other MS identification strategies.

摘要

计算定量构效关系模型在使用液相色谱-质谱联用技术对新兴环境污染物的数据挖掘中逐渐受到青睐,用于预测保留时间。通常,模型性能依赖于机器学习算法、化学特征和示例数据等成分。在这项研究中,我们使用 321 种和 77 种农药分别作为训练集和验证集,评估了四种算法在三个特征集上的性能。结果因不同算法和不同特征集的组合而有所不同。两种策略,包括增强化学特征的复杂性和扩大训练集的大小,被证明可以提高结果。当基于大规模化学描述符构建时,XGBoost、随机森林和轻量级梯度提升机算法表现出最佳结果,而 Keras 算法则偏好指纹。这四个模型的预测准确性相当,至少有 90%的验证集中的农药可以成功预测,ΔRT<1.0min。同时,使用四个模型的平均结果进行混合预测策略的结果优于任何单个模型。该策略用于协助鉴定 120 个草莓样本中的农药和农药转化产物,这些样本来自全国食品污染调查。共鉴定出 20 种农药和 12 种农药转化产物,其中所有农药和两种农药转化产物(双苯氟脲二氮和螺虫乙酯-烯醇)均通过标准物质得到确认。这项研究的结果表明,保留时间预测是一种有价值的化合物鉴定方法,当与计算机 MS 谱和其他 MS 鉴定策略相结合时。

相似文献

1
Evaluation and application of machine learning-based retention time prediction for suspect screening of pesticides and pesticide transformation products in LC-HRMS.基于机器学习的保留时间预测在 LC-HRMS 中农药和农药转化产物可疑筛查中的评估与应用。
Chemosphere. 2021 May;271:129447. doi: 10.1016/j.chemosphere.2020.129447. Epub 2020 Dec 27.
2
Prediction of pesticide retention time in reversed-phase liquid chromatography using quantitative-structure retention relationship models: A comparative study of seven molecular descriptors datasets.采用定量结构保留关系模型预测反相液相色谱中的农药保留时间:七种分子描述符数据集的比较研究。
Chemosphere. 2021 Jul;275:130036. doi: 10.1016/j.chemosphere.2021.130036. Epub 2021 Mar 1.
3
Enhanced database creation with in silico workflows for suspect screening of unknown tebuconazole transformation products in environmental samples by UHPLC-HRMS.通过 UHPLC-HRMS 对环境样品中未知戊唑醇转化产物进行可疑筛选的计算机模拟工作流程,实现增强型数据库创建。
J Hazard Mater. 2022 Oct 15;440:129706. doi: 10.1016/j.jhazmat.2022.129706. Epub 2022 Aug 1.
4
Non-targeted screening of pesticides for food analysis using liquid chromatography high-resolution mass spectrometry-a review.利用液相色谱-高分辨质谱进行食品分析的非靶向农药筛查——综述
Food Addit Contam Part A Chem Anal Control Expo Risk Assess. 2020 Jul;37(7):1180-1201. doi: 10.1080/19440049.2020.1753890. Epub 2020 May 18.
5
Screening and identification of unknown chemical contaminants in food based on liquid chromatography-high-resolution mass spectrometry and machine learning.基于液相色谱-高分辨质谱联用技术和机器学习筛选和鉴定食品中未知化学污染物。
Anal Chim Acta. 2024 Jan 25;1287:342116. doi: 10.1016/j.aca.2023.342116. Epub 2023 Dec 8.
6
New relevant pesticide transformation products in groundwater detected using target and suspect screening for agricultural and urban micropollutants with LC-HRMS.利用 LC-HRMS 对农业和城市微污染物进行目标和可疑筛查,在地下水中检测到新的相关农药转化产物。
Water Res. 2019 Nov 15;165:114972. doi: 10.1016/j.watres.2019.114972. Epub 2019 Aug 14.
7
Retip: Retention Time Prediction for Compound Annotation in Untargeted Metabolomics.提示:用于无靶标代谢组学中化合物注释的保留时间预测。
Anal Chem. 2020 Jun 2;92(11):7515-7522. doi: 10.1021/acs.analchem.9b05765. Epub 2020 May 21.
8
Critical evaluation of a simple retention time predictor based on LogKow as a complementary tool in the identification of emerging contaminants in water.基于 LogKow 的简单保留时间预测器的批判性评价,作为水中新兴污染物识别的辅助工具。
Talanta. 2015 Jul 1;139:143-9. doi: 10.1016/j.talanta.2015.02.055. Epub 2015 Mar 9.
9
Occurrence of pesticide residues and transformation products in different types of dietary supplements.不同类型膳食补充剂中农药残留及转化产物的存在情况。
Food Addit Contam Part A Chem Anal Control Expo Risk Assess. 2015;32(6):849-56. doi: 10.1080/19440049.2015.1028481. Epub 2015 Apr 14.
10
UV-MALDI mass spectrometric quantitation of uracil based pesticides in fruit soft drinks along with matrix effects evaluation.利用紫外光电离-基质辅助激光解吸/电离质谱法定量分析水果软饮料中的尿嘧啶基农药,并评估基质效应。
Ecotoxicol Environ Saf. 2014 Feb;100:233-41. doi: 10.1016/j.ecoenv.2013.08.005. Epub 2013 Sep 6.

引用本文的文献

1
Machine Learning for Enhanced Identification Probability in RPLC/HRMS Nontargeted Workflows.用于提高反相液相色谱/高分辨率质谱非靶向工作流程中鉴定概率的机器学习
Anal Chem. 2025 Aug 26;97(33):18028-18035. doi: 10.1021/acs.analchem.5c01873. Epub 2025 Aug 12.
2
Critical review on in silico methods for structural annotation of chemicals detected with LC/HRMS non-targeted screening.关于液相色谱/高分辨质谱非靶向筛查检测到的化学物质结构注释的计算机模拟方法的批判性综述。
Anal Bioanal Chem. 2025 Jan;417(3):473-493. doi: 10.1007/s00216-024-05471-x. Epub 2024 Aug 14.
3
Systematic approaches to machine learning models for predicting pesticide toxicity.
用于预测农药毒性的机器学习模型的系统方法。
Heliyon. 2024 Mar 25;10(7):e28752. doi: 10.1016/j.heliyon.2024.e28752. eCollection 2024 Apr 15.
4
RT-Transformer: retention time prediction for metabolite annotation to assist in metabolite identification.RT-Transformer:用于代谢物注释的保留时间预测,以辅助代谢物鉴定。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae084.
5
Changes in Microbial Diversity, Soil Function, and Plant Biomass of Cotton Rhizosphere Soil Under the Influence of Chlorpyrifos.氯吡硫磷对棉花根际土壤微生物多样性、土壤功能和植物生物量的影响。
Curr Microbiol. 2022 Sep 20;79(11):323. doi: 10.1007/s00284-022-03015-z.
6
Developments in high-resolution mass spectrometric analyses of new psychoactive substances.新型精神活性物质的高分辨率质谱分析进展。
Arch Toxicol. 2022 Apr;96(4):949-967. doi: 10.1007/s00204-022-03224-2. Epub 2022 Feb 9.
7
Pure Ion Chromatograms Combined with Advanced Machine Learning Methods Improve Accuracy of Discriminant Models in LC-MS-Based Untargeted Metabolomics.纯离子色谱图结合先进的机器学习方法提高了基于 LC-MS 的非靶向代谢组学中判别模型的准确性。
Molecules. 2021 May 5;26(9):2715. doi: 10.3390/molecules26092715.