通过集成机器学习驱动的定量结构保留关系进行离子色谱中的梯度保留时间建模

Gradient Retention Time Modeling in Ion Chromatography through Ensemble Machine Learning-Powered Quantitative Structure-Retention Relationships.

作者信息

Lim Zhen Jia, Žuvela Petar, Ukić Šime, Novak Stankov Mirjana, Bolanča Tomislav, Lovrić Mario, Wong Ming Wah, Buszewski Bogusław

机构信息

Department of Chemistry, National University of Singapore, 3 Science Drive 3, Singapore 117543, Singapore.

Department of Analytical Chemistry, Faculty of Chemical Engineering and Technology, University of Zagreb, Marulićev trg 19, Zagreb 10000, Croatia.

出版信息

ACS Omega. 2025 Feb 4;10(6):5993-6002. doi: 10.1021/acsomega.4c09868. eCollection 2025 Feb 18.

DOI:10.1021/acsomega.4c09868

PMID:39989812

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11840597/

Abstract

Quantitative structure-retention relationships (QSRRs) have been a popular modeling approach in ion chromatography to predict retention time from molecular structures. It is often coupled with solvent strength models to extend it to other isocratic chromatographic conditions. While this approach has achieved reasonable success, potential inconsistencies from the solvent strength model may propagate to the QSRR models, thereby amplifying their errors. In this work, we aim to incorporate information on the isocratic conditions directly into the QSRR model to reduce error propagation and build global models. Four machine learning approaches that can account for both global and local sources of variability in chromatographic retention, random forest regression, gradient boosting regression (GBR), extreme gradient boosting (xgBoost), and adaptive boosting (AdaBoost), were evaluated and compared. The partial least-squares model was built as a baseline to compare against. GBR and xgBoost have shown superior predictive ability among the evaluated models with root-mean-square errors (RMSEs) of isocratic retention of 0.025 (+0.009, -0.006) and 0.025 (+0.008, -0.006), respectively. Developed QSRR models were further incorporated into the isocratic-to-gradient model to predict gradient retention. GBR and xgBoost QSRR models have outperformed the other models with RMSEs of gradient retention of 0.358 (+0.199, -0.107) and 0.385 (+0.387, -0.139) min, respectively. Such an approach demonstrates the benefits of incorporating the eluent composition into prediction models, with the potential to extend to other chromatographic techniques.

摘要

定量结构保留关系（QSRRs）一直是离子色谱中一种流行的建模方法，用于从分子结构预测保留时间。它通常与溶剂强度模型相结合，以将其扩展到其他等度色谱条件。虽然这种方法取得了一定的成功，但溶剂强度模型潜在的不一致性可能会传播到QSRR模型中，从而放大其误差。在这项工作中，我们旨在将等度条件的信息直接纳入QSRR模型，以减少误差传播并建立全局模型。评估并比较了四种能够考虑色谱保留中全局和局部变异性来源的机器学习方法，即随机森林回归、梯度提升回归（GBR）、极端梯度提升（xgBoost）和自适应提升（AdaBoost）。构建了偏最小二乘模型作为基线进行比较。在评估的模型中，GBR和xgBoost表现出卓越的预测能力，等度保留的均方根误差（RMSEs）分别为0.025（+0.009，-0.006）和0.025（+0.008，-0.006）。开发的QSRR模型进一步纳入等度-梯度模型以预测梯度保留。GBR和xgBoost QSRR模型分别以0.358（+0.199，-0.107）和0.385（+0.387，-0.139）分钟的梯度保留RMSEs优于其他模型。这种方法展示了将洗脱液组成纳入预测模型的好处，有可能扩展到其他色谱技术。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccec/11840597/4de1c3aadc49/ao4c09868_0001.jpg

相似文献

Gradient Retention Time Modeling in Ion Chromatography through Ensemble Machine Learning-Powered Quantitative Structure-Retention Relationships.

ACS Omega. 2025 Feb 4;10(6):5993-6002. doi: 10.1021/acsomega.4c09868. eCollection 2025 Feb 18.

Cross-column density functional theory-based quantitative structure-retention relationship model development powered by machine learning.

Anal Bioanal Chem. 2024 May;416(12):2951-2968. doi: 10.1007/s00216-024-05243-7. Epub 2024 Mar 20.

Towards a chromatographic similarity index to establish localised quantitative structure-retention relationships for retention prediction. II Use of Tanimoto similarity index in ion chromatography.

J Chromatogr A. 2017 Nov 10;1523:173-182. doi: 10.1016/j.chroma.2017.02.054. Epub 2017 Feb 24.

[Construction of a machine learning ensemble prediction model for gas chromatographic retention index on stationary phases with different polarities].

Se Pu. 2025 Apr 8;43(4):355-362. doi: 10.3724/SP.J.1123.2024.07014.

Retention prediction of low molecular weight anions in ion chromatography based on quantitative structure-retention relationships applied to the linear solvent strength model.

J Chromatogr A. 2017 Feb 24;1486:68-75. doi: 10.1016/j.chroma.2016.12.048. Epub 2016 Dec 19.

Enhancing the Predictive Performance of Molecularly Imprinted Polymer-Based Electrochemical Sensors Using a Stacking Regressor Ensemble of Machine Learning Models.

ACS Sens. 2025 Apr 25;10(4):3123-3133. doi: 10.1021/acssensors.5c00364. Epub 2025 Apr 17.

Towards a chromatographic similarity index to establish localised Quantitative Structure-Retention Relationships for retention prediction. III Combination of Tanimoto similarity index, logP, and retention factor ratio to identify optimal analyte training sets for ion chromatography.

J Chromatogr A. 2017 Oct 20;1520:107-116. doi: 10.1016/j.chroma.2017.09.016. Epub 2017 Sep 7.

Quantitative structure retention relationship (QSRR) modelling for Analytes' retention prediction in LC-HRMS by applying different Machine Learning algorithms and evaluating their performance.

J Chromatogr B Analyt Technol Biomed Life Sci. 2022 Feb 15;1191:123132. doi: 10.1016/j.jchromb.2022.123132. Epub 2022 Jan 19.

Combination of linear solvent strength model and quantitative structure-retention relationships as a comprehensive procedure of approximate prediction of retention in gradient liquid chromatography.

J Chromatogr A. 2002 Jul 12;962(1-2):41-55. doi: 10.1016/s0021-9673(02)00557-5.

Exploring Ensemble Learning Techniques for Infant Mortality Prediction: A Technical Analysis of XGBoost Stacking AdaBoost and Bagging Models.

Birth Defects Res. 2025 Feb;117(2):e2443. doi: 10.1002/bdr2.2443.

本文引用的文献

Cross-column density functional theory-based quantitative structure-retention relationship model development powered by machine learning.

Anal Bioanal Chem. 2024 May;416(12):2951-2968. doi: 10.1007/s00216-024-05243-7. Epub 2024 Mar 20.

Mechanistic Chromatographic Column Characterization for the Analysis of Flavonoids Using Quantitative Structure-Retention Relationships Based on Density Functional Theory.

Int J Mol Sci. 2020 Mar 17;21(6):2053. doi: 10.3390/ijms21062053.

Column Characterization and Selection Systems in Reversed-Phase High-Performance Liquid Chromatography.

Chem Rev. 2019 Mar 27;119(6):3674-3729. doi: 10.1021/acs.chemrev.8b00246. Epub 2019 Jan 3.

Benchmarking of Computational Methods for Creation of Retention Models in Quantitative Structure-Retention Relationships Studies.

J Chem Inf Model. 2017 Nov 27;57(11):2754-2762. doi: 10.1021/acs.jcim.7b00346. Epub 2017 Oct 27.

Retention prediction of low molecular weight anions in ion chromatography based on quantitative structure-retention relationships applied to the linear solvent strength model.

J Chromatogr A. 2017 Feb 24;1486:68-75. doi: 10.1016/j.chroma.2016.12.048. Epub 2016 Dec 19.

Sum of ranking differences to rank stationary phases used in packed column supercritical fluid chromatography.

J Chromatogr A. 2015 Aug 28;1409:241-50. doi: 10.1016/j.chroma.2015.07.071. Epub 2015 Jul 20.

3D-MoRSE descriptors explained.

J Mol Graph Model. 2014 Nov;54:194-203. doi: 10.1016/j.jmgm.2014.10.006. Epub 2014 Nov 4.

Elimination of uninformative variables for multivariate calibration.

Anal Chem. 1996 Nov 1;68(21):3851-8. doi: 10.1021/ac960321m.

Recent developments and emerging directions in ion chromatography.

J Chromatogr A. 2008 Mar 14;1184(1-2):456-73. doi: 10.1016/j.chroma.2007.10.022. Epub 2007 Oct 12.

QSRR: quantitative structure-(chromatographic) retention relationships.

Chem Rev. 2007 Jul;107(7):3212-46. doi: 10.1021/cr068412z. Epub 2007 Jun 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过集成机器学习驱动的定量结构保留关系进行离子色谱中的梯度保留时间建模

Gradient Retention Time Modeling in Ion Chromatography through Ensemble Machine Learning-Powered Quantitative Structure-Retention Relationships.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献