Park Soo Hyun, Haddad Paul R, Talebi Mohammad, Tyteca Eva, Amos Ruth I J, Szucs Roman, Dolan John W, Pohl Christopher A
Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart, 7001, Australia.
Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart, 7001, Australia.
J Chromatogr A. 2017 Feb 24;1486:68-75. doi: 10.1016/j.chroma.2016.12.048. Epub 2016 Dec 19.
Quantitative Structure-Retention Relationships (QSRRs) represent a popular technique to predict the retention times of analytes, based on molecular descriptors encoding the chemical structures of the analytes. The linear solvent strength (LSS) model relating the retention factor, k to the eluent concentration (log k=a-blog [eluent]), is a well-known and accurate retention model in ion chromatography (IC). In this work, QSRRs for inorganic and small organic anions were used to predict the regression parameters a and b in the LSS model (and hence retention times) for these analytes under a wide range of eluent conditions, based solely on their chemical structures. This approach was performed on retention data of inorganic and small organic anions from the "Virtual Column" software (Thermo Fisher Scientific). These retention data were recalibrated via a "porting" methodology on three columns (AS20, AS19, and AS11HC), prior to the QSRR modeling. This provided retention data more applicable on recently produced columns which may exhibit changes of column behavior due to batch-to-batch variability. Molecular descriptors for the analytes were calculated with Dragon software using the geometry-optimized molecular structures, employing the AM1 semi-empirical method. An optimal subset of molecular descriptors was then selected using an evolutionary algorithm (EA). Finally, the QSRR models were generated by multiple linear regression (MLR). As a result, six QSRR models with good predictive performance were successfully derived for a- and b-values on three columns (R>0.98 and RMSE<0.11). External validation showed the possibility of using the developed QSRR models as predictive tools in IC (Q>0.7 and RMSEP<0.4). Moreover, it was demonstrated that the obtained QSRR models for the a- and b-values can predict the retention times for new analytes with good accuracy and predictability (R of 0.98, RMSE of 0.89min, Q of 0.96 and RMSEP of 1.18min).
定量结构-保留关系(QSRRs)是一种基于编码分析物化学结构的分子描述符来预测分析物保留时间的常用技术。将保留因子k与洗脱液浓度相关联的线性溶剂强度(LSS)模型(log k = a - blog[洗脱液])是离子色谱(IC)中一种著名且准确的保留模型。在本工作中,基于无机和小有机阴离子的化学结构,使用QSRRs来预测这些分析物在广泛洗脱液条件下LSS模型中的回归参数a和b(进而预测保留时间)。该方法是根据来自“虚拟柱”软件(赛默飞世尔科技公司)的无机和小有机阴离子的保留数据进行的。在进行QSRR建模之前,通过“移植”方法在三根柱(AS20、AS19和AS11HC)上对这些保留数据进行了重新校准。这提供了更适用于最近生产的柱的保留数据,这些柱可能因批次间的差异而表现出柱行为的变化。使用Dragon软件,采用AM1半经验方法,根据几何优化的分子结构计算分析物的分子描述符。然后使用进化算法(EA)选择分子描述符的最佳子集。最后,通过多元线性回归(MLR)生成QSRR模型。结果,成功地为三根柱上的a值和b值导出了六个具有良好预测性能的QSRR模型(R>0.98且RMSE<0.11)。外部验证表明,所开发的QSRR模型有可能作为IC中的预测工具(Q>0.7且RMSEP<0.4)。此外,还证明了所获得的a值和b值的QSRR模型能够以良好的准确性和可预测性预测新分析物的保留时间(R为0.98,RMSE为0.89分钟,Q为0.96,RMSEP为1.18分钟)。