Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Hobart, Australia.
Pfizer Global Research and Development, Sandwich, UK.
Electrophoresis. 2019 Sep;40(18-19):2415-2419. doi: 10.1002/elps.201900022. Epub 2019 Apr 29.
The hydrophobic subtraction model (HSM) combined with quantitative structure-retention relationships (QSRR) methodology was utilized to predict retention times in reversed-phase liquid chromatography (RPLC). A selection of new analytes and new RPLC columns that had never been used in the QSRR modeling process were used to verify the proposed approach. This work is designed to facilitate early prediction of co-elution of analytes in pharmaceutical drug discovery applications where it is advantageous to predict whether impurities might be co-eluted with the active drug component. The QSRR models were constructed through partial least squares regression combined with a genetic algorithm (GA-PLS) which was employed as a feature selection method to choose the most informative molecular descriptors calculated using VolSurf+ software. The analyte hydrophobicity coefficient of the HSM was predicted for subsequent calculation of retention. Clustering approaches based on the local compound type and the local second dominant interaction were investigated to select the most appropriate training set of analytes from a larger database. Predicted retention times of five new compounds on five new RPLC C18 columns were compared with their measured retention times with percentage root-mean-square errors of 15.4 and 24.7 for the local compound type and local second dominant interaction clustering methods, respectively.
采用疏水删减模型(HSM)结合定量结构-保留关系(QSRR)方法预测反相液相色谱(RPLC)中的保留时间。选择了从未用于 QSRR 建模过程的新分析物和新的 RPLC 柱来验证所提出的方法。这项工作旨在促进药物发现应用中分析物共洗脱的早期预测,在这些应用中,预测杂质是否可能与活性药物成分共洗脱是有利的。通过偏最小二乘回归结合遗传算法(GA-PLS)构建 QSRR 模型,该算法用作特征选择方法,选择使用 VolSurf+软件计算的最具信息量的分子描述符。预测 HSM 的分析物疏水性系数,以便随后计算保留时间。研究了基于局部化合物类型和局部第二主导相互作用的聚类方法,以从更大的数据库中选择最合适的分析物训练集。将五种新化合物在五种新的 RPLC C18 柱上的预测保留时间与实测保留时间进行比较,局部化合物类型和局部第二主导相互作用聚类方法的均方根误差百分比分别为 15.4%和 24.7%。