Department of Chemistry, Gustavus Adolphus College, 800 W. College Ave, Saint Peter, MN 56082, USA.
Department of Chemistry, Gustavus Adolphus College, 800 W. College Ave, Saint Peter, MN 56082, USA.
J Chromatogr A. 2021 Jan 11;1636:461682. doi: 10.1016/j.chroma.2020.461682. Epub 2020 Nov 5.
The hydrophobic subtraction model (HSM) for characterizing the selectivity of reversed-phase liquid chromatography (LC) columns has been used extensively by the LC community since it was first developed in 2002. Continuing interest in the model is due in part to the large, publicly available set of column descriptors that has been assembled over the past 18 years. In the work described in this report, we sought to refine the HSM with the goal of improving the predictive accuracy of the model without compromising its physico-chemical interpretability. The approach taken here has the following facets. A set of retention measurements for 635 columns and the 16 probe solutes used to characterize new columns using the HSM was assembled. Principal components analysis (PCA) was used as a guide for the development of a refined version of the HSM. Several outlying columns (84) were eliminated from the analysis because they were either inconsistent with the PCA model or were outliers from the original HSM model. With the retention dataset for the 16 probe solutes on the remaining 551 columns, we determined that a six-component model is the most sophisticated form of the model that can be used without overfitting the data. In our refined version of the HSM, the S*σ term has been removed. Two new terms have been added, which more accurately account for the molecular volume of the solute (Vv), and the solute dipolarity (Dd), and the remaining terms have been adjusted to accommodate these changes. The refined model described here provides improved prediction of retention factors, with the model standard error being reduced from 1.0 for the original HSM to 0.35 for the refined model (16 solutes, 551 columns). Furthermore, the number of retention factors with errors greater than 10% are reduced from 231 to 25. A revised metric for column similarity, F, is also proposed as a part of this work.
自 2002 年首次开发以来,反相液相色谱(LC)柱选择性特征描述的疏水删减模型(HSM)已被 LC 界广泛使用。该模型之所以继续受到关注,部分原因是过去 18 年来已积累了大量公开的柱描述符。在本报告所描述的工作中,我们试图改进 HSM,目的是在不影响模型物理化学可解释性的前提下提高模型的预测准确性。所采用的方法具有以下几个方面。我们收集了 635 根色谱柱和用于用 HSM 表征新柱的 16 种探针溶质的保留测量值。主成分分析(PCA)被用作开发 HSM 改进版本的指南。由于与 PCA 模型不一致或与原始 HSM 模型不一致,有 84 根离群柱(outlying columns)被排除在分析之外。对于剩余的 551 根柱上的 16 种探针溶质保留数据集,我们确定六分量模型是最复杂的模型形式,可以在不过度拟合数据的情况下使用。在我们改进的 HSM 版本中,已删除 S*σ项。增加了两个新术语,更准确地描述了溶质的分子体积(Vv)和溶质的偶极矩(Dd),并调整了其余术语以适应这些变化。这里描述的改进模型提供了改进的保留因子预测,模型标准误差从原始 HSM 的 1.0 降低到改进模型的 0.35(16 种溶质,551 根柱)。此外,具有大于 10%误差的保留因子数量从 231 减少到 25。作为这项工作的一部分,还提出了一个新的柱相似性度量 F。