Dipartimento di Chimica, Ingegneria Chimica e Materiali, Università degli Studi dell'Aquila, L'Aquila, Italy.
J Chromatogr A. 2011 Dec 2;1218(48):8679-90. doi: 10.1016/j.chroma.2011.09.071. Epub 2011 Oct 1.
In this paper, we build a multiple-column retention model able to predict the behaviour of polychlorinated biphenyls (PCBs) in capillary gas-chromatography (GC) within a wide range of separation conditions. To this end, GC retention is related to both chemical structure of PCBs, encoded by selected theoretical molecular descriptors, and the kind of stationary phase, represented by the relative retention time (RRT) of a suitable small number of analytes. The model was generated using the retention data of 70 PCBs extracted from the pool of the 209 possible congeners collected on 17 different capillary columns featured by non-polar or moderately polar stationary phases, reported in the literature. Multilinear regression combined with genetic algorithm variable selection was preliminarily applied to generate a four-dimensional quantitative structure-retention relationship (QSRR) for each of the 17 columns, based on theoretical molecular descriptors extracted from the large set provided by the software Dragon. 33 molecular descriptors obtained by merging the non-common descriptors of various single-column QSRRs, combined with RRTs values of the less and the most retained PCB, were considered as the starting independent variables of the multiple-column retention model. A multi-layer artificial neural network (ANN), optimised on a validation set extracted from the calibration data, was applied to generate the multi-column retention model. The influence of starting inputs on the network output was evaluated by a sensitivity analysis and model complexity was reduced through a step-wise elimination of redundant molecular descriptors, while RRTs of further PCBs were included to improve description of the stationary phase. Nine molecular descriptors and RRTs of eight selected PCBs are considered as the independent variables of the final ANN-based model, whose predictive performance was tested on the 139 PCBs excluded from calibration and on six external columns and/or temperature programs.
在本文中,我们构建了一个多柱保留模型,能够在广泛的分离条件下预测多氯联苯(PCBs)在毛细管气相色谱(GC)中的行为。为此,GC 保留与 PCB 的化学结构有关,由选定的理论分子描述符编码,以及固定相的种类,由适当数量的分析物的相对保留时间(RRT)表示。该模型是使用从文献中收集的 209 种可能同系物中提取的 70 种 PCB 的保留数据生成的,这些同系物分别在 17 种不同的毛细管柱上进行了实验,这些柱子具有非极性或中等极性的固定相。多元线性回归结合遗传算法变量选择被初步应用于为每根柱子生成一个四元定量结构保留关系(QSRR),该关系基于从 Dragon 软件提供的大型分子描述符集提取的理论分子描述符。将各种单柱 QSRR 的非公共描述符合并得到的 33 个分子描述符,再加上保留程度最小和最大的 PCB 的 RRT 值,被视为多柱保留模型的初始独立变量。经过优化的多层人工神经网络(ANN)应用于从校准数据中提取的验证集,生成多柱保留模型。通过敏感性分析评估起始输入对网络输出的影响,并通过逐步消除冗余分子描述符来降低模型的复杂性,同时将更多 PCB 的 RRT 值纳入其中,以改善对固定相的描述。9 个分子描述符和 8 种选定 PCB 的 RRT 值被视为最终基于 ANN 的模型的独立变量,该模型的预测性能在从校准数据中排除的 139 种 PCB 和 6 种外部柱子和/或温度程序上进行了测试。