Faculty of Chemistry, University of Mazandaran, Babolsar, Iran.
Mol Divers. 2009 Nov;13(4):483-91. doi: 10.1007/s11030-009-9135-y. Epub 2009 Mar 27.
Quantitative structure-property relationship models for the prediction of the nematic transition temperature (T (N)) were developed by using multilinear regression analysis and a feedforward artificial neural network (ANN). A collection of 42 thermotropic liquid crystals was chosen as the data set. The data set was divided into three sets: for training, and an internal and external test set. Training and internal test sets were used for ANN model development, and the external test set was used for evaluation of the predictive power of the model. In order to build the models, a set of six descriptors were selected by the best multilinear regression procedure of the CODESSA program. These descriptors were: atomic charge weighted partial negatively charged surface area, relative negative charged surface area, polarity parameter/square distance, minimum most negative atomic partial charge, molecular volume, and the A component of moment of inertia, which encode geometrical and electronic characteristics of molecules. These descriptors were used as inputs to ANN. The optimized ANN model had 6:6:1 topology. The standard errors in the calculation of T (N) for the training, internal, and external test sets using the ANN model were 1.012, 4.910, and 4.070, respectively. To further evaluate the ANN model, a crossvalidation test was performed, which produced the statistic Q (2) = 0.9796 and standard deviation of 2.67 based on predicted residual sum of square. Also, the diversity test was performed to ensure the model's stability and prove its predictive capability. The obtained results reveal the suitability of ANN for the prediction of T (N) for liquid crystals using molecular structural descriptors.
采用多元线性回归分析和前馈人工神经网络(ANN)建立了用于预测向列相转变温度(T(N))的定量构效关系模型。选择了 42 种热致液晶作为数据集。数据集分为三组:训练集、内部测试集和外部测试集。训练集和内部测试集用于 ANN 模型的开发,外部测试集用于评估模型的预测能力。为了构建模型,通过 CODESSA 程序的最佳多元线性回归程序选择了一组 6 个描述符。这些描述符是:原子电荷加权部分负表面面积、相对负表面面积、极性参数/平方距离、最小最负原子部分电荷、分子体积和惯性矩的 A 分量,它们编码分子的几何和电子特性。这些描述符用作 ANN 的输入。优化的 ANN 模型具有 6:6:1 的拓扑结构。使用 ANN 模型计算训练集、内部测试集和外部测试集的 T(N)的标准误差分别为 1.012、4.910 和 4.070。为了进一步评估 ANN 模型,进行了交叉验证测试,得到了基于预测残差平方和的统计量 Q(2)= 0.9796 和标准偏差为 2.67。此外,还进行了多样性测试,以确保模型的稳定性并证明其预测能力。所得结果表明,ANN 适用于使用分子结构描述符预测液晶的 T(N)。