Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China.
School of Computer Engineering, Jiangsu University of Technology, Changzhou, China.
Chem Biol Drug Des. 2023 Jan;101(1):52-68. doi: 10.1111/cbdd.14121. Epub 2022 Jul 28.
Polycyclic aromatic hydrocarbons (PAHs), a special class of persistent organic pollutants (POPs) with two or more aromatic rings, have received extensive attention owing to their carcinogenic, mutagenic, and teratogenic effects. Quantitative structure-property relationship (QSPR) is powerful chemometric method to correlate structural descriptors of PAHs with their physicochemical properties. In this manuscript, a QSPR study of PAHs was performed to predict their boiling point (bp), octanol-water partition coefficient (LogK ), and retention time index (RI). In addition to traditional molecular descriptors, structural fingerprints play an important role in the correlation of the above properties. Three regression methods, partial least squares (PLS), multiple linear regression (MLR), and genetic function approximation (GFA), were used to establish QSPR models for each property of PAHs. The correlation coefficient (R ) and root mean square error (RMSE) of best model were 0.980 and 24.39% (PLS), 0.979 and 35.80% (GFA), 0.926 and 22.90% (MLR) for bp, LogK and RI, respectively. The model proposed here can be used to estimate physicochemical properties and inform toxicity prediction of environmental chemicals.
多环芳烃(PAHs)是一类具有两个或更多芳环的特殊持久性有机污染物(POPs),由于其致癌、致突变和致畸作用而受到广泛关注。定量构效关系(QSPR)是一种强大的化学计量学方法,可将 PAHs 的结构描述符与其物理化学性质相关联。在本手稿中,对 PAHs 进行了 QSPR 研究,以预测其沸点(bp)、辛醇-水分配系数(LogK)和保留时间指数(RI)。除了传统的分子描述符外,结构指纹在上述性质的相关性中也起着重要作用。使用三种回归方法,即偏最小二乘法(PLS)、多元线性回归(MLR)和遗传函数逼近(GFA),为 PAHs 的每种性质建立了 QSPR 模型。最佳模型的相关系数(R)和均方根误差(RMSE)分别为 0.980 和 24.39%(PLS)、0.979 和 35.80%(GFA)、0.926 和 22.90%(MLR),用于 bp、LogK 和 RI。此处提出的模型可用于估计物理化学性质,并为环境化学物质的毒性预测提供信息。