Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria.
Department of Chemistry, Bauchi State University, Gadau, Nigeria.
SAR QSAR Environ Res. 2023 Apr;34(4):267-284. doi: 10.1080/1062936X.2023.2207039. Epub 2023 May 4.
Some adverse effects of hydroxylated polychlorinated biphenyls (OH-PCBs) in humans are presumed to be initiated via thyroid hormone receptor (TR) binding. Due to the trial-and-error approach adopted for OH-PCB selection in previous studies, experiments designed to test the TR binding hypothesis mostly utilized inactive OH-PCBs, leading to considerable waste of time, effort and other material resources. In this paper, linear discriminant analysis (LDA) and binary logistic regression (LR) were used to develop classification models to group OH-PCBs into active and inactive TR agonists using radial distribution function (RDF) descriptors as predictor variables. The classifications made by both LDA and LR models on the training set compounds resulted in an accuracy of 84.3%, sensitivity of 72.2% and specificity of 90.9%. The areas under the ROC curves, constructed with the training set data, were found to be 0.872 and 0.880 for LDA and LR models, respectively. External validation of the models revealed that 76.5% of the test set compounds were correctly classified by both LDA and LR models. These findings suggest that the two models reported in this paper are good and reliable for classifying OH-PCB congeners into active and inactive TR agonists.
一些羟基化多氯联苯(OH-PCBs)在人类身上的不良影响被认为是通过甲状腺激素受体(TR)结合引发的。由于之前的研究中采用了试错法来选择 OH-PCB,因此旨在测试 TR 结合假说的实验大多使用了无活性的 OH-PCBs,这导致了大量的时间、精力和其他物质资源的浪费。在本文中,线性判别分析(LDA)和二项逻辑回归(LR)被用于建立分类模型,使用径向分布函数(RDF)描述符作为预测变量,将 OH-PCBs 分为活性和非活性 TR 激动剂。LDA 和 LR 模型对训练集化合物的分类结果准确率为 84.3%,灵敏度为 72.2%,特异性为 90.9%。使用训练集数据构建的 ROC 曲线下面积分别为 LDA 和 LR 模型的 0.872 和 0.880。模型的外部验证表明,LDA 和 LR 模型正确分类了 76.5%的测试集化合物。这些发现表明,本文报告的两种模型可用于将 OH-PCB 同系物有效且可靠地分为活性和非活性 TR 激动剂。