QSAR Research Unit in Environmental Chemistry and Ecotoxicology, Department of Structural and Functional Biology, University of Insubria, via Dunant 3, 21100 Varese, Italy.
J Chem Inf Model. 2010 May 24;50(5):861-74. doi: 10.1021/ci100078u.
Computational tools, such as quantitative structure-activity relationship (QSAR), are highly useful as screening support for prioritization of substances of very high concern (SVHC). From the practical point of view, QSAR models should be effective to pick out more active rather than inactive compounds, expressed as sensitivity in classification works. This research investigates the classification of a big data set of endocrine-disrupting chemicals (EDCs)-androgen receptor (AR) antagonists, mainly aiming to improve the external sensitivity and to screen for potential AR binders. The kNN, lazy IB1, and ADTree methods and the consensus approach were used to build different models, which improve the sensitivity on external chemicals from 57.1% (literature) to 76.4%. Additionally, the models' predictive abilities were further validated on a blind collected data set (sensitivity: 85.7%). Then the proposed classifiers were used: (i) to distinguish a set of AR binders into antagonists and agonists; (ii) to screen a combined estrogen receptor binder database to find out possible chemicals that can bind to both AR and ER; and (iii) to virtually screen our in-house environmental chemical database. The in silico screening results suggest: (i) that some compounds can affect the normal endocrine system through a complex mechanism binding both to ER and AR; (ii) new EDCs, which are nonER binders, but can in silico bind to AR, are recognized; and (iii) about 20% of compounds in a big data set of environmental chemicals are predicted as new AR antagonists. The priority should be given to them to experimentally test the binding activities with AR.
计算工具,如定量构效关系(QSAR),对于作为高关注物质(SVHC)优先级排序的筛选支持非常有用。从实际的角度来看,QSAR 模型应该能够有效地挑选出更活跃而不是不活跃的化合物,这在分类工作中表现为敏感性。本研究调查了一大组内分泌干扰化学物质(EDCs)-雄激素受体(AR)拮抗剂的分类,主要目的是提高外部敏感性并筛选潜在的 AR 结合物。使用 kNN、懒惰 IB1 和 ADTree 方法以及共识方法构建了不同的模型,从而将外部化学物质的敏感性从 57.1%(文献)提高到 76.4%。此外,还进一步验证了模型在盲收集数据集上的预测能力(敏感性:85.7%)。然后使用所提出的分类器:(i)将一组 AR 结合物分为拮抗剂和激动剂;(ii)筛选结合雌激素受体的数据库,以找出可能同时与 AR 和 ER 结合的化学物质;(iii)虚拟筛选我们内部的环境化学数据库。基于计算的筛选结果表明:(i)某些化合物可能通过与 ER 和 AR 结合的复杂机制影响正常的内分泌系统;(ii)识别出一些非雌激素结合但可与 AR 基于计算结合的新 EDC;(iii)在环境化学物大数据集中,约 20%的化合物被预测为新的 AR 拮抗剂。应优先考虑对其进行实验测试与 AR 的结合活性。