Nelms Mark D, Antonijevic Todor, Ring Caroline, Harris Danni L, Bever Ronnie Joe, Lynn Scott G, Williams David, Chappell Grace, Boyles Rebecca, Borghoff Susan, Edwards Stephen W, Markey Kristan
RTI International, Research Triangle Park, NC, United States.
ToxStrategies, Katy, TX, United States.
Front Toxicol. 2024 Apr 17;6:1346767. doi: 10.3389/ftox.2024.1346767. eCollection 2024.
The U. S. Environmental Protection Agency's Endocrine Disruptor Screening Program (EDSP) Tier 1 assays are used to screen for potential endocrine system-disrupting chemicals. A model integrating data from 16 high-throughput screening assays to predict estrogen receptor (ER) agonism has been proposed as an alternative to some low-throughput Tier 1 assays. Later work demonstrated that as few as four assays could replicate the ER agonism predictions from the full model with 98% sensitivity and 92% specificity. The current study utilized chemical clustering to illustrate the coverage of the EDSP Universe of Chemicals (UoC) tested in the existing ER pathway models and to investigate the utility of chemical clustering to evaluate the screening approach using an existing 4-assay model as a test case. Although the full original assay battery is no longer available, the demonstrated contribution of chemical clustering is broadly applicable to assay sets, chemical inventories, and models, and the data analysis used can also be applied to future evaluation of minimal assay models for consideration in screening.
Chemical structures were collected for 6,947 substances via the CompTox Chemicals Dashboard from the over 10,000 UoC and grouped based on structural similarity, generating 826 chemical clusters. Of the 1,812 substances run in the original ER model, 1,730 substances had a single, clearly defined structure. The ER model chemicals with a clearly defined structure that were not present in the EDSP UoC were assigned to chemical clusters using a k-nearest neighbors approach, resulting in 557 EDSP UoC clusters containing at least one ER model chemical.
Performance of an existing 4-assay model in comparison with the existing full ER agonist model was analyzed as related to chemical clustering. This was a case study, and a similar analysis can be performed with any subset model in which the same chemicals (or subset of chemicals) are screened. Of the 365 clusters containing >1 ER model chemical, 321 did not have any chemicals predicted to be agonists by the full ER agonist model. The best 4-assay subset ER agonist model disagreed with the full ER agonist model by predicting agonist activity for 122 chemicals from 91 of the 321 clusters. There were 44 clusters with at least two chemicals and at least one agonist based upon the full ER agonist model, which allowed accuracy predictions on a per-cluster basis. The accuracy of the best 4-assay subset ER agonist model ranged from 50% to 100% across these 44 clusters, with 32 clusters having accuracy ≥90%. Overall, the best 4-assay subset ER agonist model resulted in 122 false-positive and only 2 false-negative predictions compared with the full ER agonist model. Most false positives (89) were active in only two of the four assays, whereas all but 11 true positive chemicals were active in at least three assays. False positive chemicals also tended to have lower area under the curve (AUC) values, with 110 out of 122 false positives having an AUC value below 0.214, which is lower than 75% of the positives as predicted by the full ER agonist model. Many false positives demonstrated borderline activity. The median AUC value for the 122 false positives from the best 4-assay subset ER agonist model was 0.138, whereas the threshold for an active prediction is 0.1.
Our results show that the existing 4-assay model performs well across a range of structurally diverse chemicals. Although this is a descriptive analysis of previous results, several concepts can be applied to any screening model used in the future. First, the clustering of the chemicals provides a means of ensuring that future screening evaluations consider the broad chemical space represented by the EDSP UoC. The clusters can also assist in prioritizing future chemicals for screening in specific clusters based on the activity of known chemicals in those clusters. The clustering approach can be useful in providing a framework to evaluate which portions of the EDSP UoC chemical space are reliably covered by and approaches and where predictions from either method alone or both methods combined are most reliable. The lessons learned from this case study can be easily applied to future evaluations of model applicability and screening to evaluate future datasets.
美国环境保护局的内分泌干扰物筛选计划(EDSP)一级检测用于筛选潜在的内分泌系统干扰化学物质。有人提出了一个整合16种高通量筛选检测数据以预测雌激素受体(ER)激动作用的模型,作为一些低通量一级检测的替代方案。后来的研究表明,仅四项检测就能以98%的灵敏度和92%的特异性复制完整模型的ER激动作用预测。本研究利用化学聚类来说明现有ER途径模型中所测试的EDSP化学物质库(UoC)的覆盖范围,并以现有的四项检测模型为测试案例,研究化学聚类在评估筛选方法中的效用。尽管最初的完整检测组合已不再可用,但化学聚类所显示的作用广泛适用于检测集、化学物质清单和模型,并且所使用的数据分析也可应用于未来对筛选中考虑的最小检测模型的评估。
通过CompTox化学物质仪表盘从10000多种UoC中收集了6947种物质的化学结构,并根据结构相似性进行分组,生成了826个化学簇。在最初的ER模型中运行的1812种物质中,1730种物质具有单一、明确的结构。使用k近邻方法将EDSP UoC中不存在的具有明确结构的ER模型化学物质分配到化学簇中,从而得到557个至少包含一种ER模型化学物质的EDSP UoC簇。
将现有的四项检测模型与现有的完整ER激动剂模型的性能进行了分析,以探讨与化学聚类的关系。这是一个案例研究,对于筛选相同化学物质(或化学物质子集)的任何子集模型都可以进行类似的分析。在包含>一种ER模型化学物质的365个簇中,321个簇中没有任何化学物质被完整的ER激动剂模型预测为激动剂。最佳的四项检测子集ER激动剂模型与完整的ER激动剂模型不同,它预测了321个簇中91个簇的122种化学物质的激动剂活性。基于完整的ER激动剂模型,有44个簇至少包含两种化学物质且至少有一种激动剂,这使得可以在每个簇的基础上进行准确性预测。在这44个簇中,最佳的四项检测子集ER激动剂模型的准确性范围为50%至100%,其中32个簇的准确性≥90%。总体而言,与完整的ER激动剂模型相比,最佳的四项检测子集ER激动剂模型产生了122个假阳性预测,而假阴性预测仅2个。大多数假阳性(89个)仅在四项检测中的两项中呈阳性,而除11种真阳性化学物质外,所有真阳性化学物质至少在三项检测中呈阳性。假阳性化学物质的曲线下面积(AUC)值也往往较低,122个假阳性中有110个的AUC值低于0.214,低于完整的ER激动剂模型预测的阳性的75%。许多假阳性表现出临界活性。最佳的四项检测子集ER激动剂模型的122个假阳性的AUC值中位数为0.138,而活性预测阈值为0.1。
我们的结果表明,现有的四项检测模型在一系列结构多样的化学物质中表现良好。尽管这是对先前结果进行的描述性分析,但几个概念可应用于未来使用的任何筛选模型。首先,化学物质的聚类提供了一种手段,可确保未来的筛选评估考虑到EDSP UoC所代表的广阔化学空间。这些簇还可根据特定簇中已知化学物质的活性,协助对未来要在特定簇中筛选的化学物质进行优先级排序。聚类方法有助于提供一个框架,以评估EDSP UoC化学空间的哪些部分被 方法和 方法可靠覆盖,以及单独使用任何一种方法或两种方法结合进行预测时,哪些地方最可靠。从这个案例研究中学到的经验教训可以很容易地应用于未来对模型适用性和筛选的评估,以评估未来的数据集。