Feshuk M, Kolaczkowski L, Dunham K, Davidson-Fritz S E, Carstens K E, Brown J, Judson R S, Paul Friedman K
Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States.
National Student Services Contractor, Oak Ridge Associated Universities, Oak Ridge, TN, United States.
Front Toxicol. 2023 Sep 21;5:1275980. doi: 10.3389/ftox.2023.1275980. eCollection 2023.
The US Environmental Protection Agency Toxicity Forecaster (ToxCast) program makes medium- and high-throughput screening assay data publicly available for prioritization and hazard characterization of thousands of chemicals. The assays employ a variety of technologies to evaluate the effects of chemical exposure on diverse biological targets, from distinct proteins to more complex cellular processes like mitochondrial toxicity, nuclear receptor signaling, immune responses, and developmental toxicity. The ToxCast data pipeline (tcpl) is an open-source R package that stores, manages, curve-fits, and visualizes ToxCast data and populates the linked MySQL database, invitrodb. Herein we describe major updates to tcpl and invitrodb to accommodate a new curve-fitting approach. The original tcpl curve-fitting models (constant, Hill, and gain-loss) have been expanded to include Polynomial 1 (Linear), Polynomial 2 (Quadratic), Power, Exponential 2, Exponential 3, Exponential 4, and Exponential 5, based on BMDExpress and encoded by the R package dependency, tcplfit2. Inclusion of these models impacted invitrodb (beta version v4.0) and tcpl v3 in several ways: (1) long-format storage of generic modeling parameters to permit additional curve-fitting models; (2) updated logic for winning model selection; (3) continuous hit-calling logic; and (4) removal of redundant endpoints as a result of bidirectional fitting. Overall, hit calls and potency estimates were largely consistent between invitrodb v3.5 and v4.0. Together, tcpl and invitrodb provide a standard for consistent and reproducible curve-fitting and data management for diverse, targeted assay data with readily available documentation, thus enabling sharing and use of these data in myriad toxicology applications. The software and database updates described herein promote comparability across multiple tiers of data within the US Environmental Protection Agency CompTox Blueprint.
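The long-format storage mentioned in point (1) can be illustrated with a minimal Python sketch. It is not the invitrodb schema: the field names (`fit_id`, `model`, `param`, `value`), the parameter names, and the numbers are all hypothetical and chosen only for illustration. The idea shown is that each model parameter becomes its own row, so a newly added curve-fitting model contributes new rows rather than new columns.

```python
# Hypothetical wide-format records: one column per model parameter.
# Adding a new model to this layout would require adding new columns.
wide_rows = [
    {"fit_id": 1, "hill_tp": 85.0, "hill_ga": 1.2, "hill_p": 1.5, "poly1_a": 3.1},
    {"fit_id": 2, "hill_tp": 40.0, "hill_ga": 0.7, "hill_p": 2.0, "poly1_a": None},
]


def wide_to_long(rows, id_key="fit_id"):
    """Convert wide records into long-format (fit_id, model, param, value) rows."""
    long_rows = []
    for row in rows:
        for column, value in row.items():
            # Skip the identifier column and parameters that were never fit.
            if column == id_key or value is None:
                continue
            # Column names follow the illustrative "<model>_<param>" pattern.
            model, param = column.split("_", 1)
            long_rows.append(
                {"fit_id": row[id_key], "model": model, "param": param, "value": value}
            )
    return long_rows


def params_for(long_rows, fit_id, model):
    """Collect the parameters of one model for one fit into a dictionary."""
    return {
        r["param"]: r["value"]
        for r in long_rows
        if r["fit_id"] == fit_id and r["model"] == model
    }


long_rows = wide_to_long(wide_rows)

# A model added later (here a hypothetical "exp4" fit) only appends rows;
# the record layout itself does not change.
long_rows.append({"fit_id": 1, "model": "exp4", "param": "tp", "value": 90.0})
long_rows.append({"fit_id": 1, "model": "exp4", "param": "ga", "value": 1.1})

hill_fit_1 = params_for(long_rows, 1, "hill")
exp4_fit_1 = params_for(long_rows, 1, "exp4")
```

In this layout, a query for any model's parameters has the same shape regardless of how many models exist, which is the property that makes room for additional curve-fitting models.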