Two-Stage Machine Learning-Based Approach to Predict Points of Departure for Human Noncancer and Developmental/Reproductive Effects.

Affiliations

Department of Veterinary Physiology and Pharmacology, Interdisciplinary Faculty of Toxicology, Texas A&M University, College Station, Texas 77843, United States.

Quantitative Sustainability Assessment, Department of Environmental and Resource Engineering, Technical University of Denmark, Bygningstorvet 115, 2800 Kgs. Lyngby, Denmark.

Publication information

Environ Sci Technol. 2024 Sep 3;58(35):15638-15649. doi: 10.1021/acs.est.4c00172. Epub 2024 May 2.

DOI:10.1021/acs.est.4c00172

PMID:38693844

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11371525/

Abstract

Chemical points of departure (PODs) for critical health effects are crucial for evaluating and managing human health risks and impacts from exposure. However, PODs are unavailable for most chemicals in commerce due to a lack of toxicity data. We therefore developed a two-stage machine learning (ML) framework to predict human-equivalent PODs for oral exposure to organic chemicals based on chemical structure. Utilizing ML-based predictions for structural/physical/chemical/toxicological properties from OPERA 2.9 as features (Stage 1), ML models using random forest regression were trained with human-equivalent PODs derived from data sets for general noncancer effects (n = 1,791) and reproductive/developmental effects (n = 2,228), with robust cross-validation for feature selection and estimating generalization errors (Stage 2). These two-stage models accurately predicted PODs for both effect categories with cross-validation-based root-mean-squared errors less than an order of magnitude. We then applied one or both models to 34,046 chemicals expected to be in the environment, revealing several thousand chemicals of concern and several hundred chemicals of concern for health effects at estimated median population exposure levels. Further application can expand by orders of magnitude the coverage of organic chemicals that can be evaluated for their human health risks and impacts.
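
The phrase "cross-validation-based root-mean-squared errors less than an order of magnitude" can be made concrete with a small sketch. The code below is not the authors' pipeline: it uses synthetic data in place of OPERA-derived features, a toy nearest-neighbour regressor as a stand-in for random forest regression, and hypothetical names throughout (`KNNRegressor`, `kfold_indices`, `cv_rmse`). It only illustrates how a k-fold RMSE on log10-transformed PODs is computed, and why an RMSE below 1 log10 unit corresponds to a typical error of less than a factor of 10.

```python
import math
import random
import statistics


class KNNRegressor:
    """Toy stand-in model (NOT the paper's random forest): mean of the k nearest neighbours."""

    def __init__(self, k=5):
        self.k = k

    def fit(self, X, y):
        self.X_ = list(X)
        self.y_ = list(y)
        return self

    def predict(self, X):
        preds = []
        for x in X:
            # Rank training points by Euclidean distance to x.
            order = sorted(range(len(self.X_)), key=lambda i: math.dist(x, self.X_[i]))
            preds.append(statistics.fmean([self.y_[i] for i in order[: self.k]]))
        return preds


def kfold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and split into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cv_rmse(model_factory, X, y, k=5, seed=0):
    """RMSE over all held-out predictions; y is assumed to be log10(POD)."""
    squared_errors = []
    for held_out in kfold_indices(len(y), k, seed):
        held = set(held_out)
        train = [i for i in range(len(y)) if i not in held]
        model = model_factory().fit([X[i] for i in train], [y[i] for i in train])
        preds = model.predict([X[i] for i in held_out])
        squared_errors.extend((p - y[i]) ** 2 for p, i in zip(preds, held_out))
    return math.sqrt(statistics.fmean(squared_errors))


# Synthetic stand-ins: three made-up "predicted property" features per chemical
# and a made-up log10 POD that depends on two of them plus noise.
rng = random.Random(42)
X = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(200)]
y = [1.0 + 1.5 * x[0] - 0.8 * x[1] + rng.gauss(0, 0.3) for x in X]

rmse = cv_rmse(lambda: KNNRegressor(k=5), X, y, k=5)
print(f"CV RMSE: {rmse:.2f} log10 units (about {10 ** rmse:.1f}-fold typical error)")
```

Because `cv_rmse` only relies on `fit` and `predict`, any regressor with that interface, for example scikit-learn's `RandomForestRegressor`, could presumably be passed in place of the toy model.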

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/11375761/e6e632b153a9/es4c00172_0001.jpg
