• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过机器学习预测跨分类群的化学危害。

Predicting chemical hazard across taxa through machine learning.

作者信息

Wu Jimeng, D'Ambrosi Simone, Ammann Lorenz, Stadnicka-Michalak Julita, Schirmer Kristin, Baity-Jesi Marco

机构信息

Eawag, Überlandstrasse 133, CH-8600 Dübendorf, Switzerland; Department of Environmental Engineering, ETHZ, Zurich, Switzerland.

Department of Statistics, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185 Rome, RM, Italy.

出版信息

Environ Int. 2022 May;163:107184. doi: 10.1016/j.envint.2022.107184. Epub 2022 Mar 17.

DOI:10.1016/j.envint.2022.107184
PMID:35306252
Abstract

We applied machine learning methods to predict chemical hazards focusing on fish acute toxicity across taxa. We analyzed the relevance of taxonomy and experimental setup, showing that taking them into account can lead to considerable improvements in the classification performance. We quantified the gain obtained throught the introduction of taxonomic and experimental information, compared to classification based on chemical information alone. We used our approach with standard machine learning models (K-nearest neighbors, random forests and deep neural networks), as well as the recently proposed Read-Across Structure Activity Relationship (RASAR) models, which were very successful in predicting chemical hazards to mammals based on chemical similarity. We were able to obtain accuracies of over 93% on datasets where, due to noise in the data, the maximum achievable accuracy was expected to be below 96%. The best performances were obtained by random forests and RASAR models. We analyzed metrics to compare our results with animal test reproducibility, and despite most of our models "outperform animal test reproducibility" as measured through recently proposed metrics, we showed that the comparison between machine learning performance and animal test reproducibility should be addressed with particular care. While we focused on fish mortality, our approach, provided that the right data is available, is valid for any combination of chemicals, effects and taxa.

摘要

我们应用机器学习方法来预测化学危害,重点关注跨分类群的鱼类急性毒性。我们分析了分类学和实验设置的相关性,表明将它们考虑在内可显著提高分类性能。我们量化了与仅基于化学信息的分类相比,通过引入分类学和实验信息所获得的收益。我们将我们的方法与标准机器学习模型(K近邻、随机森林和深度神经网络)以及最近提出的基于化学相似性在预测对哺乳动物的化学危害方面非常成功的跨读结构活性关系(RASAR)模型一起使用。在数据存在噪声、预期最大可实现准确率低于96%的数据集上,我们能够获得超过93%的准确率。随机森林和RASAR模型取得了最佳性能。我们分析了各项指标以将我们的结果与动物试验可重复性进行比较,尽管通过最近提出的指标衡量我们的大多数模型“优于动物试验可重复性”,但我们表明机器学习性能与动物试验可重复性之间的比较应格外谨慎。虽然我们关注的是鱼类死亡率,但只要有合适的数据,我们的方法对任何化学物质、效应和分类群的组合都是有效的。

相似文献

1
Predicting chemical hazard across taxa through machine learning.通过机器学习预测跨分类群的化学危害。
Environ Int. 2022 May;163:107184. doi: 10.1016/j.envint.2022.107184. Epub 2022 Mar 17.
2
Machine Learning of Toxicological Big Data Enables Read-Across Structure Activity Relationships (RASAR) Outperforming Animal Test Reproducibility.毒理学大数据的机器学习可实现优于动物测试重现性的读交叉结构活性关系 (RASAR)。
Toxicol Sci. 2018 Sep 1;165(1):198-212. doi: 10.1093/toxsci/kfy152.
3
The application of chemical similarity measures in an unconventional modeling framework c-RASAR along with dimensionality reduction techniques to a representative hepatotoxicity dataset.化学相似性度量在非常规建模框架 c-RASAR 中的应用以及降维技术在具有代表性的肝毒性数据集上的应用。
Sci Rep. 2024 Sep 6;14(1):20812. doi: 10.1038/s41598-024-71892-4.
4
Prediction-Inspired Intelligent Training for the Development of Classification Read-across Structure-Activity Relationship (c-RASAR) Models for Organic Skin Sensitizers: Assessment of Classification Error Rate from Novel Similarity Coefficients.基于预测的智能训练在分类读靶结构-活性关系(c-RASAR)模型开发中的应用:新型相似系数分类错误率评估。
Chem Res Toxicol. 2023 Sep 18;36(9):1518-1531. doi: 10.1021/acs.chemrestox.3c00155. Epub 2023 Aug 16.
5
Machine learning - based q-RASAR modeling to predict acute contact toxicity of binary organic pesticide mixtures in honey bees.基于机器学习的 q-RASAR 模型预测二元有机农药混合物对蜜蜂的急性接触毒性。
J Hazard Mater. 2023 Oct 15;460:132358. doi: 10.1016/j.jhazmat.2023.132358. Epub 2023 Aug 22.
6
Efficient predictions of cytotoxicity of TiO-based multi-component nanoparticles using a machine learning-based q-RASAR approach.使用基于机器学习的q-RASAR方法对TiO基多组分纳米颗粒的细胞毒性进行高效预测。
Nanotoxicology. 2023 Feb;17(1):78-93. doi: 10.1080/17435390.2023.2186280. Epub 2023 Mar 8.
7
Bioactivity Comparison across Multiple Machine Learning Algorithms Using over 5000 Datasets for Drug Discovery.利用 5000 多个数据集进行药物发现的多种机器学习算法的生物活性比较。
Mol Pharm. 2021 Jan 4;18(1):403-415. doi: 10.1021/acs.molpharmaceut.0c01013. Epub 2020 Dec 16.
8
Assessing Deep and Shallow Learning Methods for Quantitative Prediction of Acute Chemical Toxicity.评估深度和浅层学习方法在定量预测急性化学毒性方面的应用。
Toxicol Sci. 2018 Aug 1;164(2):512-526. doi: 10.1093/toxsci/kfy111.
9
Molecular similarity in chemical informatics and predictive toxicity modeling: from quantitative read-across (q-RA) to quantitative read-across structure-activity relationship (q-RASAR) with the application of machine learning.化学信息学和预测性毒理学建模中的分子相似性:从定量文献外推 (q-RA) 到基于机器学习的定量文献外推结构-活性关系 (q-RASAR)。
Crit Rev Toxicol. 2024 Oct;54(9):659-684. doi: 10.1080/10408444.2024.2386260. Epub 2024 Sep 3.
10
Breaking the Barriers: Machine-Learning-Based c-RASAR Approach for Accurate Blood-Brain Barrier Permeability Prediction.突破壁垒:基于机器学习的 c-RASAR 方法实现精确的血脑屏障通透性预测。
J Chem Inf Model. 2024 May 27;64(10):4298-4309. doi: 10.1021/acs.jcim.4c00433. Epub 2024 May 3.

引用本文的文献

1
MLinvitroTox reloaded for high-throughput hazard-based prioritization of high-resolution mass spectrometry data.MLinvitroTox 重新加载,用于基于高通量危害的高分辨率质谱数据优先级排序。
J Cheminform. 2025 Jan 31;17(1):14. doi: 10.1186/s13321-025-00950-4.
2
Machine Learning Allowed Interpreting Toxicity of a Fe-Doped CuO NM Library Large Data Set─An Environmental In Vivo Case Study.机器学习助力解读掺铁氧化铜纳米材料库大数据集的毒性——一项环境体内案例研究。
ACS Appl Mater Interfaces. 2024 Aug 14;16(32):42862-42872. doi: 10.1021/acsami.4c07153. Epub 2024 Aug 1.
3
Progress, applications, and challenges in high-throughput effect-directed analysis for toxicity driver identification - is it time for HT-EDA?
用于毒性驱动因素识别的高通量效应导向分析的进展、应用及挑战——是时候采用高通量效应导向分析了吗?
Anal Bioanal Chem. 2025 Jan;417(3):451-472. doi: 10.1007/s00216-024-05424-4. Epub 2024 Jul 12.
4
Systematic approaches to machine learning models for predicting pesticide toxicity.用于预测农药毒性的机器学习模型的系统方法。
Heliyon. 2024 Mar 25;10(7):e28752. doi: 10.1016/j.heliyon.2024.e28752. eCollection 2024 Apr 15.
5
Developing a Novel Read-Across Concept for Ecotoxicological Risk Assessment of Phosphate Chemicals: A Case Study.开发一种用于磷酸盐化学品生态毒理学风险评估的新型类推概念:一个案例研究
Toxics. 2024 Jan 22;12(1):96. doi: 10.3390/toxics12010096.
6
Artificial intelligence (AI)-it's the end of the tox as we know it (and I feel fine).人工智能(AI)——这是我们所知道的毒理学的终结(我感觉很好)。
Arch Toxicol. 2024 Mar;98(3):735-754. doi: 10.1007/s00204-023-03666-2. Epub 2024 Jan 20.
7
A benchmark dataset for machine learning in ecotoxicology.用于生态毒理学机器学习的基准数据集。
Sci Data. 2023 Oct 18;10(1):718. doi: 10.1038/s41597-023-02612-2.
8
Big Question to Developing Solutions: A Decade of Progress in the Development of Aquatic New Approach Methodologies from 2012 to 2022.重大问题解决方案:2012 年至 2022 年,水生新方法学发展十年进展。
Environ Toxicol Chem. 2024 Mar;43(3):559-574. doi: 10.1002/etc.5578. Epub 2023 Mar 20.
9
Outbound Data Legality Analysis in CPTPP Countries under the Environment of Cross-Border Data Flow Governance.《CPTPP 国家跨境数据流治理环境下的数据出境合法性分析》
J Environ Public Health. 2022 Sep 28;2022:6105804. doi: 10.1155/2022/6105804. eCollection 2022.
10
First report of q-RASAR modeling toward an approach of easy interpretability and efficient transferability.首次报告 q-RASAR 建模,旨在实现易于解释和高效可迁移性的方法。
Mol Divers. 2022 Oct;26(5):2847-2862. doi: 10.1007/s11030-022-10478-6. Epub 2022 Jun 29.