Developing machine learning approaches to identify candidate persistent, mobile and toxic (PMT) and very persistent and very mobile (vPvM) substances based on molecular structure.

Authors

Han Min, Jin Biao, Liang Jun, Huang Chen, Arp Hans Peter H

Affiliations

State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou, 510640, China; CAS Center for Excellence in Deep Earth Science, Guangzhou, 510640, China; University of Chinese Academy of Sciences, Beijing, 10069, China.

Publication information

Water Res. 2023 Oct 1;244:120470. doi: 10.1016/j.watres.2023.120470. Epub 2023 Aug 9.

DOI:10.1016/j.watres.2023.120470
PMID:37595327
Abstract

Determining which substances on the global market could be classified as persistent, mobile and toxic (PMT) or very persistent and very mobile (vPvM) substances is essential to prevent or reduce drinking water contamination by them. This study developed machine learning models based on different molecular descriptors (MDs) and defined applicability domains for screening PMT/vPvM substances. The models were trained on 3111 substances with expert weight-of-evidence PMT/vPvM hazard classifications that considered the highest-quality data available. The models rest on the hypothesis that PMT/vPvM substances share similar MDs that are representative of chemical structures resistant to degradation, are associated with low sorption (or high water solubility) and, in some cases, are associated with known toxic mechanisms. All possible model combinations were tested by integrating different molecular description methods, data balancing strategies and machine learning algorithms. The resulting model allows one-step prediction of candidate PMT/vPvM substances, and it was compared with an approach that predicts P, M and T separately (i.e. three-step prediction). The one-step model achieved the higher accuracy: 92% for PMT/vPvM identification (i.e. positive samples) on an internal test set, and 90% on an external test set of chemical pollutants detected in Taihu Lake, China. Furthermore, the prediction mechanism of the model was interpreted using Shapley additive explanations (SHAP). This work advances big-data in silico screening models for identifying substances that potentially meet the PMT/vPvM criteria.
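
The workflow the abstract describes can be sketched in a few lines of Python. This is a minimal illustration only, not the authors' implementation: the option names in the three lists are hypothetical placeholders (the abstract does not name the descriptor sets, balancing strategies or algorithms), and the rule used to combine separate P, M and T predictions is an assumption. The sketch enumerates every model combination, shows how a three-step prediction might be collapsed into a single flag, and computes accuracy on positive samples only, which is the metric the abstract reports.

```python
from itertools import product

# Hypothetical placeholder names: the abstract does not list the actual
# descriptor sets, balancing strategies, or algorithms that were tested.
DESCRIPTOR_METHODS = ["descriptors_a", "descriptors_b"]
BALANCING_STRATEGIES = ["none", "oversample", "undersample"]
ALGORITHMS = ["algorithm_x", "algorithm_y"]


def model_combinations():
    """Enumerate every (descriptor, balancing, algorithm) combination."""
    return list(product(DESCRIPTOR_METHODS, BALANCING_STRATEGIES, ALGORITHMS))


def three_step_label(is_p, is_m, is_t):
    """Combine separate P, M and T predictions into one PMT flag.

    Assumption: a substance is flagged only when all three are positive.
    """
    return bool(is_p and is_m and is_t)


def positive_accuracy(y_true, y_pred):
    """Fraction of truly positive samples that were predicted positive."""
    preds_for_positives = [pred for true, pred in zip(y_true, y_pred) if true]
    if not preds_for_positives:
        raise ValueError("y_true contains no positive samples")
    hits = sum(1 for pred in preds_for_positives if pred)
    return hits / len(preds_for_positives)
```

With the placeholder lists above, `model_combinations()` yields 2 × 3 × 2 = 12 candidate configurations. A one-step model would replace `three_step_label` with a single classifier that outputs the PMT/vPvM flag directly, and both approaches could then be scored with `positive_accuracy` on the same test set.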

Similar articles

1. Developing machine learning approaches to identify candidate persistent, mobile and toxic (PMT) and very persistent and very mobile (vPvM) substances based on molecular structure.
   Water Res. 2023 Oct 1;244:120470. doi: 10.1016/j.watres.2023.120470. Epub 2023 Aug 9.
2. Machine Learning-Based Models with High Accuracy and Broad Applicability Domains for Screening PMT/vPvM Substances.
   Environ Sci Technol. 2022 Dec 20;56(24):17880-17889. doi: 10.1021/acs.est.2c06155. Epub 2022 Dec 6.
3. Graph Convolutional Network-Enhanced Model for Screening Persistent, Mobile, and Toxic and Very Persistent and Very Mobile Substances.
   Environ Sci Technol. 2024 Apr 9;58(14):6149-6157. doi: 10.1021/acs.est.4c01201. Epub 2024 Apr 1.
4. Grouping strategies for assessing and managing persistent and mobile substances.
   Environ Sci Eur. 2024;36(1):102. doi: 10.1186/s12302-024-00919-4. Epub 2024 May 21.
5. Occurrence, Distribution, and Environmental Behavior of Persistent, Mobile, and Toxic (PMT) and Very Persistent and Very Mobile (vPvM) Substances in the Sources of German Drinking Water.
   Environ Sci Technol. 2022 Aug 2;56(15):10857-10867. doi: 10.1021/acs.est.2c03659. Epub 2022 Jul 22.
6. [Research Status and Trend Analysis of Environmental and Health Risk and Control of Persistent, Mobile, and Toxic Chemicals].
   Huan Jing Ke Xue. 2023 Jun 8;44(6):3017-3023. doi: 10.13227/j.hjkx.202207182.
7. Identifying persistent, mobile and toxic (PMT) organic compounds detected in shale gas wastewater.
   Sci Total Environ. 2023 Feb 1;858(Pt 2):159821. doi: 10.1016/j.scitotenv.2022.159821. Epub 2022 Nov 2.
8. Assessing the Persistence and Mobility of Organic Substances to Protect Freshwater Resources.
   ACS Environ Au. 2022 Nov 16;2(6):482-509. doi: 10.1021/acsenvironau.2c00024. Epub 2022 Aug 2.
9. Managing PMT/vPvM substances in consumer products through the concepts of essential-use and functional substitution: a case-study for cosmetics.
   Environ Sci Process Impacts. 2023 Jun 21;25(6):1067-1081. doi: 10.1039/d3em00025g.
10. Freshwater ecotoxicity characterization factors for PMT/vPvM substances.
   Chemosphere. 2024 Jul;360:142391. doi: 10.1016/j.chemosphere.2024.142391. Epub 2024 May 20.

Cited by

1. The Active Soil Layer of Thawing Permafrost Is an Emergent Source for Organic Substances of Concern to Water Resources.
   Environ Sci Technol Lett. 2025 Apr 21;12(5):558-566. doi: 10.1021/acs.estlett.5c00275. eCollection 2025 May 13.
2. Transformers enable accurate prediction of acute and chronic chemical toxicity in aquatic organisms.
   Sci Adv. 2024 Mar 8;10(10):eadk6669. doi: 10.1126/sciadv.adk6669. Epub 2024 Mar 6.
3. Machine learning coupled with causal inference to identify COVID-19 related chemicals that pose a high concern to drinking water.
   iScience. 2024 Jan 24;27(2):109012. doi: 10.1016/j.isci.2024.109012. eCollection 2024 Feb 16.