Suppr超能文献

通过聚类减少样本量:一种使风险评估适用于大量有机化合物的方法?

Reducing sample size by clustering: A way to make risk assessment feasible for large groups of organic compounds?

机构信息

KWR Water Research, Groningenhaven 7, Nieuwegein 3433 PE, The Netherlands.

KWR Water Research, Groningenhaven 7, Nieuwegein 3433 PE, The Netherlands; Wetsus, Oostergoweg 9, Leeuwarden 8911 MA, The Netherlands.

出版信息

J Water Health. 2024 Aug;22(8):1527-1540. doi: 10.2166/wh.2024.127. Epub 2024 Jul 4.

Abstract

This research addresses the presence of substances of very high concern (SVHCs) confronting the drinking water sector. Responding adequately to the potential hazards by SVHCs, knowledge of emission pathways, toxicity, presence in drinking water sources, and removability during water production is crucial. As this information cannot be received for each compound individually, we employed a detailed clustering approach based on chemical properties and structures of SVHCs from lists with over 1,000 compounds. Through this process, 915 substances were divided into 51 clusters. We tested this clustering in risk assessment. To assess the risks, we developed toxicity prediction models utilizing random forests and multiple linear regression. These models were applied to make toxicity predictions for the list of compounds. This study shows that clustering is a viable approach to reducing sample size. In addition, the toxicity models provide insights into the potential human health risks. This research contributes to more informed decision-making and improved risk assessment in the drinking water sector, aiding in the protection of human health and the environment. This principle is generally applicable. If in a group a suitable representative is found, data from experiments with this compound can be used to gauge the behaviour of chemicals in this group.

摘要

本研究针对饮用水行业面临的高关注物质(SVHC)问题。为了充分应对 SVHC 带来的潜在危害,了解排放途径、毒性、饮用水源中的存在情况以及在水生产过程中的去除能力至关重要。由于无法为每种化合物单独获取这些信息,我们采用了一种详细的聚类方法,该方法基于 SVHC 列表中超过 1000 种化合物的化学性质和结构。通过这个过程,将 915 种物质分为 51 个聚类。我们在风险评估中对这种聚类进行了测试。为了评估风险,我们利用随机森林和多元线性回归开发了毒性预测模型。这些模型被应用于对化合物列表进行毒性预测。这项研究表明聚类是一种可行的方法,可以减少样本量。此外,毒性模型提供了对潜在人类健康风险的深入了解。本研究有助于在饮用水行业做出更明智的决策和改进风险评估,从而保护人类健康和环境。这一原则具有普遍适用性。如果在一个组中发现了合适的代表性物质,可以使用该化合物的实验数据来评估该组中其他化学品的行为。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验