Suppr超能文献

评估化学相似性作为识别高关注潜在物质的度量。

Evaluating chemical similarity as a measure to identify potential substances of very high concern.

机构信息

National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720BA, Bilthoven, the Netherlands; Institute of Environmental Sciences (CML), Leiden University, P. O. Box 9518, 2300RA, Leiden, the Netherlands.

National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720BA, Bilthoven, the Netherlands.

出版信息

Regul Toxicol Pharmacol. 2021 Feb;119:104834. doi: 10.1016/j.yrtph.2020.104834. Epub 2020 Nov 20.

Abstract

Due to the large amount of chemical substances on the market, fast and reproducible screening is essential to prioritize chemicals for further evaluation according to highest concern. We here evaluate the performance of structural similarity models that are developed to identify potential substances of very high concern (SVHC) based on structural similarity to known SVHCs. These models were developed following a systematic analysis of the performance of 112 different similarity measures for varying SVHC-subgroups. The final models consist of the best combinations of fingerprint, similarity coefficient and similarity threshold, and suggested a high predictive performance (≥80%) on an internal dataset consisting of SVHC and non-SVHC substances. However, the application performance on an external dataset was not evaluated. Here, we evaluated the application performance of the developed similarity models with a 'pseudo-external assessment' on a set of substances (n = 60-100 for the varying SVHC-subgroups) that were putatively assessed as SVHC or non-SVHC based upon consensus scoring using expert elicitations (n = 30 experts). Expert scores were direct evaluations based on structural similarity to the most similar SVHCs according to the similarity models, and did not consider an extensive evaluation of available data. The use of expert opinions is particularly suitable as this is exactly the intended purpose of the chemical similarity models: a quick, reproducible and automated screening tool that mimics the expert judgement that is frequently applied in various screening applications. In addition, model predictions were analyzed via qualitative approaches and discussed via specific examples, to identify the model's strengths and limitations. The results indicate a good statistical performance for carcinogenic, mutagenic or reprotoxic (CMR) and endocrine disrupting (ED) substances, whereas a moderate performance was observed for (very) persistent, (very) bioaccumulative and toxic (PBT/vPvB) substances when compared to expert opinions. For the PBT/vPvB model, particularly false positive substances were identified, indicating the necessity of outcome interpretation. The developed similarity models are made available as a freely-accessible online tool. In general, the structural similarity models showed great potential for screening and prioritization purposes. The models proved to be effective in identifying groups of substances of potential concern, and could be used to identify follow-up directions for substances of potential concern.

摘要

由于市场上存在大量的化学物质,因此快速且可重复的筛选对于根据最高关注程度优先对化学品进行进一步评估至关重要。我们在这里评估了结构相似性模型的性能,这些模型是为了根据已知的高关注物质 (SVHC) 的结构相似性来识别潜在的高关注物质 (SVHC) 而开发的。这些模型是在对 112 种不同的相似性度量方法针对不同的 SVHC 亚组进行系统分析后开发的。最终模型由指纹、相似系数和相似性阈值的最佳组合组成,并在包含 SVHC 和非-SVHC 物质的内部数据集上表现出较高的预测性能(≥80%)。然而,并没有评估其在外部数据集上的应用性能。在这里,我们通过对一组物质(SVHC 亚组的数量为 60-100)进行“伪外部评估”,评估了开发的相似性模型的应用性能。这些物质被根据专家共识评分(30 位专家)假定为 SVHC 或非-SVHC。专家评分是根据相似性模型与最相似的 SVHC 之间的结构相似性进行的直接评估,并未考虑对可用数据的广泛评估。使用专家意见特别合适,因为这正是化学相似性模型的预期目的:一种快速、可重复和自动化的筛选工具,可模拟在各种筛选应用中经常应用的专家判断。此外,还通过定性方法分析了模型预测,并通过具体示例进行了讨论,以确定模型的优势和局限性。结果表明,对于致癌、致突变或生殖毒性 (CMR) 和内分泌干扰 (ED) 物质,模型具有良好的统计学性能,而与专家意见相比,对于(非常)持久性、(非常)生物累积性和毒性 (PBT/vPvB) 物质,模型则具有中等性能。对于 PBT/vPvB 模型,特别是鉴定出了假阳性物质,这表明需要对结果进行解释。开发的相似性模型可作为免费访问的在线工具使用。总的来说,结构相似性模型在筛选和优先级确定方面具有很大的潜力。这些模型证明了它们在识别潜在关注物质群体方面的有效性,并可用于确定潜在关注物质的后续方向。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验