Quantitative Sustainability Assessment, Department of Environmental and Resource Engineering, Technical University of Denmark, Produktionstorvet 424, 2800, Kgs. Lyngby, Denmark.
Quantitative Sustainability Assessment, Department of Environmental and Resource Engineering, Technical University of Denmark, Produktionstorvet 424, 2800, Kgs. Lyngby, Denmark.
Chemosphere. 2022 Sep;302:134886. doi: 10.1016/j.chemosphere.2022.134886. Epub 2022 May 7.
Chemical data for thousands of substances are available for safety, risk, life cycle and substitution assessments, as submitted for example under the European Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH) Regulation. However, to widely disseminate reported physicochemical properties as well as human and ecological exposure and toxicological data for use in various science and policy fields, systematic methods for data harmonization and selection are necessary. In response to this need, we developed a semi-automated method for deriving appropriate substance property values as input for various assessment frameworks with different requirements for resolution and data quality. Starting with data reported for a given substance and property, we propose a set of aligned data selection and harmonization criteria to obtain a representative mean value and related confidence intervals per chemical-property combination. The proposed method was tested on a set of octanol-water partition coefficients (K) for an illustrative set of 20 substances, reported under the REACH regulation as example data source. Our method is generally applicable to any set of substances, and can assess specific distributions in quality and variability across reported data. Further research can likely extend our method for mining information from text fields and adapt it to available data reported or collected from other sources and other substance properties to improve the reliability of input data for risk and impact assessments.
有成千上万种物质的化学数据可用于安全、风险、生命周期和替代评估,例如根据欧洲化学品注册、评估、授权和限制法规(REACH)提交的数据。然而,为了广泛传播报告的物理化学性质以及人类和生态暴露和毒理学数据,以便在各种科学和政策领域使用,需要系统的数据协调和选择方法。针对这一需求,我们开发了一种半自动化方法,用于为具有不同分辨率和数据质量要求的各种评估框架提供适当的物质属性值作为输入。从给定物质和属性报告的数据开始,我们提出了一组对齐的数据选择和协调标准,以获得每个化学-属性组合的代表性平均值和相关置信区间。该方法在一组 20 种物质的辛醇-水分配系数(K)上进行了测试,这些物质是作为示例数据源根据 REACH 法规报告的。我们的方法通常适用于任何物质集,并可以评估报告数据中质量和可变性的特定分布。进一步的研究可能会扩展我们的方法,以从文本字段中挖掘信息,并将其适应其他来源和其他物质属性可用的数据报告或收集,以提高风险和影响评估输入数据的可靠性。