Institute for Chemical and Bioengineering, Swiss Federal Institute of Technology (ETH) Zürich, Zürich, Switzerland.
Environ Toxicol Chem. 2013 Apr;32(5):1187-95. doi: 10.1002/etc.2150. Epub 2013 Apr 1.
The present study presents a data-oriented, tiered approach to assessing the bioaccumulation potential of chemicals according to the European chemicals regulation on Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH). The authors compiled data for eight physicochemical descriptors (partition coefficients, degradation half-lives, polarity, and so forth) for a set of 713 organic chemicals for which experimental values of the bioconcentration factor (BCF) are available. The authors employed supervised machine learning methods (conditional inference trees and random forests) to derive relationships between the physicochemical descriptors and the BCF values. In a first tier, the authors established rules for classifying a chemical as bioaccumulative (B) or nonbioaccumulative (non-B). In a second tier, the authors developed a new tool for estimating numerical BCF values. For both cases the optimal set of relevant descriptors was determined; these are biotransformation half-life and octanol-water distribution coefficient (log D) for the classification rules and log D, biotransformation half-life, and topological polar surface area for the BCF estimation tool. The uncertainty of the BCF estimates obtained with the new estimation tool was quantified by comparing the estimated and experimental BCF values of the 713 chemicals. Comparison with existing BCF estimation methods indicates that the performance of this new BCF estimation tool is at least as high as that of existing methods. The authors recommend the present study's classification rules and BCF estimation tool for a consensus application in combination with existing BCF estimation methods.
本研究提出了一种面向数据的、分层的方法,根据欧盟化学品注册、评估、授权和限制法规 (REACH) 评估化学品的生物积累潜力。作者为一组 713 种有机化学品编制了 8 种物理化学描述符(分配系数、降解半衰期、极性等)的数据,这些化学品都有实验测定的生物浓缩因子 (BCF) 值。作者采用有监督的机器学习方法(条件推断树和随机森林)来推导物理化学描述符与 BCF 值之间的关系。在第一层,作者建立了将化学物质分类为生物积累性(B)或非生物积累性(非-B)的规则。在第二层,作者开发了一种新的工具来估计数值 BCF 值。对于这两种情况,都确定了最佳的相关描述符集;这些是生物转化半衰期和辛醇-水分配系数 (log D) 用于分类规则,以及 log D、生物转化半衰期和拓扑极性表面积用于 BCF 估计工具。通过比较 713 种化学品的估计和实验 BCF 值,量化了新估计工具获得的 BCF 估计值的不确定性。与现有的 BCF 估计方法相比,表明这种新的 BCF 估计工具的性能至少与现有方法一样高。作者建议在与现有的 BCF 估计方法结合使用时,采用本研究的分类规则和 BCF 估计工具来达成共识。