Luechtefeld Thomas, Rowlands Craig, Hartung Thomas
Center for Alternatives to Animal Testing at Johns Hopkins Bloomberg School of Public Health , 615 N. Wolfe Street , Baltimore , MD 21205 , USA . Email:
Underwriters Laboratories (UL) , UL Product Supply Chain Intelligence , 333 Pfingsten Road , Northbrook , IL 60062 , USA.
Toxicol Res (Camb). 2018 May 1;7(5):732-744. doi: 10.1039/c8tx00051d. eCollection 2018 Sep 1.
The creation of large toxicological databases and advances in machine-learning techniques have empowered computational approaches in toxicology. Work with these large databases based on regulatory data has allowed reproducibility assessment of animal models, which highlight weaknesses in traditional methods. This should lower the bars for the introduction of new approaches and represents a benchmark that is achievable for any alternative method validated against these methods. Quantitative Structure Activity Relationships (QSAR) models for skin sensitization, eye irritation, and other human health hazards based on these big databases, however, also have made apparent some of the challenges facing computational modeling, including validation challenges, model interpretation issues, and model selection issues. A first implementation of machine learning-based predictions termed REACH achieved unprecedented sensitivities of >80% with specificities >70% in predicting the six most common acute and topical hazards covering about two thirds of the chemical universe. While this is awaiting formal validation, it demonstrates the new quality introduced by big data and modern data-mining technologies. The rapid increase in the diversity and number of computational models, as well as the data they are based on, create challenges and opportunities for the use of computational methods.
大型毒理学数据库的创建以及机器学习技术的进步,推动了毒理学领域的计算方法发展。基于监管数据处理这些大型数据库,使得对动物模型进行可重复性评估成为可能,这凸显了传统方法的不足之处。这应该会降低引入新方法的门槛,并代表了任何与这些方法验证过的替代方法都能达到的一个基准。然而,基于这些大型数据库建立的用于皮肤致敏、眼刺激及其他人类健康危害的定量构效关系(QSAR)模型,也凸显了计算建模面临的一些挑战,包括验证挑战、模型解释问题和模型选择问题。一种名为REACH的基于机器学习预测的首次应用,在预测涵盖约三分之二化学物质的六种最常见急性和局部危害时,实现了前所未有的灵敏度>80%、特异性>70%。虽然这有待正式验证,但它展示了大数据和现代数据挖掘技术带来的新质量。计算模型及其所基于的数据在多样性和数量上的快速增长,给计算方法的使用带来了挑战和机遇。