Johns Hopkins Bloomberg School of Public Health, Center for Alternatives to Animal Testing (CAAT), Baltimore, USA.
University of Konstanz, CAAT-Europe, Konstanz, Germany.
ALTEX. 2016;33(2):83-93. doi: 10.14573/altex.1603091.
Modern information technologies have made big data available in safety sciences, i.e., extremely large data sets that may be analyzed only computationally to reveal patterns, trends and associations. This happens by (1) compilation of large sets of existing data, e.g., as a result of the European REACH regulation, (2) the use of omics technologies and (3) systematic robotized testing in a high-throughput manner. All three approaches and some other high-content technologies leave us with big data--the challenge is now to make big sense of these data. Read-across, i.e., the local similarity-based intrapolation of properties, is gaining momentum with increasing data availability and consensus on how to process and report it. It is predominantly applied to in vivo test data as a gap-filling approach, but can similarly complement other incomplete datasets. Big data are first of all repositories for finding similar substances and ensure that the available data is fully exploited. High-content and high-throughput approaches similarly require focusing on clusters, in this case formed by underlying mechanisms such as pathways of toxicity. The closely connected properties, i.e., structural and biological similarity, create the confidence needed for predictions of toxic properties. Here, a new web-based tool under development called REACH-across, which aims to support and automate structure-based read-across, is presented among others.
现代信息技术使得安全科学领域能够获取大数据,即通过计算分析才能揭示其模式、趋势和关联的超大数据集。这可以通过以下三种方法实现:(1)编译大量现有数据集,例如,由于欧洲 REACH 法规的要求;(2)使用组学技术;(3)以高通量方式进行系统的机器人测试。所有这三种方法和一些其他高通量技术都产生了大数据,现在的挑战是如何从这些数据中获得有意义的信息。读通(read-across),即基于局部相似性的属性内插,随着数据可用性的增加以及对如何处理和报告数据的共识的增加,其应用也越来越广泛。它主要应用于体内测试数据作为填补空白的方法,但也可以类似地补充其他不完整的数据集。大数据首先是发现相似物质的存储库,可确保充分利用现有数据。高通量和高内涵方法同样需要关注聚类,在这种情况下,聚类由毒性途径等潜在机制形成。密切相关的性质,即结构和生物相似性,为毒性性质的预测提供了所需的置信度。在此,介绍了一种名为 REACH-across 的新的基于网络的工具,旨在支持和自动化基于结构的读通。