Department of Food, Medicines and Consumer Safety, Scientific Institute of Public Health, 1050 Brussels, Belgium.
Department of In Vitro Toxicology and Dermato-Cosmetology, Vrije Universiteit Brussel, 1090 Brussels, Belgium.
Toxicol Sci. 2018 Jun 1;163(2):632-638. doi: 10.1093/toxsci/kfy057.
In silico methodologies, such as (quantitative) structure-activity relationships ([Q]SARs), are available to predict a wide variety of toxicological properties and biological activities for structurally diverse substances. To obtain insights in the scientific value of these predictions, the capacity of the prediction models to generate (sufficiently) reliable results for a particular type of compounds needs to be evaluated. In the current study, performance parameters to predict the endpoint "bacterial mutagenicity" were calculated for a battery of common (Q)SAR tools, namely Toxtree, Derek Nexus, VEGA Consensus, and Sarah Nexus. Printed paper and board food contact material (FCM) constituents were chosen as study substances because many of these lack experimental data, making them an interesting group for in silico screening. Accuracy, sensitivity, specificity, positive predictivity, negative predictivity, and Matthews correlation coefficient for the individual models and for the combination of VEGA Consensus and Sarah Nexus were determined and compared. Our results demonstrate that performance varies among the four models, but can be increased by applying a combination strategy. Furthermore, the importance of the applicability domain is illustrated. Limited performance to predict the mutagenic potential of substances that are new to the model (ie, not included in the training set) is reported. In this context, the generally poor sensitivity for these new substances is also addressed.
在计算机中,有许多方法可以预测具有不同结构的物质的各种毒理学性质和生物活性,例如(定量)构效关系(QSARs)。为了深入了解这些预测的科学价值,需要评估预测模型对特定类型化合物产生(足够)可靠结果的能力。在本研究中,我们计算了一组常见的(QSAR)工具(如 Toxtree、Derek Nexus、VEGA Consensus 和 Sarah Nexus)预测“细菌致突变性”终点的性能参数。选择印刷纸和纸板食品接触材料(FCM)成分作为研究物质,因为其中许多物质缺乏实验数据,因此它们是计算机筛选的有趣群体。确定并比较了个体模型以及 VEGA Consensus 和 Sarah Nexus 组合的准确性、灵敏度、特异性、阳性预测值、阴性预测值和 Matthews 相关系数。我们的结果表明,四个模型之间的性能存在差异,但通过应用组合策略可以提高性能。此外,还说明了适用性域的重要性。报告了模型对新物质(即在训练集中未包含的物质)的突变潜力预测能力有限。在这种情况下,还解决了这些新物质的一般灵敏度较差的问题。