Festing Michael F W
Medical Research Council Toxicology Unit, Hodgkin Building, Lancaster Road, Leicester, United Kingdom.
PLoS One. 2014 Nov 26;9(11):e112955. doi: 10.1371/journal.pone.0112955. eCollection 2014.
The safety of chemicals, drugs, novel foods and genetically modified crops is often tested using repeat-dose sub-acute toxicity tests in rats or mice. It is important to avoid misinterpretations of the results as these tests are used to help determine safe exposure levels in humans. Treated and control groups are compared for a range of haematological, biochemical and other biomarkers which may indicate tissue damage or other adverse effects. However, the statistical analysis and presentation of such data poses problems due to the large number of statistical tests which are involved. Often, it is not clear whether a "statistically significant" effect is real or a false positive (type I error) due to sampling variation. The author's conclusions appear to be reached somewhat subjectively by the pattern of statistical significances, discounting those which they judge to be type I errors and ignoring any biomarker where the p-value is greater than p = 0.05. However, by using standardised effect sizes (SESs) a range of graphical methods and an over-all assessment of the mean absolute response can be made. The approach is an extension, not a replacement of existing methods. It is intended to assist toxicologists and regulators in the interpretation of the results. Here, the SES analysis has been applied to data from nine published sub-acute toxicity tests in order to compare the findings with those of the author's. Line plots, box plots and bar plots show the pattern of response. Dose-response relationships are easily seen. A "bootstrap" test compares the mean absolute differences across dose groups. In four out of seven papers where the no observed adverse effect level (NOAEL) was estimated by the authors, it was set too high according to the bootstrap test, suggesting that possible toxicity is under-estimated.
化学品、药物、新型食品和转基因作物的安全性通常通过对大鼠或小鼠进行重复剂量亚急性毒性试验来测试。避免对结果产生误解很重要,因为这些试验用于帮助确定人类的安全暴露水平。比较处理组和对照组的一系列血液学、生化和其他生物标志物,这些标志物可能表明组织损伤或其他不良反应。然而,由于涉及大量统计检验,此类数据的统计分析和呈现存在问题。由于抽样变异,通常不清楚“统计学显著”效应是真实的还是假阳性(I型错误)。作者的结论似乎是通过统计显著性模式主观得出的,剔除他们认为是I型错误的那些,并忽略任何p值大于p = 0.05的生物标志物。然而,通过使用标准化效应大小(SESs),可以进行一系列图形方法和对平均绝对反应的总体评估。该方法是对现有方法的扩展,而非替代。它旨在协助毒理学家和监管机构解释结果。在此,SES分析已应用于九项已发表的亚急性毒性试验的数据,以便将结果与作者的结果进行比较。线图、箱线图和柱状图显示了反应模式。剂量反应关系很容易看出。“自举”检验比较各剂量组的平均绝对差异。在作者估计未观察到不良反应水平(NOAEL)的七篇论文中,有四篇根据自举检验,该水平设定得过高,这表明可能的毒性被低估了。