Chawla Raymond, Hunter Paul R
UCD School of Biomolecular and Biomedical Sciences, Conway Institute, University College Dublin, Belfield, Dublin 4, Ireland.
Water Res. 2005 Nov;39(18):4552-8. doi: 10.1016/j.watres.2005.08.022. Epub 2005 Oct 5.
Analyses of Irish bathing water quality data sets are reported to investigate whether the parametric calculations proposed in the draft Bathing Water Directive are valid. Faecal coliforms (assumed to be Escherichia coli) and faecal streptococci (assumed to be intestinal enterococci) were analysed separately. Classifications based on the parametric 95th percentile calculations disagreed with those based on percentage compliance with the standards on 13.8% of occasions. When these disagreements were studied, it was found that the data sets frequently contained many censored data points (results reported as < 1). In addition, the data sets were not log-normally distributed on at least 85% of occasions. Both findings fatally undermine the validity of using a parametric method for calculating 95th percentiles to classify bathing water quality. By contrast, the non-parametric Hazen method gives a better estimate of true population 95th percentiles, but yields classifications very similar to those of the percentage compliance approach, fully agreeing on over 95% of occasions. The same is true when considering 90th percentiles. A series of Monte Carlo studies was also conducted to determine the impact of small sample sizes. Small samples proved very unreliable for determining bathing water classification, and the parametric method was especially unreliable. In conclusion, the parametric method for calculating bathing water compliance is so fatally flawed statistically that it should not form the basis of any legislation. The Hazen method gives a better estimate of true 90th or 95th percentiles, though because the resulting classifications agree so closely with percentage compliance, it is doubtful that there is any statistical value in using a percentile approach over the long-established and well-understood percentage compliance approach.
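The three classification statistics compared in the abstract can be illustrated with a minimal sketch. It is not the authors' code. The function names, the substitution of the detection limit for censored results, and the multiplier 1.65 (the value commonly quoted for the parametric 95th percentile, with 1.282 for the 90th) are assumptions made for illustration only.

```python
import math
import statistics


def hazen_percentile(values, p):
    """Non-parametric Hazen percentile: interpolate at rank p/100 * n + 0.5."""
    xs = sorted(values)
    n = len(xs)
    rank = p / 100 * n + 0.5                # 1-based Hazen rank
    rank = min(max(rank, 1.0), float(n))    # clamp to the observed range
    lower = math.floor(rank)
    if lower >= n:
        return xs[-1]
    frac = rank - lower
    return xs[lower - 1] + frac * (xs[lower] - xs[lower - 1])


def parametric_percentile(values, z=1.65, detection_limit=1.0):
    """Log-normal percentile: 10 ** (mean + z * sd) of the log10 counts.

    Assumption: censored results (below the detection limit) are replaced
    by the limit itself. This is one common convention, not necessarily
    the one used in the paper.
    """
    logs = [math.log10(max(v, detection_limit)) for v in values]
    mu = statistics.mean(logs)
    sigma = statistics.stdev(logs)          # sample standard deviation
    return 10 ** (mu + z * sigma)


def percent_compliance(values, limit):
    """Percentage of samples at or below the regulatory limit."""
    return 100 * sum(v <= limit for v in values) / len(values)
```

With many censored points, the substituted values bunch at the detection limit and distort both the mean and the standard deviation of the logs, which is one way to see why the parametric estimate can diverge from the Hazen and percentage compliance results.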