Ngatia Murage, Gonzalez David, San Julian Steve, Conner Arin
Municipal Water Quality Investigations Program, Division of Environmental Services, California Department of Water Resources, P.O. Box 942836, Sacramento, CA, USA.
J Environ Monit. 2010 Jan;12(1):172-7. doi: 10.1039/b912098j. Epub 2009 Oct 21.
To evaluate whether two unattended field organic carbon instruments could provide data comparable to laboratory-generated data, we needed a practical assessment method. Null hypothesis statistical testing (NHST) is commonly used for such evaluations in environmental assessments, but researchers in other disciplines have identified weaknesses that may limit its usefulness. For example, p-values in NHST depend on sample size, so a statistically significant result can be obtained merely by increasing the sample size. In addition, p-values can indicate that observed results are statistically significantly different even when the differences are trivial in magnitude. Equivalence tests, on the other hand, allow the investigator to incorporate decision criteria that have practical relevance to the study. In this paper, we demonstrate the potential use of equivalence tests as an alternative to NHST. We first compare the data from the two field instruments with each other, and then compare the field instruments' data to laboratory-generated data, using both NHST and equivalence tests. NHST indicated that the two field instruments differed significantly from each other and from the laboratory. Equivalence tests showed that the data were equivalent because the differences fell within a pre-determined equivalence interval based on our knowledge of laboratory precision. We conclude that equivalence tests provide more useful comparisons and interpretation of water quality data than NHST and should be more widely used in similar environmental assessments.
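The contrast the abstract draws between NHST and equivalence testing can be illustrated with a minimal sketch. The abstract does not say which equivalence procedure was used; the code below assumes the common two one-sided tests (TOST) formulation on paired differences, with a large-sample normal approximation in place of a t distribution. The function name `paired_nhst_and_tost`, the `margin` values, and the readings are all hypothetical and synthetic, chosen only to show how a small, consistent bias can be statistically significant yet fall inside an equivalence interval.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def paired_nhst_and_tost(field, lab, margin, alpha=0.05):
    """Compare paired readings with a two-sided test and with TOST.

    Assumption of this sketch: a large-sample normal approximation is used;
    a t distribution would be more appropriate for small samples.
    `margin` is the half-width of the equivalence interval (hypothetical).
    """
    diffs = [f - l for f, l in zip(field, lab)]
    d_bar = mean(diffs)
    se = stdev(diffs) / sqrt(len(diffs))
    z = NormalDist()

    # NHST: the null hypothesis is "mean difference equals zero".
    p_nhst = 2 * (1 - z.cdf(abs(d_bar / se)))

    # TOST: the null hypothesis is "mean difference lies outside
    # (-margin, +margin)"; both one-sided tests must reject it.
    p_lower = 1 - z.cdf((d_bar + margin) / se)  # tests difference > -margin
    p_upper = z.cdf((d_bar - margin) / se)      # tests difference < +margin
    p_tost = max(p_lower, p_upper)

    return {
        "mean_diff": d_bar,
        "nhst_significant": p_nhst < alpha,
        "equivalent": p_tost < alpha,
    }


# Synthetic paired readings (not data from the study): the "field" values
# carry a small, consistent bias of about 0.05 relative to the "lab" values.
lab = [3.0 + 0.01 * (i % 10) for i in range(200)]
field = [x + 0.05 + (0.01 if i % 2 else -0.01) for i, x in enumerate(lab)]

result = paired_nhst_and_tost(field, lab, margin=0.2)
print(result["nhst_significant"], result["equivalent"])
```

With these synthetic values, the bias is detected as significant by the two-sided test, while TOST declares the two series equivalent because the bias is well inside the assumed margin of 0.2. Shrinking the margin below the bias (for example `margin=0.01`) reverses the equivalence conclusion, which is why the interval has to be justified in advance from practical knowledge such as laboratory precision.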