Institute of Nursing Science, University of Basel, Bernoullistrasse 28, 4056 Basel, Switzerland.
Int J Nurs Stud. 2012 Aug;49(8):1039-47. doi: 10.1016/j.ijnurstu.2012.01.015. Epub 2012 Feb 27.
While the p-value will tell the reader a study's results are statistically significant, it does not provide any information about the practical or clinical importance of the findings. Furthermore, p-values are influenced by sample size. They are more likely to be significant when sample size is large and less likely if the sample is small. Effect size estimates, on the other hand, are not sensitive to sample size and provide information about the direction and strength of the relationship between variables (e.g., a treatment and an outcome). In addition to providing valuable clinical information about study findings, effect size estimates can provide a common metric to compare results across studies. Despite their usefulness, effect size estimates are often not reported as part of the research results. Consequently, effect sizes often have to be calculated based on summary and test statistics reported in research articles.
This article provides the formulas utilized to directly calculate common effective size estimates using summary statistics reported in research studies, as well as methods to more indirectly estimate these effect sizes when basis summary statistics are not reported. In addition we present formulas to compute the corresponding confidence interval for each effect size.
虽然 p 值会告诉读者研究结果在统计学上是显著的,但它并没有提供关于研究结果实际或临床重要性的任何信息。此外,p 值受样本量的影响。当样本量较大时,p 值更有可能显著,而当样本量较小时,p 值则不太可能显著。另一方面,效应量估计值不受样本量的影响,提供了变量(例如治疗和结果)之间关系的方向和强度的信息。除了提供有关研究结果的有价值的临床信息外,效应量估计值还可以提供一个共同的指标来比较不同研究的结果。尽管它们很有用,但效应量估计值通常不作为研究结果的一部分报告。因此,通常必须根据研究文章中报告的汇总和检验统计数据来计算效应量。
本文提供了使用研究报告中报告的汇总统计数据直接计算常见有效大小估计值的公式,以及在未报告基础汇总统计数据时更间接估计这些效应大小的方法。此外,我们还提出了计算每个效应大小的相应置信区间的公式。