• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

统计显著性概述:基本原理、有效性和实用性。

Précis of statistical significance: rationale, validity, and utility.

作者信息

Chow S L

机构信息

Department of Psychology, University of Regina, Saskatchewan, Canada.

出版信息

Behav Brain Sci. 1998 Apr;21(2):169-94; discussion 194-239. doi: 10.1017/s0140525x98001162.

DOI:10.1017/s0140525x98001162
PMID:10097013
Abstract

The null-hypothesis significance-test procedure (NHSTP) is defended in the context of the theory-corroboration experiment, as well as the following contrasts: (a) substantive hypotheses versus statistical hypotheses, (b) theory corroboration versus statistical hypothesis testing, (c) theoretical inference versus statistical decision, (d) experiments versus nonexperimental studies, and (e) theory corroboration versus treatment assessment. The null hypothesis can be true because it is the hypothesis that errors are randomly distributed in data. Moreover, the null hypothesis is never used as a categorical proposition. Statistical significance means only that chance influences can be excluded as an explanation of data; it does not identify the nonchance factor responsible. The experimental conclusion is drawn with the inductive principle underlying the experimental design. A chain of deductive arguments gives rise to the theoretical conclusion via the experimental conclusion. The anomalous relationship between statistical significance and the effect size often used to criticize NHSTP is more apparent than real. The absolute size of the effect is not an index of evidential support for the substantive hypothesis. Nor is the effect size, by itself, informative as to the practical importance of the research result. Being a conditional probability, statistical power cannot be the a priori probability of statistical significance. The validity of statistical power is debatable because statistical significance is determined with a single sampling distribution of the test statistic based on H0, whereas it takes two distributions to represent statistical power or effect size. Sample size should not be determined in the mechanical manner envisaged in power analysis. It is inappropriate to criticize NHSTP for nonstatistical reasons. At the same time, neither effect size, nor confidence interval estimate, nor posterior probability can be used to exclude chance as an explanation of data. Neither can any of them fulfill the nonstatistical functions expected of them by critics.

摘要

在理论确证实验的背景下,以及在以下对比中对零假设显著性检验程序(NHSTP)进行了辩护:(a)实质性假设与统计假设,(b)理论确证与统计假设检验,(c)理论推断与统计决策,(d)实验研究与非实验研究,以及(e)理论确证与治疗评估。零假设可能是正确的,因为它是假设误差在数据中随机分布。此外,零假设从未被用作一个绝对命题。统计显著性仅意味着可以排除偶然因素作为数据的一种解释;它并未识别出造成这种情况的非偶然因素。实验结论是根据实验设计所依据的归纳原则得出的。一系列演绎论证通过实验结论得出理论结论。常用于批评NHSTP的统计显著性与效应大小之间的反常关系,实际上比表面上看起来更为明显。效应的绝对大小并非对实质性假设的证据支持的指标。效应大小本身对于研究结果的实际重要性也并无信息价值。作为一个条件概率,统计功效不可能是统计显著性的先验概率。统计功效的有效性存在争议,因为统计显著性是基于原假设(H0)用检验统计量的单一抽样分布来确定的,而表示统计功效或效应大小则需要两个分布。样本量不应按照功效分析中设想的机械方式来确定。出于非统计原因批评NHSTP是不合适的。同时,效应大小、置信区间估计或后验概率都不能用来排除偶然因素作为数据的一种解释。它们也都无法履行批评者期望它们具备的非统计功能。

相似文献

1
Précis of statistical significance: rationale, validity, and utility.统计显著性概述:基本原理、有效性和实用性。
Behav Brain Sci. 1998 Apr;21(2):169-94; discussion 194-239. doi: 10.1017/s0140525x98001162.
2
[Principles of tests of hypotheses in statistics: alpha, beta and P].[统计学中假设检验的原理:α、β与P值]
Ann Fr Anesth Reanim. 1998;17(9):1168-80. doi: 10.1016/s0750-7658(00)80015-5.
3
Does the P Value Have a Future in Plant Pathology?P值在植物病理学中还有未来吗?
Phytopathology. 2015 Nov;105(11):1400-7. doi: 10.1094/PHYTO-07-15-0165-LE. Epub 2015 Oct 1.
4
Statistics in ophthalmology revisited: the (effect) size matters.眼科统计学再探:(效应)大小很重要。
Acta Ophthalmol. 2018 Nov;96(7):e885-e888. doi: 10.1111/aos.13756. Epub 2018 Sep 5.
5
Failed refutations: further comments on parsimony and likelihood methods and their relationship to Popper's degree of corroboration.失败的反驳:关于简约法和似然法及其与波普尔确证度关系的进一步评论
Syst Biol. 2003 Jun;52(3):352-67.
6
The significance of non-significance.无显著性的意义。
QJM. 1998 Sep;91(9):647-53. doi: 10.1093/qjmed/91.9.647.
7
Hypothesis testing.假设检验。
Clin Nurse Spec. 1996 Jul;10(4):186-8. doi: 10.1097/00002800-199607000-00009.
8
A logical analysis of null hypothesis significance testing using popular terminology.使用通俗术语对零假设显著性检验进行逻辑分析。
BMC Med Res Methodol. 2022 Sep 19;22(1):244. doi: 10.1186/s12874-022-01696-5.
9
Interpretation of research data: hypothesis testing.研究数据的解读:假设检验。
Am J Hosp Pharm. 1980 Nov;37(11):1539-45.
10
The power of a statistical test. What does insignificance mean?统计检验的功效。不显著意味着什么?
Vet Surg. 1991 May-Jun;20(3):209-14. doi: 10.1111/j.1532-950x.1991.tb00336.x.

引用本文的文献

1
Classical Statistics and Statistical Learning in Imaging Neuroscience.影像神经科学中的经典统计学与统计学习
Front Neurosci. 2017 Oct 6;11:543. doi: 10.3389/fnins.2017.00543. eCollection 2017.
2
Reporting Practices and Use of Quantitative Methods in Canadian Journal Articles in Psychology.加拿大心理学领域期刊文章中的报告实践与定量方法的使用
Can Psychol. 2017 May;58(2):140-147. doi: 10.1037/cap0000074. Epub 2016 Oct 6.
3
Reducing alcohol-related aggression: Effects of a self-awareness manipulation and locus of control in heavy drinking males.
减少与酒精相关的攻击性:自我意识操控及控制点对重度饮酒男性的影响
Addict Behav. 2016 Jul;58:31-4. doi: 10.1016/j.addbeh.2016.02.010. Epub 2016 Feb 9.
4
Robust misinterpretation of confidence intervals.对置信区间的严重误解。
Psychon Bull Rev. 2014 Oct;21(5):1157-64. doi: 10.3758/s13423-013-0572-3.
5
Assessing environmentally significant effects: a better strength-of-evidence than a single P value?评估具有环境意义的影响:比单个 P 值更有力的证据吗?
Environ Monit Assess. 2014 May;186(5):2729-40. doi: 10.1007/s10661-013-3574-8. Epub 2013 Dec 20.
6
On universal common ancestry, sequence similarity, and phylogenetic structure: the sins of P-values and the virtues of Bayesian evidence.在普遍共同祖先、序列相似性和系统发育结构方面:P 值之过与贝叶斯证据之德。
Biol Direct. 2011 Nov 24;6(1):60. doi: 10.1186/1745-6150-6-60.
7
Factorial validity, reliability of assessments and prevalence of ADHD behavioural symptoms in day and residential treatment centres for children with behavioural problems.行为问题儿童日间和寄宿治疗中心的因素效度、评估的可靠性及注意缺陷多动障碍行为症状的患病率。
Int J Methods Psychiatr Res. 2002;11(1):33-44. doi: 10.1002/mpr.121.