Predictive power of statistical significance.

Authors

Heston Thomas F, King Jackson M

Affiliations

Department of Family Medicine, University of Washington, Seattle, WA 98195-6340, United States.

Department of Medical Education and Clinical Sciences, Elson S. Floyd College of Medicine, Washington State University, Spokane, WA 99210-1495, United States.

Publication information

World J Methodol. 2017 Dec 26;7(4):112-116. doi: 10.5662/wjm.v7.i4.112.

DOI: 10.5662/wjm.v7.i4.112
PMID: 29354483
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5746664/
Abstract

A statistically significant research finding should not be defined as a P-value of 0.05 or less, because this definition does not take into account study power. Statistical significance was originally defined by Fisher RA as a P-value of 0.05 or less. According to Fisher, any finding that is likely to occur by random variation no more than 1 in 20 times is considered significant. Neyman J and Pearson ES subsequently argued that Fisher's definition was incomplete. They proposed that statistical significance could only be determined by analyzing the chance of incorrectly concluding that a study finding was significant (a Type I error) or incorrectly concluding that a study finding was insignificant (a Type II error). Their definition of statistical significance is also incomplete because the error rates are considered separately, not together. A better definition of statistical significance is the positive predictive value of a P-value, which is equal to the power divided by the sum of the power and the P-value. This definition is more complete and relevant than Fisher's or Neyman-Pearson's definitions, because it takes into account both concepts of statistical significance. Using this definition, a statistically significant finding requires a P-value of 0.05 or less when the power is at least 95%, and a P-value of 0.032 or less when the power is 60%. To achieve statistical significance, P-values must be adjusted downward as the study power decreases.
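
The relationship stated in the abstract can be sketched in a few lines of Python. The function names and the 0.95 target are assumptions made for illustration: the abstract gives only the formula (power divided by the sum of power and P-value) and two example thresholds, and a target predictive value of 0.95 is inferred here because it is consistent with both examples.

```python
def positive_predictive_value(power: float, p_value: float) -> float:
    """Positive predictive value as described in the abstract: power / (power + P)."""
    if not 0.0 < power <= 1.0:
        raise ValueError("power must be in (0, 1]")
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("p_value must be in [0, 1]")
    return power / (power + p_value)


def max_significant_p(power: float, target_ppv: float = 0.95) -> float:
    """Largest P-value whose predictive value still reaches target_ppv.

    Obtained by solving power / (power + P) >= target_ppv for P.
    The default target of 0.95 is an assumption inferred from the
    abstract's examples, not a value the abstract states.
    """
    if not 0.0 < power <= 1.0:
        raise ValueError("power must be in (0, 1]")
    if not 0.0 < target_ppv < 1.0:
        raise ValueError("target_ppv must be in (0, 1)")
    return power * (1.0 - target_ppv) / target_ppv


if __name__ == "__main__":
    # Print the P-value threshold for a few illustrative power levels.
    for power in (0.95, 0.80, 0.60):
        print(f"power={power:.2f}  max P={max_significant_p(power):.3f}")
```

Under this assumed target, the threshold works out to about 0.050 at 95% power and about 0.032 at 60% power, matching the two cases quoted in the abstract, and it shrinks in proportion to power.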

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feaf/5746664/d691f6a366d1/WJM-7-112-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/feaf/5746664/b61d508a94e3/WJM-7-112-g001.jpg
