• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

P 值排舞:音乐何时停止?

The P Value Line Dance: When Does the Music Stop?

机构信息

Department of Health, Medicine and Caring Sciences, Division of Society and Health, Linköping, Sweden.

出版信息

J Med Internet Res. 2020 Aug 27;22(8):e21345. doi: 10.2196/21345.

DOI:10.2196/21345
PMID:32852275
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7484773/
Abstract

When should a trial stop? Such a seemingly innocent question evokes concerns of type I and II errors among those who believe that certainty can be the product of uncertainty and among researchers who have been told that they need to carefully calculate sample sizes, consider multiplicity, and not spend P values on interim analyses. However, the endeavor to dichotomize evidence into significant and nonsignificant has led to the basic driving force of science, namely uncertainty, to take a back seat. In this viewpoint we discuss that if testing the null hypothesis is the ultimate goal of science, then we need not worry about writing protocols, consider ethics, apply for funding, or run any experiments at all-all null hypotheses will be rejected at some point-everything has an effect. The job of science should be to unearth the uncertainties of the effects of treatments, not to test their difference from zero. We also show the fickleness of P values, how they may one day point to statistically significant results; and after a few more participants have been recruited, the once statistically significant effect suddenly disappears. We show plots which we hope would intuitively highlight that all assessments of evidence will fluctuate over time. Finally, we discuss the remedy in the form of Bayesian methods, where uncertainty leads; and which allows for continuous decision making to stop or continue recruitment, as new data from a trial is accumulated.

摘要

何时应该停止试验?这个看似简单的问题,却引发了那些认为不确定性可以产生确定性的人,以及那些被教导需要仔细计算样本量、考虑多重性、不要在中期分析中使用 P 值的研究人员对 I 类和 II 类错误的担忧。然而,将证据二分法为有意义和无意义的努力,导致了科学的基本驱动力,即不确定性,退居次要地位。在这篇观点文章中,我们讨论了如果检验零假设是科学的最终目标,那么我们就不必担心编写方案、考虑伦理、申请资金或进行任何实验——所有的零假设都会在某个时候被拒绝——一切都有影响。科学的工作应该是揭示治疗效果的不确定性,而不是检验其与零的差异。我们还展示了 P 值的多变性,它们如何有一天会指向具有统计学意义的结果;在招募了更多的参与者之后,曾经具有统计学意义的效果突然消失了。我们展示了一些图表,希望这些图表能够直观地说明,所有对证据的评估都会随着时间的推移而波动。最后,我们讨论了以贝叶斯方法为形式的补救措施,其中不确定性占主导地位;并且允许根据试验的新数据进行持续的决策,以停止或继续招募。

相似文献

1
The P Value Line Dance: When Does the Music Stop?P 值排舞:音乐何时停止?
J Med Internet Res. 2020 Aug 27;22(8):e21345. doi: 10.2196/21345.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
4
How many do I need? Basic principles of sample size estimation.我需要多少?样本量估计的基本原则。
J Adv Nurs. 2004 Aug;47(3):297-302. doi: 10.1111/j.1365-2648.2004.03093.x.
5
Inappropriate use of statistical power.统计功效的误用。
Bone Marrow Transplant. 2023 May;58(5):474-477. doi: 10.1038/s41409-023-01935-3. Epub 2023 Mar 3.
6
Ethical pitfalls in neonatal comparative effectiveness trials.新生儿比较疗效试验中的伦理陷阱。
Neonatology. 2014;105(4):350-1. doi: 10.1159/000360650. Epub 2014 May 30.
7
Significance, Errors, Power, and Sample Size: The Blocking and Tackling of Statistics.意义、误差、功效和样本量:统计学中的拦阻与擒抱。
Anesth Analg. 2018 Feb;126(2):691-698. doi: 10.1213/ANE.0000000000002741.
8
Safety and Efficacy of Imatinib for Hospitalized Adults with COVID-19: A structured summary of a study protocol for a randomised controlled trial.COVID-19 住院成人患者使用伊马替尼的安全性和疗效:一项随机对照试验研究方案的结构化总结。
Trials. 2020 Oct 28;21(1):897. doi: 10.1186/s13063-020-04819-9.
9
Bayesian sample size determination under hypothesis tests.贝叶斯假设检验下的样本量确定。
Contemp Clin Trials. 2011 May;32(3):393-8. doi: 10.1016/j.cct.2010.12.012. Epub 2011 Jan 1.
10
Bayesian evaluation of informative hypotheses in cluster-randomized trials.贝叶斯评价在整群随机试验中信息性假设。
Behav Res Methods. 2019 Feb;51(1):126-137. doi: 10.3758/s13428-018-1149-x.

引用本文的文献

1
Bayesian interim analysis and efficiency of phase III randomized trials.贝叶斯期中分析与III期随机试验的效率
Br J Cancer. 2025 Aug 11. doi: 10.1038/s41416-025-03156-5.
2
Effects of a drinking motives and readiness to change tailored digital alcohol intervention among online help-seekers: protocol for a randomised controlled trial.饮酒动机及改变意愿对在线求助者量身定制的数字酒精干预效果:一项随机对照试验方案
BMJ Open. 2025 Jul 11;15(7):e100532. doi: 10.1136/bmjopen-2025-100532.
3
The effectiveness of a drink-counting and a breathalyser-coupled smartphone application for reduced heavy drinking among alcohol-dependent adults in Sweden: A randomised controlled trial.

本文引用的文献

1
A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.洛匹那韦-利托那韦治疗成人重症 COVID-19 患者的临床试验。
N Engl J Med. 2020 May 7;382(19):1787-1799. doi: 10.1056/NEJMoa2001282. Epub 2020 Mar 18.
2
mHealth smoking cessation intervention among high school students: 3-month primary outcome findings from a randomized controlled trial.**题目**:高中生的 mHealth 戒烟干预:一项随机对照试验的 3 个月主要结局发现 **解析**:题目中,“mHealth”是指“移动医疗”,是指通过移动通信技术和互联网技术来提供医疗服务和信息;“smoking cessation”是指“戒烟”;“intervention”是指“干预”;“high school students”是指“高中生”;“randomized controlled trial”是指“随机对照试验”。因此,题目可以直译为“高中生的 mHealth 戒烟干预:一项随机对照试验的 3 个月主要结局发现”。
PLoS One. 2020 Mar 6;15(3):e0229411. doi: 10.1371/journal.pone.0229411. eCollection 2020.
3
一款用于减少瑞典酒精依赖成年人重度饮酒的计饮量与呼气酒精含量测定仪联用智能手机应用程序的有效性:一项随机对照试验。
Addiction. 2025 May;120(5):905-918. doi: 10.1111/add.16769. Epub 2025 Jan 16.
4
Effects of a text messaging smoking cessation intervention amongst online help-seekers and primary health care visitors: findings from a randomised controlled trial.短信戒烟干预对在线求助者和基层医疗保健访问者的影响:一项随机对照试验的结果。
BMC Med. 2023 Oct 4;21(1):382. doi: 10.1186/s12916-023-03073-5.
5
Study protocol: the pragmatic, exploratory DELTA-JU trial of the group-based multimodal DELTA intervention for abstinent adolescents with substance use disorders living in youth welfare institutions.研究方案:针对生活在青少年福利机构中患有物质使用障碍的戒断青少年的基于小组的多模式DELTA干预措施的务实、探索性DELTA-JU试验。
Front Psychiatry. 2023 May 26;14:1025347. doi: 10.3389/fpsyt.2023.1025347. eCollection 2023.
6
The effect of question order on outcomes in the orbital core outcome set for alcohol brief interventions among online help-seekers (QOBCOS): Findings from a randomised factorial trial.问题顺序对在线求助者酒精简短干预的眼眶核心结局集(QOBCOS)结果的影响:一项随机析因试验的结果
Digit Health. 2023 Feb 12;9:20552076231155684. doi: 10.1177/20552076231155684. eCollection 2023 Jan-Dec.
7
Avoiding Under- and Overrecruitment in Behavioral Intervention Trials Using Bayesian Sequential Designs: Tutorial.避免贝叶斯序贯设计在行为干预试验中的低招募和高招募问题:教程。
J Med Internet Res. 2022 Dec 16;24(12):e40730. doi: 10.2196/40730.
8
Digital multiple health behaviour change intervention targeting online help seekers: protocol for the COACH randomised factorial trial.针对在线求助者的数字多重健康行为改变干预措施:COACH 随机析因试验方案。
BMJ Open. 2022 Jul 26;12(7):e061024. doi: 10.1136/bmjopen-2022-061024.
9
Effectiveness of a digital intervention versus alcohol information for online help-seekers in Sweden: a randomised controlled trial.数字干预与酒精信息对瑞典在线求助者的有效性:一项随机对照试验。
BMC Med. 2022 May 17;20(1):176. doi: 10.1186/s12916-022-02374-5.
10
Six-Month Outcomes from the NEXit Junior Trial of a Text Messaging Smoking Cessation Intervention for High School Students: Randomized Controlled Trial With Bayesian Analysis.NEXit 青少年短信戒烟干预试验的 6 个月结果:采用贝叶斯分析的随机对照试验
JMIR Mhealth Uhealth. 2021 Oct 21;9(10):e29913. doi: 10.2196/29913.
An Electronic Screening and Brief Intervention for Hazardous and Harmful Drinking Among Swedish University Students: Reanalysis of Findings From a Randomized Controlled Trial Using a Bayesian Framework.瑞典大学生有害及危险饮酒的电子筛查与简短干预:使用贝叶斯框架对一项随机对照试验结果的重新分析
J Med Internet Res. 2019 Dec 17;21(12):e14420. doi: 10.2196/14420.
4
Electronic Screening for Alcohol Use and Brief Intervention by Email for University Students: Reanalysis of Findings From a Randomized Controlled Trial Using a Bayesian Framework.大学生酒精使用情况的电子筛查及通过电子邮件进行简短干预:使用贝叶斯框架对一项随机对照试验结果的重新分析
J Med Internet Res. 2019 Nov 7;21(11):e14419. doi: 10.2196/14419.
5
Evaluating the replicability of social science experiments in Nature and Science between 2010 and 2015.评估 2010 年至 2015 年期间《自然》和《科学》杂志上社会科学实验的可重复性。
Nat Hum Behav. 2018 Sep;2(9):637-644. doi: 10.1038/s41562-018-0399-z. Epub 2018 Aug 27.
6
Reducing Alcohol Consumption Among Risky Drinkers in the General Population of Sweden Using an Interactive Mobile Health Intervention: Protocol for a Randomized Controlled Trial.使用交互式移动健康干预措施减少瑞典普通人群中高危饮酒者的酒精摄入量:一项随机对照试验方案
JMIR Res Protoc. 2019 Apr 18;8(4):e13119. doi: 10.2196/13119.
7
Remove, rather than redefine, statistical significance.去除而非重新定义统计学显著性。
Nat Hum Behav. 2018 Jan;2(1):4. doi: 10.1038/s41562-017-0224-0.
8
Redefine statistical significance.重新定义统计学显著性。
Nat Hum Behav. 2018 Jan;2(1):6-10. doi: 10.1038/s41562-017-0189-z.
9
mHealth smoking cessation intervention among high-school pupils (NEXit Junior): study protocol for a randomized controlled trial.针对高中生的移动健康戒烟干预措施(NEXit Junior):一项随机对照试验的研究方案
Trials. 2018 Nov 22;19(1):648. doi: 10.1186/s13063-018-3028-2.
10
A Gentle Introduction to the Comparison Between Null Hypothesis Testing and Bayesian Analysis: Reanalysis of Two Randomized Controlled Trials.零假设检验与贝叶斯分析比较的简要介绍:两项随机对照试验的重新分析
J Med Internet Res. 2018 Oct 24;20(10):e10873. doi: 10.2196/10873.