• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

初步试验的统计学解释:是否应重新考虑显著性阈值?

The statistical interpretation of pilot trials: should significance thresholds be reconsidered?

机构信息

Medical Statistics Group, School of Health and Related Research (ScHARR), University of Sheffield, 30 Regent Street, Sheffield S1 4DA, UK.

出版信息

BMC Med Res Methodol. 2014 Mar 20;14:41. doi: 10.1186/1471-2288-14-41.

DOI:10.1186/1471-2288-14-41
PMID:24650044
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3994566/
Abstract

BACKGROUND

In an evaluation of a new health technology, a pilot trial may be undertaken prior to a trial that makes a definitive assessment of benefit. The objective of pilot studies is to provide sufficient evidence that a larger definitive trial can be undertaken and, at times, to provide a preliminary assessment of benefit.

METHODS

We describe significance thresholds, confidence intervals and surrogate markers in the context of pilot studies and how Bayesian methods can be used in pilot trials. We use a worked example to illustrate the issues raised.

RESULTS

We show how significance levels other than the traditional 5% should be considered to provide preliminary evidence for efficacy and how estimation and confidence intervals should be the focus to provide an estimated range of possible treatment effects. We also illustrate how Bayesian methods could also assist in the early assessment of a health technology.

CONCLUSIONS

We recommend that in pilot trials the focus should be on descriptive statistics and estimation, using confidence intervals, rather than formal hypothesis testing and that confidence intervals other than 95% confidence intervals, such as 85% or 75%, be used for the estimation. The confidence interval should then be interpreted with regards to the minimum clinically important difference. We also recommend that Bayesian methods be used to assist in the interpretation of pilot trials. Surrogate endpoints can also be used in pilot trials but they must reliably predict the overall effect on the clinical outcome.

摘要

背景

在评估一项新的卫生技术时,可能会在进行明确评估效益的试验之前进行试点研究。试点研究的目的是提供足够的证据,证明可以进行更大规模的确定性试验,有时还可以对效益进行初步评估。

方法

我们在试点研究的背景下描述了显著性阈值、置信区间和替代指标,以及贝叶斯方法如何在试点试验中使用。我们使用一个实例来说明所提出的问题。

结果

我们展示了如何考虑传统的 5%以外的显著性水平来提供疗效的初步证据,以及如何将估计和置信区间作为重点,提供可能的治疗效果的估计范围。我们还说明了贝叶斯方法如何有助于早期评估卫生技术。

结论

我们建议在试点研究中,重点应放在描述性统计和估计上,使用置信区间,而不是正式的假设检验,并且应该使用 85%或 75%等除 95%置信区间以外的置信区间进行估计。然后,应根据最小临床重要差异来解释置信区间。我们还建议使用贝叶斯方法来协助解释试点研究。替代终点也可以在试点研究中使用,但它们必须可靠地预测对临床结局的总体影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f103/3994566/052f6677d1f1/1471-2288-14-41-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f103/3994566/290606c83a60/1471-2288-14-41-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f103/3994566/052f6677d1f1/1471-2288-14-41-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f103/3994566/290606c83a60/1471-2288-14-41-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f103/3994566/052f6677d1f1/1471-2288-14-41-2.jpg

相似文献

1
The statistical interpretation of pilot trials: should significance thresholds be reconsidered?初步试验的统计学解释:是否应重新考虑显著性阈值?
BMC Med Res Methodol. 2014 Mar 20;14:41. doi: 10.1186/1471-2288-14-41.
2
Estimation issues in clinical trials and overviews.临床试验及综述中的估计问题。
Stat Med. 1990 Jun;9(6):657-71. doi: 10.1002/sim.4780090612.
3
The thresholds for statistical and clinical significance - a five-step procedure for evaluation of intervention effects in randomised clinical trials.统计和临床意义的阈值——评价随机临床试验中干预效果的五步程序。
BMC Med Res Methodol. 2014 Mar 4;14:34. doi: 10.1186/1471-2288-14-34.
4
Applying Bayesian ideas in drug development and clinical trials.贝叶斯思想在药物研发与临床试验中的应用。
Stat Med. 1993 Aug;12(15-16):1501-11; discussion 1513-7. doi: 10.1002/sim.4780121516.
5
Interpreting the results of clinical trials, embracing uncertainty: A Bayesian approach.解读临床试验结果,拥抱不确定性:贝叶斯方法。
Acta Anaesthesiol Scand. 2021 Feb;65(2):146-150. doi: 10.1111/aas.13725. Epub 2020 Oct 24.
6
Frequentist, Bayesian Analysis and Complementary Statistical Tools for Geriatric and Rehabilitation Fields: Are Traditional Null-Hypothesis Significance Testing Methods Sufficient?老年医学和康复领域的频率派、贝叶斯分析和补充统计工具:传统的零假设显著性检验方法是否足够?
Clin Interv Aging. 2024 Feb 16;19:277-287. doi: 10.2147/CIA.S441799. eCollection 2024.
7
Bayesian methods for pilot studies.贝叶斯方法在初步研究中的应用。
Clin Trials. 2020 Aug;17(4):414-419. doi: 10.1177/1740774520914306. Epub 2020 Apr 16.
8
Comparing methods for calculating confidence intervals for vaccine efficacy.比较计算疫苗效力置信区间的方法。
Stat Med. 1996;15(21-22):2379-92. doi: 10.1002/(SICI)1097-0258(19961115)15:21<2379::AID-SIM457>3.0.CO;2-L.
9
Bayesian statistics in medical research: an intuitive alternative to conventional data analysis.医学研究中的贝叶斯统计学:传统数据分析的直观替代方法。
J Eval Clin Pract. 2000 May;6(2):193-204. doi: 10.1046/j.1365-2753.2000.00216.x.
10
Significance testing, interval estimation or Bayesian inference: comments to "Extracting a maximum of useful information from statistical research data" by S. Sohlberg and G. Andersson.显著性检验、区间估计或贝叶斯推断:对S. 索尔贝里和G. 安德森所著《从统计研究数据中提取最大有用信息》的评论
Scand J Psychol. 2005 Feb;46(1):79-82. doi: 10.1111/j.1467-9450.2005.00437.x.

引用本文的文献

1
Structural brain improvements following individually tailored serious exergame-based training in mild neurocognitive disorders: exploratory randomized controlled trial.轻度神经认知障碍患者基于个性化定制的严肃健身游戏训练后的大脑结构改善:探索性随机对照试验
Alzheimers Res Ther. 2025 Sep 8;17(1):190. doi: 10.1186/s13195-025-01835-2.
2
Feasibility, acceptability, and exploratory outcomes of a virtual cognitive behavioural therapy-based group intervention for persistent fatigue in endometriosis: The Managing Fatigue in Endometriosis (MEND) programme.基于虚拟认知行为疗法的针对子宫内膜异位症患者持续性疲劳的团体干预的可行性、可接受性及探索性结果:子宫内膜异位症疲劳管理(MEND)项目
Br J Health Psychol. 2025 Sep;30(3):e70014. doi: 10.1111/bjhp.70014.
3

本文引用的文献

1
A randomised controlled pilot study of standardised counselling and cost-free pharmacotherapy for smoking cessation among stroke and TIA patients.一项针对中风和短暂性脑缺血发作(TIA)患者戒烟的标准化咨询和免费药物治疗的随机对照试验研究。
BMJ Open. 2011 Nov 28;1(2):e000366. doi: 10.1136/bmjopen-2011-000366. Print 2011.
2
Optimal sample sizes for phase II clinical trials and pilot studies.二期临床试验和初步研究的最佳样本量。
Stat Med. 2012 May 20;31(11-12):1031-42. doi: 10.1002/sim.4357. Epub 2011 Nov 3.
3
Designing clinical trials for assessing the effects of cognitive training and physical activity interventions on cognitive outcomes: the Seniors Health and Activity Research Program Pilot (SHARP-P) study, a randomized controlled trial.
Memory dependent modulation of hippocampal theta power through frontoparietal phase synchronous brain stimulation.通过额顶叶相位同步脑刺激对海马θ波功率进行记忆依赖性调制。
Sci Rep. 2025 Jul 21;15(1):26504. doi: 10.1038/s41598-025-09841-y.
4
Open label placebo to treat fatigue in people with multiple sclerosis: feasibility and preliminary effects.开放标签安慰剂治疗多发性硬化症患者疲劳:可行性及初步效果
Pilot Feasibility Stud. 2025 Jul 3;11(1):93. doi: 10.1186/s40814-025-01674-w.
5
Effects of a 5-Day Back Squat Overreaching Protocol on Strength Performance, Perceived Recovery and Wellness Responses: A Pilot Trial.为期5天的后深蹲过度训练方案对力量表现、恢复感知及健康反应的影响:一项初步试验
J Funct Morphol Kinesiol. 2025 Jun 13;10(2):227. doi: 10.3390/jfmk10020227.
6
The Brain in Cross-Cultural Adjustment: A Pilot Study of Japanese Expatriates Living in the United States.跨文化适应中的大脑:对生活在美国的日本侨民的一项初步研究。
Brain Sci. 2025 Jun 7;15(6):617. doi: 10.3390/brainsci15060617.
7
Effectiveness of an eye movement desensitization and reprocessing intervention for the prevention of post- traumatic symptoms in perinatal loss: a randomized pilot controlled trial.眼动脱敏与再加工干预对预防围产期丧亲创伤后症状的有效性:一项随机对照试验。
Front Psychiatry. 2025 Jun 9;16:1593306. doi: 10.3389/fpsyt.2025.1593306. eCollection 2025.
8
A pilot study to evaluate the feasibility and acceptability of a mobile phone counselling intervention for perinatal women living with HIV and depression in India.一项评估针对印度感染艾滋病毒且患有抑郁症的围产期妇女的手机咨询干预措施的可行性和可接受性的试点研究。
Oxf Open Digit Health. 2025 May 21;3:oqaf012. doi: 10.1093/oodh/oqaf012. eCollection 2025.
9
Using the Q-method to explore pregnant and breastfeeding women's perceptions of antibiotic use in Mahajanga, Northwest Madagascar.运用Q方法探究马达加斯加西北部马哈赞加地区孕妇和哺乳期妇女对抗生素使用的认知。
BMC Public Health. 2025 May 24;25(1):1922. doi: 10.1186/s12889-025-23140-1.
10
HOPE for Hypermobile Ehlers-Danlos Syndrome (hEDS) and Hypermobility Spectrum Disorder (HSD)-A Pilot Randomised Controlled Trial of Feasibility, Acceptability and Appropriateness.针对可活动过度型埃勒斯-当洛综合征(hEDS)和活动过度谱系障碍(HSD)的希望——一项关于可行性、可接受性和适宜性的试点随机对照试验
Eur J Pain. 2025 Jul;29(6):e70030. doi: 10.1002/ejp.70030.
设计临床试验评估认知训练和身体活动干预对认知结果的影响:老年人健康和活动研究计划试点 (SHARP-P) 研究,一项随机对照试验。
BMC Geriatr. 2011 May 26;11:27. doi: 10.1186/1471-2318-11-27.
4
A guide to the design and analysis of small clinical studies.小型临床研究的设计与分析指南。
Pharm Stat. 2011 Jul-Aug;10(4):363-8. doi: 10.1002/pst.477. Epub 2010 Dec 8.
5
What is a pilot or feasibility study? A review of current practice and editorial policy.什么是试点或可行性研究?对当前实践和社论政策的回顾。
BMC Med Res Methodol. 2010 Jul 16;10:67. doi: 10.1186/1471-2288-10-67.
6
A tutorial on pilot studies: the what, why and how.关于预试验的教程:是什么、为什么以及怎么做。
BMC Med Res Methodol. 2010 Jan 6;10:1. doi: 10.1186/1471-2288-10-1.
7
Consultants' forum: should post hoc sample size calculations be done?顾问论坛:是否应该进行事后样本量计算?
Pharm Stat. 2009 Apr-Jun;8(2):163-9. doi: 10.1002/pst.334.
8
Sample size and the probability of a successful trial.样本量与试验成功的概率。
Pharm Stat. 2006 Oct-Dec;5(4):305-9. doi: 10.1002/pst.232.
9
Design and analysis of pilot studies: recommendations for good practice.初步研究的设计与分析:良好实践建议
J Eval Clin Pract. 2004 May;10(2):307-12. doi: 10.1111/j..2002.384.doc.x.
10
Considerations in the evaluation of surrogate endpoints in clinical trials. summary of a National Institutes of Health workshop.临床试验中替代终点评估的考量。国立卫生研究院研讨会总结
Control Clin Trials. 2001 Oct;22(5):485-502. doi: 10.1016/s0197-2456(01)00153-2.