• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大量关于功效、效应大小和样本量的假设检验的成本。

The cost of large numbers of hypothesis tests on power, effect size and sample size.

机构信息

Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94305-5723, USA.

出版信息

Mol Psychiatry. 2012 Jan;17(1):108-14. doi: 10.1038/mp.2010.117. Epub 2010 Nov 9.

DOI:10.1038/mp.2010.117
PMID:21060308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3252610/
Abstract

Advances in high-throughput biology and computer science are driving an exponential increase in the number of hypothesis tests in genomics and other scientific disciplines. Studies using current genotyping platforms frequently include a million or more tests. In addition to the monetary cost, this increase imposes a statistical cost owing to the multiple testing corrections needed to avoid large numbers of false-positive results. To safeguard against the resulting loss of power, some have suggested sample sizes on the order of tens of thousands that can be impractical for many diseases or may lower the quality of phenotypic measurements. This study examines the relationship between the number of tests on the one hand and power, detectable effect size or required sample size on the other. We show that once the number of tests is large, power can be maintained at a constant level, with comparatively small increases in the effect size or sample size. For example at the 0.05 significance level, a 13% increase in sample size is needed to maintain 80% power for ten million tests compared with one million tests, whereas a 70% increase in sample size is needed for 10 tests compared with a single test. Relative costs are less when measured by increases in the detectable effect size. We provide an interactive Excel calculator to compute power, effect size or sample size when comparing study designs or genome platforms involving different numbers of hypothesis tests. The results are reassuring in an era of extreme multiple testing.

摘要

高通量生物学和计算机科学的进步正在推动基因组学和其他科学领域中假设检验数量的指数级增长。使用当前基因分型平台的研究通常包含一百万个或更多的测试。除了货币成本之外,由于需要进行多次测试校正以避免大量假阳性结果,这种增加还会带来统计成本。为了防止由此导致的效力损失,有人建议采用数万的样本量,但对于许多疾病来说,这可能不切实际,或者可能降低表型测量的质量。本研究考察了测试数量与效力、可检测的效应大小或所需样本量之间的关系。我们表明,一旦测试数量很大,效力就可以保持在一个恒定的水平,而效应大小或样本量的相对较小的增加即可实现。例如,在 0.05 的显着性水平下,与 100 万次测试相比,需要将样本量增加 13%,才能维持 1000 万次测试的 80%效力,而与单次测试相比,10 次测试则需要将样本量增加 70%。当以可检测的效应大小的增加来衡量时,相对成本较低。我们提供了一个交互式 Excel 计算器,用于在比较涉及不同数量假设检验的研究设计或基因组平台时计算效力、效应大小或样本量。在极端多重检验的时代,这些结果令人放心。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e6/3252610/a150b350cbb3/mp2010117f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e6/3252610/5770a25a3759/mp2010117f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e6/3252610/a150b350cbb3/mp2010117f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e6/3252610/5770a25a3759/mp2010117f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95e6/3252610/a150b350cbb3/mp2010117f2.jpg

相似文献

1
The cost of large numbers of hypothesis tests on power, effect size and sample size.大量关于功效、效应大小和样本量的假设检验的成本。
Mol Psychiatry. 2012 Jan;17(1):108-14. doi: 10.1038/mp.2010.117. Epub 2010 Nov 9.
2
Subgroup analyses in randomised controlled trials: quantifying the risks of false-positives and false-negatives.随机对照试验中的亚组分析:量化假阳性和假阴性风险
Health Technol Assess. 2001;5(33):1-56. doi: 10.3310/hta5330.
3
Evaluation of genome-wide power of genetic association studies based on empirical data from the HapMap project.基于国际人类基因组单体型图计划(HapMap计划)的经验数据评估全基因组遗传关联研究的效能。
Hum Mol Genet. 2007 Oct 15;16(20):2494-505. doi: 10.1093/hmg/ddm205. Epub 2007 Jul 31.
4
Clinician's Guide to Understanding Effect Size, Alpha Level, Power, and Sample Size.临床医生理解效应大小、α水平、功效和样本量指南。
Nutr Clin Pract. 2021 Jun;36(3):598-605. doi: 10.1002/ncp.10674. Epub 2021 May 6.
5
Significance, Errors, Power, and Sample Size: The Blocking and Tackling of Statistics.意义、误差、功效和样本量:统计学中的拦阻与擒抱。
Anesth Analg. 2018 Feb;126(2):691-698. doi: 10.1213/ANE.0000000000002741.
6
Real world scenarios in rare variant association analysis: the impact of imbalance and sample size on the power in silico.真实世界中的稀有变异关联分析场景:不平衡和样本量对计算机模拟功效的影响。
BMC Bioinformatics. 2019 Jan 22;20(1):46. doi: 10.1186/s12859-018-2591-6.
7
Optimal sample sizes for Welch's test under various allocation and cost considerations.在各种分配和成本考虑下,Welch 检验的最优样本量。
Behav Res Methods. 2011 Dec;43(4):1014-22. doi: 10.3758/s13428-011-0095-7.
8
On design considerations and randomization-based inference for community intervention trials.关于社区干预试验的设计考量及基于随机化的推断
Stat Med. 1996 Jun 15;15(11):1069-92. doi: 10.1002/(SICI)1097-0258(19960615)15:11<1069::AID-SIM220>3.0.CO;2-Q.
9
Testing two variances for superiority/non-inferiority and equivalence: Using the exhaustion algorithm for sample size allocation with cost.检验两个方差的优效性/非劣效性和等效性:使用带成本的样本量分配穷举算法。
Br J Math Stat Psychol. 2020 May;73(2):316-332. doi: 10.1111/bmsp.12172. Epub 2019 Jun 12.
10
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

引用本文的文献

1
A scoping review of statistical methods used to report EORTC QLQ-C30 quality of life scores measured longitudinally.一项关于用于报告纵向测量的欧洲癌症研究与治疗组织核心生活质量问卷(EORTC QLQ-C30)生活质量评分的统计方法的范围综述。
BMC Med Res Methodol. 2025 Aug 2;25(1):188. doi: 10.1186/s12874-025-02622-1.
2
Methodological approaches and author-reported limitations in evaluation studies of digital health technologies (DHT): A scoping review of DHT interventions for cancer, diabetes mellitus, and cardiovascular diseases.数字健康技术(DHT)评估研究中的方法学途径及作者报告的局限性:对癌症、糖尿病和心血管疾病的DHT干预措施的范围综述
PLOS Digit Health. 2025 Apr 24;4(4):e0000806. doi: 10.1371/journal.pdig.0000806. eCollection 2025 Apr.
3

本文引用的文献

1
So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests.这么多相关的测试,时间却这么少!多个相关测试的 P 值的快速调整。
Am J Hum Genet. 2007 Dec;81(6):1158-68. doi: 10.1086/522036.
2
Gene expression profile of the PDAPP mouse model for Alzheimer's disease with and without Apolipoprotein E.
Neurobiol Aging. 2009 Apr;30(4):574-90. doi: 10.1016/j.neurobiolaging.2007.08.006. Epub 2007 Sep 29.
3
Statistical false positive or true disease pathway?统计假阳性还是真正的疾病途径?
The effect of oxytocin, sublingual, and intrauterine misoprostol on blood loss in cesarean delivery: A randomized clinical trial.
缩宫素、舌下含服米索前列醇及宫腔内使用米索前列醇对剖宫产术中失血的影响:一项随机临床试验。
Eur J Obstet Gynecol Reprod Biol X. 2025 Feb 3;25:100369. doi: 10.1016/j.eurox.2025.100369. eCollection 2025 Mar.
4
Effects of Baduanjin exercise on cancer-related fatigue in patients with prostate cancer treated with androgen deprivation therapy in Shanghai, China: a study protocol for a randomised controlled trial.八段锦锻炼对中国上海接受雄激素剥夺治疗的前列腺癌患者癌因性疲乏的影响:一项随机对照试验研究方案。
BMJ Open. 2024 Nov 13;14(11):e092363. doi: 10.1136/bmjopen-2024-092363.
5
Improving rigor and reproducibility in western blot experiments with the blotRig analysis.通过blotRig分析提高蛋白质印迹实验的严谨性和可重复性。
Sci Rep. 2024 Sep 17;14(1):21644. doi: 10.1038/s41598-024-70096-0.
6
Cytokine- and Vascular Endothelial Growth Factor-Related Gene-Based Genome-Wide Association Study of Low-Dose Ketamine Infusion in Patients with Treatment-Resistant Depression.细胞因子和血管内皮生长因子相关基因的全基因组关联研究:低剂量氯胺酮输注治疗难治性抑郁症。
CNS Drugs. 2023 Mar;37(3):243-253. doi: 10.1007/s40263-023-00989-7. Epub 2023 Feb 10.
7
Lifestyle's influence on community-dwelling older adults' health: A mixed-methods study design.生活方式对社区居住老年人健康的影响:一项混合方法研究设计。
Contemp Clin Trials Commun. 2020 Dec 13;21:100687. doi: 10.1016/j.conctc.2020.100687. eCollection 2021 Mar.
8
A P-value model for theoretical power analysis and its applications in multiple testing procedures.用于理论功效分析的P值模型及其在多重检验程序中的应用。
BMC Med Res Methodol. 2016 Oct 10;16(1):135. doi: 10.1186/s12874-016-0233-0.
9
Paired proteomics, transcriptomics and miRNomics in non-small cell lung cancers: known and novel signaling cascades.非小细胞肺癌中的配对蛋白质组学、转录组学和微小RNA组学:已知和新的信号级联反应
Oncotarget. 2016 Nov 1;7(44):71514-71525. doi: 10.18632/oncotarget.11723.
10
P-values in genomics: apparent precision masks high uncertainty.基因组学中的P值:表面的精确掩盖了高度的不确定性。
Mol Psychiatry. 2014 Dec;19(12):1336-40. doi: 10.1038/mp.2013.184. Epub 2014 Jan 14.
Nat Genet. 2006 Jul;38(7):731-3. doi: 10.1038/ng0706-731.
4
When to combine hypotheses and adjust for multiple tests.何时合并假设并针对多重检验进行调整。
Health Serv Res. 2006 Jun;41(3 Pt 1):804-18. doi: 10.1111/j.1475-6773.2006.00512.x.
5
Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix.使用相关矩阵的特征值在多位点分析中调整多重检验。
Heredity (Edinb). 2005 Sep;95(3):221-7. doi: 10.1038/sj.hdy.6800717.
6
Genome-wide association studies: theoretical and practical concerns.全基因组关联研究:理论与实际问题
Nat Rev Genet. 2005 Feb;6(2):109-18. doi: 10.1038/nrg1522.
7
A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other.对处于连锁不平衡状态的单核苷酸多态性进行多重检验的一种简单校正方法。
Am J Hum Genet. 2004 Apr;74(4):765-9. doi: 10.1086/383251. Epub 2004 Mar 2.
8
Statistical significance for genomewide studies.全基因组研究的统计学显著性
Proc Natl Acad Sci U S A. 2003 Aug 5;100(16):9440-5. doi: 10.1073/pnas.1530509100. Epub 2003 Jul 25.
9
Thresholding of statistical maps in functional neuroimaging using the false discovery rate.使用错误发现率对功能神经成像中的统计地图进行阈值处理。
Neuroimage. 2002 Apr;15(4):870-8. doi: 10.1006/nimg.2001.1037.
10
Controlling the false discovery rate in behavior genetics research.控制行为遗传学研究中的错误发现率
Behav Brain Res. 2001 Nov 1;125(1-2):279-84. doi: 10.1016/s0166-4328(01)00297-2.