可以用预期误差成本来证明在多个 α 水平上检验假设是合理的，而不是寻找难以捉摸的最优 α 吗？

Can expected error costs justify testing a hypothesis at multiple alpha levels rather than searching for an elusive optimal alpha?

机构信息

Complexity Science, Meraglim Holdings Corporation, Palm Beach Gardens, FL, United States of America.

出版信息

PLoS One. 2024 Sep 25;19(9):e0304675. doi: 10.1371/journal.pone.0304675. eCollection 2024.

DOI:10.1371/journal.pone.0304675

PMID:39321172

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11424007/

Abstract

Simultaneous testing of one hypothesis at multiple alpha levels can be performed within a conventional Neyman-Pearson framework. This is achieved by treating the hypothesis as a family of hypotheses, each member of which explicitly concerns test level as well as effect size. Such testing encourages researchers to think about error rates and strength of evidence in both the statistical design and reporting stages of a study. Here, we show that these multi-alpha level tests can deliver acceptable expected total error costs. We first present formulas for expected error costs from single alpha and multiple alpha level tests, given prior probabilities of effect sizes that have either dichotomous or continuous distributions. Error costs are tied to decisions, with different decisions assumed for each of the potential outcomes in the multi-alpha level case. Expected total costs for tests at single and multiple alpha levels are then compared with optimal costs. This comparison highlights how sensitive optimization is to estimated error costs and to assumptions about prevalence. Testing at multiple default thresholds removes the need to formally identify decisions, or to model costs and prevalence as required in optimization approaches. Although total expected error costs with this approach will not be optimal, our results suggest they may be lower, on average, than when "optimal" test levels are based on mis-specified models.

摘要

在传统的 Neyman-Pearson 框架内，可以对多个α水平下的一个假设进行同时检验。这可以通过将假设视为一个假设族来实现，其中每个成员都明确涉及检验水平和效应大小。这种检验方法鼓励研究人员在研究的统计设计和报告阶段考虑错误率和证据强度。在这里，我们表明这些多α水平检验可以提供可接受的预期总误差成本。我们首先给出了基于效应大小的先验概率具有二项分布或连续分布时，单个α和多个α水平检验的误差成本公式。误差成本与决策相关联，在多α水平情况下的每个潜在结果都假设了不同的决策。然后将单α和多α水平检验的预期总费用与最优费用进行比较。这种比较突出了优化对估计的误差成本和对流行率的假设的敏感性。在多个默认阈值下进行检验，无需正式确定决策，也无需在优化方法中对成本和流行率进行建模。尽管这种方法的总预期误差成本不会是最优的，但我们的结果表明，与基于错误指定模型的“最优”检验水平相比，它们的平均值可能更低。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3019/11424007/49ada2d3b7c4/pone.0304675.g001.jpg

相似文献

Can expected error costs justify testing a hypothesis at multiple alpha levels rather than searching for an elusive optimal alpha?可以用预期误差成本来证明在多个 α 水平上检验假设是合理的，而不是寻找难以捉摸的最优 α 吗？

PLoS One. 2024 Sep 25;19(9):e0304675. doi: 10.1371/journal.pone.0304675. eCollection 2024.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Setting an optimal α that minimizes errors in null hypothesis significance tests.设置一个最优的α，使零假设显著性检验中的错误最小化。

PLoS One. 2012;7(2):e32734. doi: 10.1371/journal.pone.0032734. Epub 2012 Feb 28.

Waldian t tests: Sequential Bayesian t tests with controlled error probabilities.瓦尔德检验：具有受控误差概率的序贯贝叶斯 t 检验。

Psychol Methods. 2024 Feb;29(1):99-116. doi: 10.1037/met0000492. Epub 2022 Apr 14.

Errors in Statistical Inference Under Model Misspecification: Evidence, Hypothesis Testing, and AIC.模型设定错误下统计推断中的误差：证据、假设检验与赤池信息准则

Front Ecol Evol. 2019;7. doi: 10.3389/fevo.2019.00372. Epub 2019 Oct 21.

Effectiveness and cost-effectiveness of four different strategies for SARS-CoV-2 surveillance in the general population (CoV-Surv Study): a structured summary of a study protocol for a cluster-randomised, two-factorial controlled trial.在普通人群中进行 SARS-CoV-2 监测的四种不同策略的有效性和成本效益（CoV-Surv 研究）：一项关于集群随机、双因素对照试验的研究方案的结构化总结。

Trials. 2021 Jan 8;22(1):39. doi: 10.1186/s13063-020-04982-z.

An alternative foundation for the planning and evaluation of linkage analysis. II. Implications for multiple test adjustments.连锁分析规划与评估的另一种基础。II. 多重检验校正的影响

Hum Hered. 2006;61(4):200-9. doi: 10.1159/000094775. Epub 2006 Jul 27.

Statistical Significance统计学显著性

Negative consequences of using α = 0.05 for environmental monitoring decisions: a case study from a decade of Canada's Environmental Effects Monitoring Program.使用 α = 0.05 进行环境监测决策的负面后果：来自加拿大环境影响监测计划十年的案例研究。

Environ Sci Technol. 2012 Sep 4;46(17):9249-55. doi: 10.1021/es301320n. Epub 2012 Aug 17.

Why and how we should join the shift from significance testing to estimation.我们为何以及如何应该从显著性检验转向估计。

J Evol Biol. 2022 Jun;35(6):777-787. doi: 10.1111/jeb.14009. Epub 2022 May 18.

本文引用的文献

Effects of a therapeutic weight loss diet on weight loss and metabolic health in overweight and obese dogs.超重和肥胖犬的减肥饮食对体重减轻和代谢健康的影响。

J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad183.

Are most published research findings false in a continuous universe?在一个连续的宇宙中，大多数已发表的研究结果都是错误的吗？

PLoS One. 2022 Dec 20;17(12):e0277935. doi: 10.1371/journal.pone.0277935. eCollection 2022.

Cost-Effectiveness Analysis of Molnupiravir Versus Best Supportive Care for the Treatment of Outpatient COVID-19 in Adults in the US.莫努匹韦与最佳支持治疗用于美国成人门诊 COVID-19 治疗的成本-效果分析。

Pharmacoeconomics. 2022 Jul;40(7):699-714. doi: 10.1007/s40273-022-01168-0. Epub 2022 Jul 2.

Analysis goals, error-cost sensitivity, and analysis hacking: Essential considerations in hypothesis testing and multiple comparisons.分析目标、误差成本敏感性与分析操纵：假设检验和多重比较中的重要考量因素

Paediatr Perinat Epidemiol. 2021 Jan;35(1):8-23. doi: 10.1111/ppe.12711. Epub 2020 Dec 2.

Semantic and cognitive tools to aid statistical science: replace confidence and significance by compatibility and surprise.辅助统计科学的语义和认知工具：用兼容性和惊奇取代置信度和显著性。

BMC Med Res Methodol. 2020 Sep 30;20(1):244. doi: 10.1186/s12874-020-01105-9.

Science is not a signal detection problem.科学不是一个信号检测问题。

Proc Natl Acad Sci U S A. 2020 Mar 17;117(11):5559-5567. doi: 10.1073/pnas.1914237117. Epub 2020 Mar 3.

The quest for an optimal alpha.追求最优阿尔法。

PLoS One. 2019 Jan 2;14(1):e0208631. doi: 10.1371/journal.pone.0208631. eCollection 2019.

Success of a weight loss plan for overweight dogs: The results of an international weight loss study.超重犬减肥计划的成效：一项国际减肥研究的结果

PLoS One. 2017 Sep 8;12(9):e0184199. doi: 10.1371/journal.pone.0184199. eCollection 2017.

Optimizing Research Payoff.优化研究回报。

Perspect Psychol Sci. 2016 Sep;11(5):664-691. doi: 10.1177/1745691616649170.

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations.统计检验、P 值、置信区间与检验效能：误解指南

Eur J Epidemiol. 2016 Apr;31(4):337-50. doi: 10.1007/s10654-016-0149-3. Epub 2016 May 21.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

可以用预期误差成本来证明在多个 α 水平上检验假设是合理的，而不是寻找难以捉摸的最优 α 吗？

Can expected error costs justify testing a hypothesis at multiple alpha levels rather than searching for an elusive optimal alpha?

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献