False-positive and false-negative risks for individual multicentre trials in critical care.

Author information

Sidebotham David, Barlow C Jake

Affiliations

Department of Anaesthesia, Auckland City Hospital, Auckland, New Zealand.

Cardiothoracic and Vascular Intensive Care Unit, Auckland City Hospital, Auckland, New Zealand.

Publication information

BJA Open. 2022 Mar 1;1:100003. doi: 10.1016/j.bjao.2022.100003. eCollection 2022 Mar.

DOI: 10.1016/j.bjao.2022.100003

PMID: 37588693

Original link: https://pmc.ncbi.nlm.nih.gov/articles/PMC10430847/

Abstract

BACKGROUND

In medical research, null hypothesis significance testing (NHST) is the dominant framework for statistical inference. NHST involves calculating P-values and confidence intervals to quantify the evidence against the null hypothesis of no effect. However, P-values and confidence intervals cannot tell us the probability that the hypothesis is true. In contrast, false-positive risk (FPR) and false-negative risk (FNR) are post-test probabilities concerning the truth of the hypothesis, that is to say, the probability a real effect exists.

METHODS

We calculated the FPR or FNR for 53 individual multicentre trials in critical care based on a pretest probability of 0.5 that the hypothesis was true.
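
To illustrate how a post-test probability follows from a pretest probability, the sketch below applies Bayes' theorem in odds form. It assumes the evidence from a trial is summarised as a Bayes factor BF10 (the likelihood of the data under a real effect divided by its likelihood under the null hypothesis). The abstract does not state how the evidence was quantified, so the function names and the Bayes-factor input are assumptions for illustration, not the authors' method.

```python
def post_test_probability(prior_h1: float, bf10: float) -> float:
    """Return P(real effect | data) via Bayes' theorem in odds form.

    prior_h1: pretest probability that a real effect exists (0 < prior_h1 < 1).
    bf10:     assumed Bayes factor, P(data | effect) / P(data | no effect).
    """
    if not 0.0 < prior_h1 < 1.0:
        raise ValueError("prior_h1 must lie strictly between 0 and 1")
    if bf10 <= 0.0:
        raise ValueError("bf10 must be positive")
    prior_odds = prior_h1 / (1.0 - prior_h1)
    posterior_odds = prior_odds * bf10
    return posterior_odds / (1.0 + posterior_odds)


def false_positive_risk(prior_h1: float, bf10: float) -> float:
    """For a significant result: probability that no real effect exists."""
    return 1.0 - post_test_probability(prior_h1, bf10)


def false_negative_risk(prior_h1: float, bf10: float) -> float:
    """For a non-significant result: probability that a real effect exists."""
    return post_test_probability(prior_h1, bf10)


if __name__ == "__main__":
    # Hypothetical Bayes factors, chosen only to show the arithmetic.
    print(f"FPR with BF10 = 9:   {false_positive_risk(0.5, 9.0):.3f}")
    print(f"FNR with BF10 = 1/3: {false_negative_risk(0.5, 1.0 / 3.0):.3f}")
```

With a pretest probability of 0.5 the prior odds equal 1, so the FPR reduces to 1 / (1 + BF10) and the FNR to BF10 / (1 + BF10).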

RESULTS

For trials reporting statistical significance, the FPR varied between 0.1% and 57.6%. For trials reporting non-significance, the FNR varied between 1.7% and 36.9%. Twenty-six of 47 trials (55.3%) reporting non-significance provided strong or very strong evidence in favour of the null hypothesis; the remaining trials provided limited evidence. There was no obvious relationship between the P-value and the FNR.

CONCLUSIONS

The FPR and FNR showed marked variability, indicating that the probability of a real or absent treatment effect differed substantially between trials. Only one trial reporting statistical significance provided convincing evidence of a real treatment effect, and nearly half of all trials reporting non-significance provided limited evidence for the absence of a treatment effect. Our findings suggest that the quality of evidence from multicentre trials in critical care is highly variable.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d9f/10430847/4d8f86afd1a7/gr1.jpg
