• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于在公共卫生中正确使用频率学派推断统计学。

For a proper use of frequentist inferential statistics in public health.

作者信息

Rovetta Alessandro, Mansournia Mohammad Ali, Vitale Alessandro

机构信息

Research & Disclosure, R&C Research, Bovezzo (BS), Italy.

Department of Epidemiology and Biostatistics, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran.

出版信息

Glob Epidemiol. 2024 Jun 15;8:100151. doi: 10.1016/j.gloepi.2024.100151. eCollection 2024 Dec.

DOI:10.1016/j.gloepi.2024.100151
PMID:39021384
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11252774/
Abstract

As widely noted in the literature and by international bodies such as the American Statistical Association, severe misinterpretations of -values, confidence intervals, and statistical significance are sadly common in public health. This scenario poses serious risks concerning terminal decisions such as the approval or rejection of therapies. Cognitive distortions about statistics likely stem from poor teaching in schools and universities, overly simplified interpretations, and - as we suggest - the reckless use of calculation software with predefined standardized procedures. In light of this, we present a framework to recalibrate the role of frequentist-inferential statistics within clinical and epidemiological research. In particular, we stress that statistics is only a set of rules and numbers that make sense only when properly placed within a well-defined scientific context beforehand. Practical examples are discussed for educational purposes. Alongside this, we propose some tools to better evaluate statistical outcomes, such as multiple compatibility or surprisal intervals or tuples of various point hypotheses. Lastly, we emphasize that every conclusion must be informed by different kinds of scientific evidence (e.g., biochemical, clinical, statistical, etc.) and must be based on a careful examination of costs, risks, and benefits.

摘要

正如文献以及美国统计协会等国际机构广泛指出的那样,在公共卫生领域,对p值、置信区间和统计显著性的严重误解令人遗憾地普遍存在。这种情况给诸如批准或拒绝治疗等最终决策带来了严重风险。对统计学的认知扭曲可能源于中小学和大学教学质量不佳、过于简化的解释,以及——如我们所指出的——对具有预定义标准化程序的计算软件的轻率使用。有鉴于此,我们提出了一个框架,以重新校准频率推断统计在临床和流行病学研究中的作用。特别是,我们强调统计学只是一组规则和数字,只有事先恰当地置于明确界定的科学背景中才有意义。为了教学目的,我们讨论了一些实际例子。与此同时,我们提出了一些工具,以更好地评估统计结果,例如多重兼容性或意外区间或各种点假设的元组。最后,我们强调,每一个结论都必须以不同类型的科学证据(如生化、临床、统计等)为依据,并且必须基于对成本、风险和收益的仔细审查。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8e0/11252774/d4316c082b61/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8e0/11252774/ae6a4833f27a/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8e0/11252774/e725234f9f19/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8e0/11252774/d4316c082b61/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8e0/11252774/ae6a4833f27a/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8e0/11252774/e725234f9f19/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8e0/11252774/d4316c082b61/gr3.jpg

相似文献

1
For a proper use of frequentist inferential statistics in public health.关于在公共卫生中正确使用频率学派推断统计学。
Glob Epidemiol. 2024 Jun 15;8:100151. doi: 10.1016/j.gloepi.2024.100151. eCollection 2024 Dec.
2
Semantic and cognitive tools to aid statistical science: replace confidence and significance by compatibility and surprise.辅助统计科学的语义和认知工具:用兼容性和惊奇取代置信度和显著性。
BMC Med Res Methodol. 2020 Sep 30;20(1):244. doi: 10.1186/s12874-020-01105-9.
3
To curb research misreporting, replace significance and confidence by compatibility: A Preventive Medicine Golden Jubilee article.为了遏制研究报告错误,用兼容性替代显著性和置信度:预防医学金禧文章。
Prev Med. 2022 Nov;164:107127. doi: 10.1016/j.ypmed.2022.107127. Epub 2022 Jul 3.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Statistical hypothesis testing and common misinterpretations: Should we abandon p-value in forensic science applications?统计假设检验及常见误解:在法医学应用中我们应该摒弃p值吗?
Forensic Sci Int. 2016 Feb;259:e32-6. doi: 10.1016/j.forsciint.2015.11.013. Epub 2015 Dec 12.
6
Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations.统计检验、P 值、置信区间与检验效能:误解指南
Eur J Epidemiol. 2016 Apr;31(4):337-50. doi: 10.1007/s10654-016-0149-3. Epub 2016 May 21.
7
Multiple Confidence Intervals and Surprisal Intervals to Avoid Significance Fallacy.避免显著性谬误的多个置信区间和意外区间。
Cureus. 2024 Jan 9;16(1):e51964. doi: 10.7759/cureus.51964. eCollection 2024 Jan.
8
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
9
Inappropriate use of statistical power.统计功效的误用。
Bone Marrow Transplant. 2023 May;58(5):474-477. doi: 10.1038/s41409-023-01935-3. Epub 2023 Mar 3.
10
Clinical significance not statistical significance: a simple Bayesian alternative to p values.临床意义而非统计学意义:一种替代p值的简单贝叶斯方法。
J Epidemiol Community Health. 1998 May;52(5):318-23. doi: 10.1136/jech.52.5.318.

引用本文的文献

1
Sex-based differences in survival after liver transplantation for colorectal cancer liver metastases: A multivariable analysis.结直肠癌肝转移患者肝移植术后生存的性别差异:一项多变量分析
JHEP Rep. 2025 Jun 27;7(10):101505. doi: 10.1016/j.jhepr.2025.101505. eCollection 2025 Oct.
2
Why is p-Value Controversial?为什么p值存在争议?
Cardiovasc Intervent Radiol. 2025 Jul 22. doi: 10.1007/s00270-025-04139-y.
3
Hydroxychloroquine use during the first COVID-19 wave: a case study highlighting the urgent need to enhance research practices within the publication ecosystem.

本文引用的文献

1
Recommendations for accurate reporting in medical research statistics.医学研究统计中准确报告的建议。
Lancet. 2024 Feb 17;403(10427):611-612. doi: 10.1016/S0140-6736(24)00139-9.
2
Multiple Confidence Intervals and Surprisal Intervals to Avoid Significance Fallacy.避免显著性谬误的多个置信区间和意外区间。
Cureus. 2024 Jan 9;16(1):e51964. doi: 10.7759/cureus.51964. eCollection 2024 Jan.
3
Dialling back 'impact' claims: researchers should not be compelled to make policy claims based on single studies.淡化“影响”主张:不应强迫研究人员基于单一研究提出政策主张。
新冠疫情第一波期间羟氯喹的使用:一项案例研究,凸显了在出版生态系统内加强研究实践的迫切需求。
Arch Public Health. 2025 Apr 27;83(1):115. doi: 10.1186/s13690-025-01596-2.
4
p-Values and confidence intervals as compatibility measures: guidelines for interpreting statistical studies in clinical research.作为兼容性度量的P值和置信区间:临床研究中统计研究解读指南
Lancet Reg Health Southeast Asia. 2025 Jan 28;33:100534. doi: 10.1016/j.lansea.2025.100534. eCollection 2025 Feb.
5
Interaction between opium use and cigarette smoking on bladder cancer: An inverse probability weighting approach based on a multicenter case-control study in Iran.阿片类药物使用与吸烟对膀胱癌的相互作用:基于伊朗一项多中心病例对照研究的逆概率加权法
Glob Epidemiol. 2024 Dec 30;9:100182. doi: 10.1016/j.gloepi.2024.100182. eCollection 2025 Jun.
6
A Very Low-Carbohydrate Program in Adults With Metabolic Dysfunction-Associated Steatotic Liver Disease and Phospholipase Domain-Containing Protein 3 Risk Genotype: Pre-Post Intervention Study.针对患有代谢功能障碍相关脂肪性肝病且携带含磷脂酶结构域蛋白3风险基因型的成年人的极低碳水化合物计划:干预前后研究。
JMIR Form Res. 2025 Jan 10;9:e60051. doi: 10.2196/60051.
7
P>0.05 Is Good: The NORD-h Protocol for Several Hypothesis Analysis Based on Known Risks, Costs, and Benefits.P>0.05 是有益的:基于已知风险、成本和效益的几种假设分析的NORD-h方案。
J Prev Med Public Health. 2024 Nov;57(6):511-520. doi: 10.3961/jpmph.24.250. Epub 2024 Sep 20.
Int J Epidemiol. 2024 Feb 1;53(1). doi: 10.1093/ije/dyad181.
4
-value, compatibility, and S-value.价值、兼容性和S值。
Glob Epidemiol. 2022 Sep 12;4:100085. doi: 10.1016/j.gloepi.2022.100085. eCollection 2022 Dec.
5
To curb research misreporting, replace significance and confidence by compatibility: A Preventive Medicine Golden Jubilee article.为了遏制研究报告错误,用兼容性替代显著性和置信度:预防医学金禧文章。
Prev Med. 2022 Nov;164:107127. doi: 10.1016/j.ypmed.2022.107127. Epub 2022 Jul 3.
6
Rewriting results in the language of compatibility.以兼容性语言进行重写会产生结果。
Trends Ecol Evol. 2022 Jul;37(7):567-568. doi: 10.1016/j.tree.2022.02.001. Epub 2022 Feb 25.
7
Rewriting results sections in the language of evidence.改写结果部分的证据语言。
Trends Ecol Evol. 2022 Mar;37(3):203-210. doi: 10.1016/j.tree.2021.10.009. Epub 2021 Nov 16.
8
Reproducibility: expect less of the scientific paper.可重复性:对科学论文的期望要降低。
Nature. 2021 Sep;597(7876):329-331. doi: 10.1038/d41586-021-02486-7.
9
Users' Guide to Medical Decision Analysis.《医学决策分析使用指南》。
Mayo Clin Proc. 2021 Aug;96(8):2205-2217. doi: 10.1016/j.mayocp.2021.02.003. Epub 2021 Jul 3.
10
A CHecklist for statistical Assessment of Medical Papers (the CHAMP statement): explanation and elaboration.用于医学论文统计评估的清单(CHAMP 声明):解释和说明。
Br J Sports Med. 2021 Sep;55(18):1009-1017. doi: 10.1136/bjsports-2020-103652. Epub 2021 Jan 29.