Suppr超能文献

统计检验中的无意义解读及其对健康风险评估的影响。

Null misinterpretation in statistical testing and its impact on health risk assessment.

机构信息

Department of Epidemiology, University of California Los Angeles, CA, USA.

出版信息

Prev Med. 2011 Oct;53(4-5):225-8. doi: 10.1016/j.ypmed.2011.08.010. Epub 2011 Aug 17.

Abstract

Statistical methods play a pivotal role in health risk assessment, but not always an enlightened one. Problems well known to academics are frequently overlooked in crucial nonacademic venues such as litigation, even though those venues can have profound impacts on population health and medical practice. Statisticians have focused heavily on how statistical significance overstates evidence against null hypotheses, but less on how statistical nonsignificance does not correspond to evidence for the null. I thus present an example of a highly credentialed statistical expert conflating high "nonsignificance" with strong support for the null, via misinterpretation of a P-value as a posterior probability of the null hypothesis. Reverse-Bayes analyses reveal that nearly all the support for the null claimed by the expert must have come from the expert's prior, rather than the data, there being no background data that could support a strong prior. The example illustrates how inattention to the actual meaning of P-values and confidence limits allow extremely biased prior opinions (including null-spiked opinions) to be presented as if they were objective inferences from the data.

摘要

统计方法在健康风险评估中起着关键作用,但并不总是明智的。在诉讼等非学术关键场所,学术界熟知的问题经常被忽视,尽管这些场所会对人群健康和医疗实践产生深远影响。统计学家高度关注统计显著性如何夸大了对零假设的证据,但对统计不显著性如何不对应于零假设的证据关注较少。因此,我举了一个例子,一位非常有声望的统计专家通过将 P 值误解为零假设后验概率,将高“不显著性”与对零假设的有力支持混为一谈。逆贝叶斯分析表明,专家声称的几乎所有对零假设的支持都必须来自专家的先验,而不是数据,因为没有背景数据可以支持强有力的先验。该示例说明了对 P 值和置信区间实际含义的不关注如何允许极其有偏见的先验意见(包括零假设峰值意见)被呈现为好像它们是从数据中得出的客观推论。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验